In-line near-infrared analysis of milk coupled with machine learning methods for the daily prediction of blood metabolic profile in dairy cattle

Giannuzzi, Diana; Mota, Lucio Flavio Macedo; Pegolo, Sara; Gallo, Luigi; Schiavon, Stefano; Tagliapietra, Franco; Katz, Gil; Fainboym, David; Minuti, Andrea; Trevisi, Erminio; Cecchinato, Alessio

doi:10.1038/s41598-022-11799-0

Download PDF

Article
Open access
Published: 16 May 2022

In-line near-infrared analysis of milk coupled with machine learning methods for the daily prediction of blood metabolic profile in dairy cattle

Diana Giannuzzi¹,
Lucio Flavio Macedo Mota¹,
Sara Pegolo¹,
Luigi Gallo¹,
Stefano Schiavon¹,
Franco Tagliapietra¹,
Gil Katz²,
David Fainboym²,
Andrea Minuti³,
Erminio Trevisi³ &
…
Alessio Cecchinato¹

Scientific Reports volume 12, Article number: 8058 (2022) Cite this article

4078 Accesses
10 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Precision livestock farming technologies are used to monitor animal health and welfare parameters continuously and in real time in order to optimize nutrition and productivity and to detect health issues at an early stage. The possibility of predicting blood metabolites from milk samples obtained during routine milking by means of infrared spectroscopy has become increasingly attractive. We developed, for the first time, prediction equations for a set of blood metabolites using diverse machine learning methods and milk near-infrared spectra collected by the AfiLab instrument. Our dataset was obtained from 385 Holstein Friesian dairy cows. Stacking ensemble and multi-layer feedforward artificial neural network outperformed the other machine learning methods tested, with a reduction in the root mean square error of between 3 and 6% in most blood parameters. We obtained moderate correlations (r) between the observed and predicted phenotypes for γ-glutamyl transferase (r = 0.58), alkaline phosphatase (0.54), haptoglobin (0.66), globulins (0.61), total reactive oxygen metabolites (0.60) and thiol groups (0.57). The AfiLab instrument has strong potential but may not yet be ready to predict the metabolic stress of dairy cows in practice. Further research is needed to find out methods that allow an improvement in accuracy of prediction equations.

Relationship of the blood metabolome to subsequent carcass traits at slaughter in feedlot Wagyu crossbred steers

Article Open access 22 October 2019

Differences in the serum metabolome profile of dairy cows according to the BHB concentration revealed by proton nuclear magnetic resonance spectroscopy (1H-NMR)

Article Open access 15 February 2022

Changes in the blood metabolome of Wagyu crossbred steers with time in the feedlot and relationships with marbling

Article Open access 04 November 2020

Introduction

Health monitoring of dairy herds is pivotal to improving their health and welfare and to attain greater efficiency and sustainability in farming. Nutritional imbalances, dietary deficiencies or improper management, especially during the transition and early lactation phases, can generate a range of health disorders, which are generally categorized as metabolic diseases, and include ketosis, hepatic lipidosis, hypocalcemia and hypomagnesemia¹. Typically, as milk production increases and herds become larger, the incidence of metabolic diseases increases, becoming a major source of economic losses^2,3.

Blood biochemistry is commonly used as part of a diagnostic evaluation to confirm the suspected disease, assess prognosis, control the progression of disease, and appraise the effectiveness of treatments⁴. On the other hand, analysis of blood biochemical constituents, known as metabolic profiling, is a well-established test that, in association with animal, diet and management assessments, helps determine disease risk in clinically healthy dairy cows, rather than diagnose disease^4,5. Serum metabolic profiling is of particular interest for identifying subclinical disorders, which are highly prevalent and have serious consequences for the cows’ welfare and production levels^6,7,8. Detection of individuals with alterations to their blood parameters could allow early nutritional or management interventions to be delivered to avert the onset of overt affections. Anyhow, blood sampling remains a labor-intensive, invasive procedure that causes the cows distress⁹. Furthermore, laboratory analyses of comprehensive metabolic panels are costly and time-consuming^10,11, limiting their application at the farm level. In contrast, collecting milk samples is noninvasive and can be easily incorporated into standard milking procedures. Moreover, milk composition reflects the metabolic status of cows, in that deviations from metabolic homeostasis are manifested by alterations in milk composition^12,13. Infrared spectroscopy uses infrared light to scan milk samples and detect the presence of specific chemical bonds, producing an absorption profile comprising the absorbance values for individual infrared light wavenumbers associated with milk components¹⁴. The use of milk infrared spectroscopy as an alternative source of information on animal health status has therefore become increasingly attractive^12,15. Indeed, the use of minimally invasive sensors and technology able to collect a large amount of data in order to understand and predict the status of animals is fundamental to improving sustainable production systems and precision livestock farming¹⁶.

In dairy herds, Fourier-transform mid-infrared (2500 to 25,000 nm; FTIR) spectra of milk are widely used as a cost-effective means of assessing raw milk composition¹⁷. Diverse studies have pointed out the ability of FTIR in predicting metabolites dissolved in milk, such as acetone, β-hydroxybutyric acid (BHBA) and citrate¹⁸. More ambitiously, in recent years numerous efforts have been made to indirectly predict blood metabolites, such as glucose, IGFs, non-esterified fatty acids (NEFA), BHBA, urea, proteins and minerals using milk FTIR spectra at the individual cow level^8,9,19. Although the milk FTIR predictions in these studies did not provide precise quantification of blood components, they nonetheless highlighted the potential of FTIR analysis to screen for high or low values of individual blood metabolites or combinations of them as metabolic status indicators. However, FTIR spectroscopy requires a large laboratory, so that while milk samples can be taken from each cow once or twice per month for herd level analyses, to date the technology is not suitable for on-farm daily management.

Near-infrared (350 to 2500 nm; NIR) spectra, on the other hand, appear to be just as reliable as FTIR spectra for analyzing raw milk composition and cheese-making traits^20,21,22. The visible to low-NIR range (350 to 1000 nm) seems to be suitable for automated in-line analysis of milk, as the optical sensors are less expensive²³. Other advantages include rapidity, simplicity, simultaneous nondestructive measurements, lower sensitivity to environmental noise, and greater robustness compared with far-NIR (1000 to 2500 nm) and FTIR measurements^24,25. Moreover, fine milk components such as fatty acids, that are known to represent a fingerprint of the cow’s nutritional and metabolic status, have been reported to be satisfactorily predictable using NIR instruments²⁶. AfiLab (Afimilk, Kibbutz Afikim, Israel) is an in-line NIR spectrometric milk analyzer that can be installed in the milking parlor to provide real-time measurements during milking sessions²⁷. As pointed out in a recent review¹⁵, the possibility of predicting blood components from the milk of individual cows with a continuous system could be an important step forward in evaluating the health status of cows and identifying individual susceptibility to metabolic disorders.

To the best of our knowledge, no equations to indirectly predict blood parameters from milk NIR spectra have ever been developed. Previous studies on phenotype prediction of milk traits from milk NIR spectra have mainly used partial least squares (PLS) regression models^23,28. However, several new machine learning methods, including deep and ensemble learning can now be applied, providing greater flexibility in modeling complex associations and improving prediction accuracy compared with PLS methods^29,30,31.

The objectives of the present study were twofold. First, we assessed the feasibility of using daily milk NIR spectra, through the AfiLab real-time milk analyzer, to predict a broad set of 29 hematochemical parameters, such as metabolites related to energy metabolism, liver function/hepatic damage, oxidative stress, inflammation/innate immunity, and minerals in clinically healthy, high-yielding Holstein Friesian dairy cows. Second, we explored the inner structure of the NIR spectra predictions, identifying for each hematochemical indicator the best prediction model and the relevance of individual wavelengths in explaining the variations in all the traits of concern.

Results and discussion

Blood metabolic profile of cows

Descriptive statistics of blood metabolite concentrations within our cohort are reported in Table 1. As all the cows involved in the study were clinically healthy, the range of variability in the data is representative of a physiological condition. Although the cows did not manifest clinical disease, the high variability in some blood biomarkers means we cannot exclude the presence in certain individuals of subclinical conditions, an expected finding in a large population³².

Table 1 Descriptive statistics for hematochemical parameters in all cows involved in the study.

Full size table

Looking at the major blood indicators of metabolic impairment in dairy cows and their well-documented thresholds, we observed a certain degree of alteration in serum proteins, with 11% of cows presenting with elevated globulins concentrations (> 50 g/l), and 2% with low albumin concentrations (< 30 g/l)³³. We found 1.5% of cows (n = 6) with BHBA concentrations equal to or greater than 1.2 mmol/l³⁴, whereas fewer than 1% of cows (n = 3) had elevated NEFA concentrations, in accordance with a threshold of ≥ 0.70 mmol/l³⁵. Of the six hyperketotic cows, one had a concurrent elevated concentration of NEFA. The low prevalence of hyperketonemia may be due to the fact that the cows in our study ranged across all lactation stages (between 3 and 504 days in milk [DIM]), with only 1.5% (n = 6) in the first 5d postpartum, when the peak incidence of hyperketonemia occurs⁷. Regarding urea concentration, 43% of cows exceeded the threshold of ≥ 6.78 mmol/l³⁶. The level of blood urea reflects the effects of dietary intake of crude protein and its digestive utilization, milk protein secretion, body protein turnover and nitrogen (N) urinary excretion³⁷. High-producing dairy cattle are commonly fed diets with crude protein levels exceeding 16% to ensure maximum milk output³⁸, as was the case in the study herd (i.e. 16.5% dietary crude protein with an average milk yield of 33.9 kg/days). The significance of high values of blood or milk urea in high-yielding cows is controversial; several studies reported negative effects on the conception rate in dairy cows^39,40, while others found no such effects^41,42. Nevertheless, if protein fed to ruminants exceeds microbial needs, it degrades to ammonia, is absorbed into the blood, converted to urea in the liver and excreted in the urine. Impairment of the N balance may result in laminitis and metabolic dysfunction⁴³. Conversely, no animals had serum urea concentrations below the optimal range (< 1.7 mmol/l)⁴⁴. Less than 1% of cows were suspected of having hypomagnesemia (< 0.70 mmol/l) or hypocalcemia (< 2.0 mmol/l)⁴⁵.

Model performances

Regarding the performances of the machine learning models, the prediction accuracy obtained through random 10-10-fold cross-validation (CV) was greater than through PLS regression for all the blood traits except for BHBA, which had a correlation coefficient (r) of 0.63 with PLS. In fact, the r values for the PLS regression predictions were < 0.25 for 23 of the 29 blood traits, making them unsuitable for further analysis (data not in Table). It is worth noting that we applied a completely independent CV (random or leave-one-batch-out), so none of the records in the training sets was present in the validation sets for all folds generated. Applying the leave-one-batch-out CV, where each batch is constituted by a different sampling date, accuracy dropped by between 5 and 75% compared with 10-fold CV for all traits except for globulins and hematocrit, where accuracy increased slightly, by 3% and 9%, respectively (Table 2). This increases the intrinsic source variability in blood metabolite measurements, which are subject to critical fluctuations in relation to physiological variations, environmental conditions, sampling procedure and timing^46,47. The leave-one-batch-out CV technique reflects a more conservative scenario, where there is less dependence between the training and validation sets, which reduces the predictive ability of the models compared with the 10-fold scheme^48,49. It is worth nothing that in our dataset there is no batch dependence bias, so the different batches may under or overperform the prediction depending on the blood metabolites.

Table 2 Near-infrared AfiLab milk prediction performance considering the systematic effect of days in milk and parity through different cross-validation scheme for hematochemical parameters using machine learning.

Full size table

Because no previous studies have been conducted on the prediction of blood parameters from NIR spectra captured from milk using the AfiLab system, and due to the peculiarity of the data structure of each blood trait, we applied an automatic machine learning (autoML) method that used different penalized regression models (ridge regression [RR], least absolute shrinkage and selection operator [LASSO], elastic net [EN]), machine learning techniques (random forest [RF], gradient boosting machine [GBM], and multi-layer feedforward artificial neural network [ANN]), and stacking ensemble model, a combination of them to find the best model without assuming any prior knowledge. Of the different machine learning methods tested, the stacking ensemble and ANN were those with the highest predictive ability for most of the blood metabolite-related traits, as shown in Table 3 and Supplementary Table S1. Indeed, we obtained a reduction in the root mean square error (RMSE) of between 3 and 6% for both the stacking ensemble and ANN compared with the other models tested. Moreover, in the models that outperformed the others, there was consistency between the two CV schemes in all blood metabolites, with the exception of advanced oxidation protein products (AOPP) and minerals (Supplementary Table S1). On the one hand, stacking uses a learn-to-learn approach, whereby different algorithms are used as base learners, and the models with greater prediction accuracy are combined in a final prediction model with appropriate penalization, which provides more accurate predictions⁵⁰. On the other hand, ANN has the ability to self-learn relationships from labeled data and generalize to unlabeled conditions, producing high performances on low variability traits⁵¹. A recent study tested ANN and found its performance to be accurate in predicting BHBA from milk FTIR spectra³⁰.

Table 3 Best machine learning approaches through different cross-validation (CV) scheme for hematochemical parameters expressed as frequencies (how many times each model is the best model for a given CV scheme for each hematochemical parameter prediction).

Full size table

As pointed out by Eskildsen et al.⁵², the predictive ability of a spectrum equation can be affected by individual factors (e.g. breed, parity, DIM) and herd-related factors (e.g. feed, management). Indeed, the inclusion of DIM and parity effects has been reported to enhance the robustness of prediction equations^22,53. Here, to examine the potential of NIR analysis, we built prediction equations on the basis of a single breed and a single herd, and included in the models the main sources of variation in lactating cows (i.e. DIM and parity), as these data are provided by the management software of the Afimilk system installed in the farm. It is worth noting that, unlike other studies that used mid-infrared devices within milk recording schemes, in our study we developed—for the first time—tailored NIR AfiLab prediction equations that could be used to check blood indicators on a daily basis at the farm level without the need for milk sampling or any type of processing, making the detection of individuals with putative metabolic dysfunctions faster and easier and enabling timely intervention.

Relationships between light emitting diodes and blood traits

The AfiLab spectrophotometer consists of an array of light emitting diodes (LEDs) at 32 discreet wavelengths^54,55. As milk is a complex fluid, being a suspension of emulsified butterfat globules and casein micelles in a water-based solution, the dynamic interactions between light and matter in it are highly nonlinear. Overall, we found low correlations between each LED and the blood metabolites (r < 0.50); the red regime, green regime, and water absorption NIR lines (LEDs 7, 10 and 16) appeared to contain the most informative wavelengths for blood traits (Fig. 1). As expected, we observed agreement in LED relationships between specific and derived blood traits (i.e. globulins from total proteins).

Predictive ability, meaning and biological link with milk of all blood traits

We explored which—if any—of the blood metabolites in a metabolic profile could be reliably predicted from milk NIR spectra. The fitting statistics of the prediction models of blood metabolites are reported in Table 2. The predictive ability of a spectrum equation is trait dependent, and it relies on the covariance structures between the investigated traits and milk composition⁵². As our findings show, blood traits that are well known to be secreted in the udder obtained the highest r values (Fig. 2). Since this is the first attempt to predict a wide range of serum metabolites from milk NIR spectra using the AfiLab system, we are unable to draw parallels with previous literature.

Among the energy-related metabolites, glucose, total cholesterol, and urea attained an r higher than 0.6 and BHBA 0.54. For this category of traits, the most informative LED was the 16 (Supplementary Fig. S1). A recent study on milk NIR spectra obtained greater accuracy (r = 0.71) than we did in the prediction of urea, but the reference values in that study were collected on the same matrix as the predicted values (milk urea)⁵⁶. To our knowledge, BHBA and NEFA have not previously been estimated from milk NIR spectra, but when predicted from milk FTIR spectra, they exhibited high (r > 0.70) prediction accuracies, leading to this technique being proposed as a monitoring tool for detecting hyperketonemia^30,57,58. In the current study, NEFA prediction equations had weak reliability; the unsatisfactory estimation was probably due to poor variability in the samples and the non-normal distribution of the data, which included a very low proportion of high values. Moreover, logarithm transformation did not significantly improve the predictive ability (data not in Table). For BHBA, we obtained a moderate r value of CV with both the random (0.54) and leave-one-batch-out (0.50) schemes, but the low number of samples with a BHBA higher than 1.2 mmol/l means our equations were unable to correctly categorize ketotic animals. A higher number of hyperketotic cows therefore need to be included in the analysis to confirm the potential usefulness of our equations in screening dairy cows for subclinical ketosis. However, given the nature of circulant BHBA, which drops after feeding⁵⁹, even if the equations are poorly calibrated two or three milking sessions a day might be a better way of screening for subclinical ketosis in cows than daily blood sampling at random times.

Within the category of blood indicators of liver function and hepatic damage, albumin and γ-glutamyl transferase (GGT) had an r of 0.58, and alkaline phosphatase (ALP) an r of 0.54. Again, LED 7 and LED 10 were the wavelengths with the greatest explanatory capacity (Supplementary Fig. S2). Hepatic enzymes are physiologically present in cows’ milk as a result of spontaneous diffusion of low molecular weight enzymes from plasma or active secretion by the mammary gland epithelium, or they are released after disruption of somatic cells, most often leukocytes. In the case of GGT and ALP, their activity has been reported to be much greater in milk than in blood serum, and they not only correlate with increased plasma activity, but also with stage of lactation, seasonality, milk yield, and mammary gland health^60,61. In any case, a relationship between energy metabolism and aspartate aminotransferase- glutamate oxaloacetate transaminase (AST), ALP and GGT activity in the mammary gland has been shown, with a strong correlation between blood and milk hepatic enzymes, and with mild degrees of hepatic lesions showing an increase in enzymes in both matrices^61,62,63. Therefore, the potential to properly predict hepatic enzymes from milk on a daily basis deserves further investigation.

Blood indicators of oxidative stress were found to be reasonably predictable from milk NIR spectra, with total reactive oxygen metabolites (ROMt), ferric reducing antioxidant power (FRAP) and thiol groups (SHp) showing r values between 0.57 and 0.60. The LED 25 was the most informative for this group of traits (Supplementary Fig. S3). Parameters that measure oxidative stress conditions have not previously been predicted using infrared spectroscopy techniques. Plasma levels of ROMt indicate free radical production; conversely, total SHp of plasma are considered a significant element in the extracellular antioxidant defense system against oxidative stress and can be used to describe antioxidant potential in dairy cows⁶⁴.Total reactive oxygen metabolites and SHp are present in milk⁶⁵, and SHp can be detected by NIR spectra wavelengths⁶⁶. Information on the relationship between the blood levels of oxidant/antioxidant metabolites and milk concentrations is scarce, but suggests there is a relationship between oxidative status and innate immune status in blood and milk^67,68. The ability to predict the oxidative status of cows from milk NIR spectra might shed light on the overall stress conditions of each animal, but further research is needed to understand whether the levels of oxidative stress metabolites in milk are representative of blood concentrations.

With regard to blood indicators of inflammation and innate immunity, prediction accuracies were low for ceruloplasmin and myeloperoxidase, but were better for haptoglobin, total proteins and globulins (r = 0.66, 0.57, and 0.61, respectively). The LEDs providing most information on these traits are 7, 10, 13 and 16 (Supplementary Fig. S4). Haptoglobin is an acute-phase protein and is a sensitive indicator of local or systemic inflammation, which can occur prior to the development of typical signs of inflammation (e.g. inflammatory leucogram)⁶⁹. It is known to be diffused from blood into the milk, but it is also produced endogenously by milk leukocytes and epithelial cells during mammary gland processes⁷⁰. For these reasons, haptoglobin has been proposed has an inflammatory indicator in dairy cows⁷⁰. Animals with signs of clinical or subclinical mastitis were excluded from this study, so the levels of haptoglobin detected might be suggestive of blood diffusion. Further studies including greater numbers of individuals might lead to predicted milk haptoglobin being used as an indicator of the metabolic status of dairy cows without ongoing mammary gland disorders. Total proteins, globulins and albumin in blood, on the other hand, are indicators of both physiological (e.g. age, breed, parity, stage of lactation, nutrition, climate, season) and pathological variations in dairy cows^71,72. The presence of blood-derived proteins in milk is well documented, although the relationships in active and passive transfers are complex⁷³. Notably, repeated measures have been highlighted as a successful strategy for correctly evaluating their concentrations⁷¹. As such, the potential to accurately evaluate serum protein with a continuous system, avoiding invasive, time-consuming blood sampling, becomes much more relevant.

Among blood minerals, only sodium and zinc appeared to be predictable from milk NIR spectra, their r values being 0.65 and 0.59, respectively. Sodium was predicted using mainly LED 16, zinc using LEDs 7, 10 and 16 (Supplementary Fig. S5). These results are not unexpected, for two main reasons. First, the minerals in this dataset exhibited extremely low variability compared to other blood metabolites (Table 1). Moreover, the minerals exhibiting greater variability had higher r values, supporting the hypothesis that traits with a wider range of variability provide better prediction models. Second, minerals do not have specific band absorptions in the near-infrared spectrum, so their predictability from NIR spectra might be related to their occurrence in organic compounds, such as proteins or organic molecules⁷⁴, or to the effect of the element on the water absorption band⁷⁵, as already shown in other food matrices^76,77. In this study, we predicted blood metabolites from milk spectra. Whereas a large proportion of reference values for minerals are in their ionized forms, such as 50% for calcium and 70% for magnesium, the majority of minerals in milk are in organic compounds: two-thirds of calcium is bound in organic compounds within the colloidal phase, more than 50% of total phosphorus is bound to casein micelles, and only 16% of magnesium is present as free ions⁷⁸. Furthermore, inorganic forms such as inorganic phosphorus (representing 80% of milk phosphorus) are difficult to detect using NIR spectra⁷⁴. Sodium, instead, is found mainly as free ions in both blood and milk⁷⁸, as is zinc, which is present prevalently in its ionized form⁷⁹. Furthermore, the amounts of sodium and zinc are markedly affected by inflammatory conditions, in both blood and milk. We can speculate that this similarity of chemical form in the two matrices, coupled with the alteration they bring about in the water region of the spectrum⁸⁰, might explain better predictive ability of our equations compared to the rest of the minerals⁷⁶.

Overall, the models we developed for predicting blood traits were found to have moderate predictive ability (r < 0.70) according to recommended thresholds⁸¹ and compared to predictions of milk blood metabolites from FTIR spectra^8,9,57. Although it has major advantages over FTIR, including rapidity of acquisition, cost-effectiveness, and continuous recording, NIR spectrometry uses light scattering principles to measure particle size, which makes it difficult to capture the nonlinear diffusive scattering produced by milk, a complex, heterogeneous matrix²⁵. Near-infrared spectra are highly influenced by the presence or absence of water¹⁵, and raw milk has a high water content²³. Despite all this, while FTIR is difficult to implement in the in-line process due to operational difficulties at the individual cow level and the need for repeated reference measurements, NIR has been successfully installed in in-line instruments (such as the AfiLab milk analyzer). With the implementation of prediction equations in in-line devices, it is possible to check the metabolic conditions of cows multiple times daily, allowing for real-time intervention to improve health status. From a managerial perspective, the availability of real-time metabolic indicators is a crucial step forward in the monitoring of subclinical diseases. Improvements to the prediction equations of milk NIR spectra installed in in-line devices could be the answer to the problem of monitoring the health of individual cows.

In conclusion, milk is a complex matrix, whose composition mirrors the metabolic status of the cow. By means of infrared spectroscopy, a wealth of information in the form of absorption profiles can be extracted on a daily basis using a non-invasive technique and stored for subsequent individual health evaluations. Moreover, the development of prediction equations based on milk NIR spectra using machine learning methods yielded better results than using traditional techniques (i.e. PLS), especially for hepatic enzymes and inflammatory indicators. We showed that prediction equations based on NIR spectra are not only able to predict components directly traceable in milk, but can also provide information on indirect indicators, such as blood metabolites, offering insights into the health status of the cow. The prediction equations for haptoglobin and hepatic enzymes, and for oxidative stress parameters, in particular, yielded promising results.

Nevertheless, in view of applying these equations in daily practice to monitor the health status and management of cows, further research is needed to better dissect the relationships between the NIR spectra and the milk matrix, improve accuracy, and identify the most informative NIR wavelengths for blood indicators of metabolic distress. Moreover, a larger sample that includes animals with clinical disease, thereby widening the population variability and improving the accuracy and repeatability of the predictions, is required to improve the accuracy of the prediction equations.

Methods

Study design and field data

This study is part of a broader project (PROH-DAIRY) funded by the Ministero degli Affari Esteri e della Cooperazione Internazionale (MAECI) within the Italy-Israel R&D Cooperation Program aimed at developing new precision livestock breeding tools as a contribution towards One Health in the Italian and Israeli dairy chains. To accomplish the objectives of the project and develop prediction equations able to predict diverse traits from milk using infrared technology, an AfiLab real-time milk analyzer (Afimilk, Kibbutz Afikim, Israel) was installed de novo in the milking parlor of the study farm to collect milk infrared spectra. The Afifarm management program (Afimilk, Kibbutz Afikim, Israel) was used for daily herd management and for storing the data, including the AfiLab spectra per cow per milking sessions (morning and evening).

The farm participating in the study is located in the production area of Grana Padano Protected Designation of Origin (PDO) hard cheese in Piacenza province (northwestern Italy). It holds a commercial herd comprising 965 lactating Holstein Friesian cows with an average milk yield around 9500 to 10,000 kg per cow per year. For this study, milk spectra data were obtained from 385 cows in their first to fifth parity from October 2019 to December 2020. The cows’ number of days in milk covered the entire lactation period (3 to 504 DIM). They were housed in free stalls and were fed on total mixed rations based mainly on corn silage, sorghum silage, and concentrates (Supplementary Table S2). Drinking water was available in automatic water bowls, and the cows were milked twice daily. Management information (e.g. feeding, DIM, parity, health status) was available, but was not experimentally manipulated.

Approval for the study was granted by the ethical committee of the Organismo Preposto al Benessere degli Animali (OPBA; Organization responsible for animal welfare) of the Università Cattolica del Sacro Cuore, and by the Italian Ministry of Health (protocol number 510/2019-PR of 19/07/2019). The study was carried out following the recommendations of the ARRIVE guidelines.

Blood sampling

Animals with clinical signs of disease or undergoing medical treatment were excluded from the trial. Blood samples (5 ml) were collected from the jugular vein of 385 animals and placed in vacuum tubes containing 150 USP units of lithium heparin (Vacumed; FL Medical, Torreglia, Padua, Italy). Blood sampling of each cow was carried out after the morning milking and before feeding from September 2019 to February 2020 (9 different herd/dates).

NIR spectra collection and preprocessing

The AfiLab system is a spectrophotometer that consists of an array of 32 discreet wavelengths in the range of visible to low-NIR (350 nm to 1000 nm) based on LEDs as previously described by Schmilovitch et al.²⁵. During the morning milking, NIR spectra were obtained from individual cows using the AfiLab system (Afimilk, Kibbutz Afikim, Israel; internal control), which collects infrared information from each 200 ml of milk flowing through the machine, then averages the information from each cow in each milking session. The AfiLab system is routinely calibrated automatically once a month to avoid accumulating bias in the infrared collection system.

The milk spectra acquired from each animal were preprocessed by the first derivative⁸², which was then normalized using a Standard Normal Variate equation $\left[ {SVN_{i} = {\raise0.7ex\hbox{${x_{i} - \overline{{x_{i} }} }$} \!\mathord{\left/ {\vphantom {{x_{i} - \overline{{x_{i} }} } {s_{i} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${s_{i} }$}}} \right]$ (Fig. 3); this procedure is used to remove the baseline variations that can occur over time. Quality control of infrared spectra to remove possible outliers was performed combining principal component analysis and Mahalanobis distance, using a significance level of 5%⁸³; three animals were removed from the subsequent analysis.

Reference blood metabolic profile analysis and preprocessing

Once collected, the blood samples were kept on ice until centrifugation (Hettich Universal 16R Centrifuge, 3500 g, 16 min, 6 °C) and were processed within two hours of collection. Hematocrit was determined using a small fraction of the sampled blood (ALC Centrifugette 4203, 15,300 g, 12 min). The plasma obtained from centrifugation was stored at − 20 °C until analysis. An ILAB 650 (Instrumentation Laboratory, Bedford, MA) clinical auto-analyzer was used to determine the concentrations of glucose, NEFA, BHBA, urea, creatinine, calcium, phosphorus, magnesium, sodium, potassium, chlorine, zinc, AST, GGT, ALP, total proteins, haptoglobin, ceruloplasmin, albumin, total bilirubin, cholesterol and globulins, according to Calamari et al.⁴; ROMt, FRAP and paraoxonase, according to Bionaz et al.³²; SHp, according to Minuti et al.⁸⁴.; myeloperoxidase, according to Bradley et al.⁸⁵.; and AOPP, according to Hanasand et al.⁸⁶.

Mathematical treatments were applied to the blood metabolites prior to model development. The original values of hematocrit, NEFA, BHBA, AST, GGT, creatinine, total bilirubin, haptoglobin and FRAP had strongly skewed distributions with lower values over-represented, which could affect prediction accuracy¹⁸. After having tested some other transformation procedures, such as square root and natural logaritmic ones, we applied a logarithmic transformation (base-10), which gave the best improvement in reducing skewness and normalizing distribution (Supplementary Fig. S6).

Prediction analysis and cross-validation scenarios

The performance of the infrared prediction models for blood metabolites were evaluated with 10-fold and leave-one-batch-out CV strategies. For the 10-fold CV, the dataset was split randomly into ten non-overlapping folds; in each iteration nine of the folds were used as the training set, and the remaining fold was assigned to the validation set. This process was repeated ten times, with each fold used once as the validation set. For the leave-one-batch-out CV, the dataset was randomly split on the basis of blood sampling date (i.e. into five batches); the training models thus comprised four batches, and one batch was assigned to the validation set. This process was repeated five times with each batch assigned once to the validation set.

Statistical analysis

Blood metabolites were predicted using the automatic machine learning (autoML) algorithm of the h2o R package (https://github.com/h2oai/h2o-3), which automatically trains and adjusts the models. Within the h2o.automl function, fixed parameters were a maximum of 100 models and random combinations of the hyperparameters over 45 min. For each individual, models considered the AfiLab infrared data along with the on-farm information related to DIM and parity. To find the model with the best prediction accuracy, the autoML was run using three machine learning models (RF, GBM, and ANN), three penalized regression models (RR, LASSO, and EN), and combining the models with the best predictions using a stacking ensemble.

Penalized regression

This approach for phenotypic prediction uses LASSO, RR or EN, which is a combination of LASSO (λ₁ regularization term) and RR penalties (λ₂ regularization term), providing a balance between the two methods. Ridge regression uses the λ₂ penalization in proportion to the sum of the squares of the regression coefficients in order to shrink the variables making lower contributions to close to zero, while LASSO considers λ₁ penalization using the sum of the absolute values of the coefficients to shrink the regression coefficients making lower contributions towards zero, and performing variable selection to reduce model complexity. Elastic net regression considers both the λ₁ and λ₂ norm to effectively shrink the coefficients and set some of them to zero.

Random Forest and GBM are regression tree approaches to learning that combine individual trees through bootstrap aggregation (RF) or boosting (GBM). The RF algorithm fits different regression trees and ultimately combines them in the final predictive model⁸⁷. The GBM fits a prediction model that converts weak learning into strong learning, adding models sequentially to the previous weak model to reduce variation and bias in the model⁸⁸. Multi-layer feedforward artificial neural network, a deep learning technique, transforms the input information non-linearly through multiple hidden layers (neural network) before making the final prediction; the number of hidden layers defines the depth of the neural network, while the number of neurons in the layers defines its width.

To identify the best combination of hyperparameters of the penalized regression (RR, LASSO and EN) and machine learning (RF, GBM and ANN) techniques, a random search was carried out of the main parameters of each model using the default values of the AutoML function in the h2o R package (https://docs.h2o.ai/h2o/latest-stable/h2o-docs/automl.html). Stacking ensemble uses the concept of learn-to-learn, and makes predictions using different, previously-trained base learners (penalized regression and machine learning), and combines the best models in the final predictive model⁸⁹. The base learner combination from the stacking ensemble uses a generalized linear model with a lambda search.

The predictive ability of the models was assessed by Pearson’s correlations (r) between the observed and predicted phenotypes, and the RMSE. The slope of the linear regression of the observed and predicted values in each model and CV strategy for the blood metabolite traits evaluated was used to assess the model’s unbiasedness. Each cow was used only once in each loop of the 10-fold CV in order to avoid having the same cow in both the test and training datasets at the same time but with different lactation orders. One loop ended when each sample had been used exactly once in testing. Each CV loop was repeated 500 times for each machine learning algorithm.

To compare the models’ performances with those using a traditional method, prediction equations for blood metabolic profiles were also developed using PLS regression implemented in the pls R package⁹⁰, and model performance was evaluated with random 10-fold CV.

Data availability

The data that support the findings of this study are deposited with Afimilk Ltd., and access is restricted as they were used under license for the current study and are therefore not publicly available. However, they can be obtained from the authors upon reasonable request and with the permission of Afimilk Ltd.

Code availability

The in-house code developed to perform the study is available from the authors upon request.

References

Esposito, G., Irons, P., Webb, E. & Chapwanya, A. Interactions between negative energy balance, metabolic diseases, uterine health and immune response in transition dairy cows. Anim. Reprod. Sci. 144, 60–71. https://doi.org/10.1016/j.anireprosci.2013.11.007 (2014).
Article CAS PubMed Google Scholar
Oetzel, G. Monitoring and testing dairy herds for metabolic disease. Vet. Clin. North Am. 20, 651–674. https://doi.org/10.1016/j.cvfa.2004.06.006 (2004).
Article Google Scholar
McArt, J., Nydam, D. & Overton, M. Hyperketonemia in early lactation dairy cattle: A deterministic estimate of component and total cost per case. J. Dairy Sci. 98, 2043–2054. https://doi.org/10.3168/jds.2014-8740 (2015).
Article CAS PubMed Google Scholar
Calamari, L., Ferrari, A., Minuti, A. & Trevisi, E. Assessment of the main plasma parameters included in a metabolic profile of dairy cow based on Fourier Transform mid-infrared spectroscopy: Preliminary results. BMC Vet. Res. 12, 4. https://doi.org/10.1186/s12917-015-0621-4 (2016).
Article CAS PubMed PubMed Central Google Scholar
Payne, J., Dew, S., Manston, R. & Faulks, M. The use of a metabolic profile test in dairy herds. Vet. Rec. 87, 150–158. https://doi.org/10.1136/vr.87.6.150 (1970).
Article CAS PubMed Google Scholar
Suthar, V., Canelas-Raposo, J., Deniz, A. & Heuwieser, W. Prevalence of subclinical ketosis and relationships with postpartum diseases in European dairy cows. J. Dairy Sci. 96, 2925–2938. https://doi.org/10.3168/jds.2012-6035 (2013).
Article CAS PubMed Google Scholar
McArt, J., Nydam, D. & Oetzel, G. Epidemiology of subclinical ketosis in early lactation dairy cattle. J. Dairy Sci. 95, 5056–5066. https://doi.org/10.3168/jds.2012-5443 (2012).
Article CAS PubMed Google Scholar
Luke, T. et al. Metabolic profiling of early-lactation dairy cows using milk mid-infrared spectra. J. Dairy Sci. 102, 1747–1760. https://doi.org/10.3168/jds.2018-15103 (2019).
Article CAS PubMed Google Scholar
Aernouts, B. et al. Mid-infrared spectroscopic analysis of raw milk to predict the blood nonesterified fatty acid concentrations in dairy cows. J. Dairy Sci. 103, 6422–6438. https://doi.org/10.3168/jds.2019-17952 (2020).
Article CAS PubMed Google Scholar
Sailer, K. et al. Technical note: Validation of the BHBCheck blood β-hydroxybutyrate meter as a diagnostic tool for hyperketonemia in dairy cows. J. Dairy Sci. 101, 1524–1529. https://doi.org/10.3168/jds.2017-13583 (2018).
Article CAS PubMed Google Scholar
McArt, J., Nydam, D., Oetzel, G. & Guard, C. An economic analysis of hyperketonemia testing and propylene glycol treatment strategies in early lactation dairy cattle. Prev. Vet. Med. 117, 170–179. https://doi.org/10.1016/j.prevetmed.2014.06.017 (2014).
Article CAS PubMed Google Scholar
Gross, J. & Bruckmaier, R. Review: Metabolic challenges in lactating dairy cows and their assessment via established and novel indicators in milk. Animal 13, s75–s81. https://doi.org/10.1017/S175173111800349X (2019).
Article CAS PubMed Google Scholar
Overton, T., McArt, J. & Nydam, D. A 100-year review: Metabolic health indicators and management of dairy cattle. J. Dairy Sci. 100, 10398–10417. https://doi.org/10.3168/jds.2017-13054 (2017).
Article CAS PubMed Google Scholar
Tiplady, K., Lopdell, T., Littlejohn, M. & Garrick, D. The evolving role of Fourier-transform mid-infrared spectroscopy in genetic improvement of dairy cattle. J. Anim. Sci. Biotechnol. 11, 39. https://doi.org/10.1186/s40104-020-00445-2 (2020).
Article CAS PubMed PubMed Central Google Scholar
Evangelista, C., Basiricò, L. & Bernabucci, U. An overview on the use of near infrared spectroscopy (NIRS) on farms for the management of dairy cows. Agriculture 11, 296. https://doi.org/10.3390/agriculture11040296 (2021).
Article Google Scholar
Fournel, S., Rousseau, A. & Laberge, B. Rethinking environment control strategy ofconfined animal housing systems throughprecision livestock farming. Biosyst. Eng. 155, 96–123 (2017).
Article Google Scholar
De Marchi, M., Toffanin, V., Cassandro, M. & Penasa, M. Invited review: Mid-infrared spectroscopy as phenotyping tool for milk traits. J. Dairy Sci. 97, 1171–1186. https://doi.org/10.3168/jds.2013-6799 (2014).
Article CAS PubMed Google Scholar
Grelet, C. et al. Development of Fourier transform mid-infrared calibrations to predict acetone, β-hydroxybutyrate, and citrate contents in bovine milk through a European dairy network. J. Dairy Sci. 99, 4816–4825. https://doi.org/10.3168/jds.2015-10477 (2016).
Article CAS PubMed Google Scholar
Grelet, C. et al. Potential of milk mid-IR spectra to predict metabolic status of cows through blood components and an innovative clustering approach. Animal 13, 649–658. https://doi.org/10.1017/S1751731118001751 (2019).
Article CAS PubMed Google Scholar
Diaz-Olivares, J., Adriaens, I., Stevens, E., Saeys, W. & Aernouts, B. Online milk composition analysis with an on-farm near-infrared sensor. Comput. Electron. Agric. 178, 105734 (2020).
Article Google Scholar
Melfsen, A., Hartung, E. & Haeussermann, A. Accuracy of in-line milk composition analysis with diffuse reflectance near-infrared spectroscopy. J. Dairy Sci. 95, 6465–6476. https://doi.org/10.3168/jds.2012-5388 (2012).
Article CAS PubMed Google Scholar
Mota, L. et al. Real-time milk analysis integrated with stacking ensemble learning as a tool for the daily prediction of cheese-making traits in Holstein cattle. J. Dairy Sci. https://doi.org/10.3168/jds.2021-21426 (2022).
Article PubMed Google Scholar
Tsenkova, R. et al. Near-infrared spectroscopy for dairy management: Measurement of unhomogenized milk composition. J. Dairy Sci. 82, 2344–2351. https://doi.org/10.3168/jds.S0022-0302(99)75484-6 (1999).
Article CAS PubMed Google Scholar
Tsenkova, R., Atanassova, S., Itoh, K., Ozaki, Y. & Toyoda, K. Near infrared spectroscopy for biomonitoring: Cow milk composition measurement in a spectral region from 1,100 to 2,400 nanometers. J. Anim. Sci. 78, 515–522. https://doi.org/10.2527/2000.783515x (2000).
Article CAS PubMed Google Scholar
Schmilovitch, Z., Shmuelevich, I., Notea, A. & Maltz, E. Near infrared spectrometry of milk in its heterogeneous state. Comput. Electron. Agric. 29, 195–207 (2000).
Article Google Scholar
Nguyen, H. et al. Potential of near infrared spectroscopy for on-line analysis at the milking parlour using a fibre-optic probe presentation. NIR News 22, 11–13 (2011).
Article Google Scholar
Katz, G. et al. Current and near term technologies for automated recording of animal data for precision dairy farming. J. Anim. Sci. 85, 377 (2007).
Google Scholar
Aernouts, B., Polshin, E., Lammertyn, J. & Saeys, W. Visible and near-infrared spectroscopic analysis of raw milk for cow health monitoring: Reflectance or transmittance?. J. Dairy Sci. 94, 5315–5329. https://doi.org/10.3168/jds.2011-4354 (2011).
Article CAS PubMed Google Scholar
Mota, L. et al. Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data. J. Dairy Sci. https://doi.org/10.3168/jds.2020-19861 (2021).
Article PubMed Google Scholar
Pralle, R., Weigel, K. & White, H. Predicting blood β-hydroxybutyrate using milk Fourier transform infrared spectrum, milk composition, and producer-reported variables with multiple linear regression, partial least squares regression, and artificial neural network. J. Dairy Sci. 101, 4378–4387. https://doi.org/10.3168/jds.2017-14076 (2018).
Article CAS PubMed Google Scholar
Soyeurt, H. et al. A comparison of 4 different machine learning algorithms to predict lactoferrin content in bovine milk from mid-infrared spectra. J. Dairy Sci. 103, 11585–11596. https://doi.org/10.3168/jds.2020-18870 (2020).
Article CAS PubMed Google Scholar
Bionaz, M. et al. Plasma paraoxonase, health, inflammatory conditions, and liver function in transition dairy cows. J. Dairy Sci. 90, 1740–1750. https://doi.org/10.3168/jds.2006-445 (2007).
Article CAS PubMed Google Scholar
Macrae, A. Interpreting blood haematology/biochemistry in cattle and sheep in the field. Livestock 22, 28–32 (2017).
Article Google Scholar
Iwersen, M., Falkenberg, U., Voigtsberger, R., Forderung, D. & Heuwieser, W. Evaluation of an electronic cowside test to detect subclinical ketosis in dairy cows. J. Dairy Sci. 92, 2618–2624. https://doi.org/10.3168/jds.2008-1795 (2009).
Article CAS PubMed Google Scholar
Ospina, P., Nydam, D., Stokol, T. & Overton, T. Evaluation of nonesterified fatty acids and beta-hydroxybutyrate in transition dairy cattle in the northeastern United States: Critical thresholds for prediction of clinical diseases. J. Dairy Sci. 93, 546–554. https://doi.org/10.3168/jds.2009-2277 (2010).
Article CAS PubMed Google Scholar
Butler, W., Calaman, J. & Beam, S. Plasma and milk urea nitrogen in relation to pregnancy rate in lactating dairy cattle. J. Anim. Sci. 74, 858–865. https://doi.org/10.2527/1996.744858x (1996).
Article CAS PubMed Google Scholar
Kessler, E., Bruckmaier, R. & Gross, J. Milk urea nitrogen concentration is higher in Brown Swiss than in Holstein dairy cows despite identical feeding. J. Anim. Physiol. Anim. Nutr. 104, 1671–1677. https://doi.org/10.1111/jpn.13408 (2020).
Article CAS Google Scholar
Colmenero, J. & Broderick, G. Effect of dietary crude protein concentration on milk production and nitrogen utilization in lactating dairy cows. J. Dairy Sci. 89, 1704–1712. https://doi.org/10.3168/jds.S0022-0302(06)72238-X (2006).
Article CAS PubMed Google Scholar
Ferguson, J., Galligan, D., Blanchard, T. & Reeves, M. Serum urea nitrogen and conception rate: The usefulness of test information. J. Dairy Sci. 76, 3742–3746. https://doi.org/10.3168/jds.S0022-0302(93)77716-4 (1993).
Article CAS PubMed Google Scholar
Rajala-Schultz, P., Saville, W., Frazer, G. & Wittum, T. Association between milk urea nitrogen and fertility in Ohio dairy cows. J. Dairy Sci. 84, 482–489. https://doi.org/10.3168/jds.s0022-0302(01)74498-0 (2001).
Article CAS PubMed Google Scholar
Godden, S. et al. Milk urea testing as a tool to monitor reproductive performance in Ontario dairy herds. J. Dairy Sci. 84, 1397–1406. https://doi.org/10.3168/jds.S0022-0302(01)70171-3 (2001).
Article CAS PubMed Google Scholar
Guo, K., Russek-Cohen, E., Varner, M. & Kohn, R. Effects of milk urea nitrogen and other factors on probability of conception of dairy cows. J. Dairy Sci. 87, 1878–1885. https://doi.org/10.3168/jds.s0022-0302(04)73346-9 (2004).
Article CAS PubMed Google Scholar
Nocek, J. Bovine acidosis: Implications on laminitis. J. Dairy Sci. 80, 1005–1028. https://doi.org/10.3168/jds.S0022-0302(97)76026-0 (1997).
Article CAS PubMed Google Scholar
Macrae, A., Whitaker, D., Burrough, E., Dowell, A. & Kelly, J. Use of metabolic profiles for the assessment of dietary adequacy in UK dairy herds. Vet. Rec. 159, 655–661. https://doi.org/10.1136/vr.159.20.655 (2006).
Article CAS PubMed Google Scholar
Martín-Tereso, J. & Martens, H. Calcium and magnesium physiology and nutrition in relation to the prevention of milk fever and tetany (dietary management of macrominerals in preventing disease). Vet. Clin. North Am. 30, 643–670. https://doi.org/10.1016/j.cvfa.2014.07.007 (2014).
Article Google Scholar
Peterson, R. & Waldern, D. Repeatabilities of serum constituents in Holstein-Friesians affected by feeding, age, lactation, and pregnancy. J. Dairy Sci. 64, 822–831. https://doi.org/10.3168/jds.S0022-0302(81)82653-7 (1981).
Article CAS PubMed Google Scholar
Hussein, H., Thurmann, J. & Staufenbiel, R. 24-h variations of blood serum metabolites in high yielding dairy cows and calves. BMC Vet. Res. 16, 327. https://doi.org/10.1186/s12917-020-02551-9 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wang, Q. & Bovenhuis, H. Validation strategy can result in an overoptimistic view of the ability of milk infrared spectra to predict methane emission of dairy cattle. J. Dairy Sci. 102, 6288–6295. https://doi.org/10.3168/jds.2018-15684 (2019).
Article CAS PubMed Google Scholar
Mota, L. et al. Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data. J. Dairy Sci. 104, 8107–8121. https://doi.org/10.3168/jds.2020-19861 (2021).
Article CAS PubMed Google Scholar
Breiman, L. Stacked regressions. Mach. Learn. 24, 49–64. https://doi.org/10.1007/BF00117832 (1996).
Article MATH Google Scholar
Vapnik, V. The Nature of Statistical Learning Theory 2nd edn. (Springer, 2000).
Book Google Scholar
Eskildsen, C. et al. Quantification of individual fatty acids in bovine milk by infrared spectroscopy and chemometrics: Understanding predictions of highly collinear reference variables. J. Dairy Sci. 97, 7940–7951. https://doi.org/10.3168/jds.2014-8337 (2014).
Article CAS PubMed Google Scholar
Vanlierde, A. et al. Hot topic: Innovative lactation-stage-dependent prediction of methane emissions from milk mid-infrared spectra. J. Dairy Sci. 98, 5740–5747. https://doi.org/10.3168/jds.2014-8436 (2015).
Article CAS PubMed Google Scholar
Schmilovitch, Z. et al. Spectroscopic Fluid Analyzer. (2007).
Pinsky, N. et al. System and Method for Analyzing Fluids. United States patent (2010).
Iweka, P., Kawamura, S., Mitani, T., Kawaguchi, T. & Koseki, S. Online milk quality assessment during milking using near-infrared spectroscopic sensing system. Environ. Control Biol. 58, 1–6. https://doi.org/10.2525/ecb.58.1 (2020).
Article CAS Google Scholar
Ho, P., Luke, T. & Pryce, J. Validation of milk mid-infrared spectroscopy for predicting the metabolic status of lactating dairy cows in Australia. J. Dairy Sci. 104, 4467–4477. https://doi.org/10.3168/jds.2020-19603 (2021).
Article CAS PubMed Google Scholar
Bach, K., Barbano, D. & McArt, J. Association of mid-infrared-predicted milk and blood constituents with early-lactation disease, removal, and production outcomes in Holstein cows. J. Dairy Sci. 102, 10129–10139. https://doi.org/10.3168/jds.2019-16926 (2019).
Article CAS PubMed Google Scholar
Nielsen, N., Ingvartsen, K. & Larsen, T. Diurnal variation and the effect of feed restriction on plasma and milk metabolites in TMR-fed dairy cows. J. Vet. Med. A 50, 88–97. https://doi.org/10.1046/j.1439-0442.2003.00496.x (2003).
Article CAS Google Scholar
Calamari, L., Maianti, M., Bani, P. & Sarti, L. Seasonal variations of some enzyme activities of cow milk. Ital. J. Anim. Sci. 4, 212–214 (2005).
Article Google Scholar
Liu, P. et al. Bioactivity evaluation of certain hepatic enzymes in blood plasma and milk of holstein cows. Pak. Vet. J. 32, 601–604 (2012).
CAS Google Scholar
Calamari, L., Gobbi, L., Russo, F. & Piccioli-Cappelli, F. Pattern of γ-glutamyl transferase activity in cow milk throughout lactation and relationships with metabolic conditions and milk composition. J. Anim. Sci. 93, 3891–3900. https://doi.org/10.2527/jas.2015-9022 (2015).
Article CAS PubMed Google Scholar
Ghada, A. Investigation of some enzymes level in blood and milk serum in two stages of milk yield dairy cows at Assiut city. Assiut Vet. Med. J. 60, 110–120 (2014).
Article Google Scholar
Bernabucci, U., Ronchi, B., Lacetera, N. & Nardone, A. Influence of body condition score on relationships between metabolic status and oxidative stress in periparturient dairy cows. J. Dairy Sci. 88, 2017–2026. https://doi.org/10.3168/jds.S0022-0302(05)72878-2 (2005).
Article CAS PubMed Google Scholar
Gutierrez, A., Boylston, T. & Clark, S. Effects of pro-oxidants and antioxidants on the total antioxidant capacity and lipid oxidation products of milk during refrigerated storage. J. Food Sci. 83, 275–283. https://doi.org/10.1111/1750-3841.14016 (2018).
Article CAS PubMed Google Scholar
Wang, S., Huang, Y. & Guan, X. Fluorescent probes for live cell thiol detection. Molecules 26, 3575. https://doi.org/10.3390/molecules26123575 (2021).
Article CAS PubMed PubMed Central Google Scholar
Trevisi, E., Zecconi, A., Bertoni, G. & Piccinini, R. Blood and milk immune and inflammatory profiles in periparturient dairy cows showing a different liver activity index. J. Dairy Res. 77, 310–317. https://doi.org/10.1017/S0022029910000178 (2010).
Article CAS PubMed Google Scholar
Bani Ismail, Z., Abutarbush, S., Al-Qudah, K. & Omoush, F. Serum and milk concentrations of oxidant and anti-oxidant markers in dairy cows affected with bloody milk. Pol. J. Vet. Sci. 23, 341–347. https://doi.org/10.24425/pjvs.2020.133651 (2020).
Article CAS PubMed Google Scholar
Baumann, H. & Gauldie, J. The acute phase response. Immunol. Today 15, 74–80. https://doi.org/10.1016/0167-5699(94)90137-6 (1994).
Article CAS PubMed Google Scholar
Nielsen, B., Jacobsen, S., Andersen, P., Niewold, T. & Heegaard, P. Acute phase protein concentrations in serum and milk from healthy cows, cows with clinical mastitis and cows with extramammary inflammatory conditions. Vet. Rec. 154, 361–365. https://doi.org/10.1136/vr.154.12.361 (2004).
Article CAS PubMed Google Scholar
Bobbo, T. et al. Variation in blood serum proteins and association with somatic cell count in dairy cattle from multi-breed herds. Animal 11, 2309–2319. https://doi.org/10.1017/S1751731117001227 (2017).
Article CAS PubMed Google Scholar
Bertoni, G., Trevisi, E., Han, X. & Bionaz, M. Effects of inflammatory conditions on liver activity in puerperium period and consequences for performance in dairy cows. J. Dairy Sci. 91, 3300–3310. https://doi.org/10.3168/jds.2008-0995 (2008).
Article CAS PubMed Google Scholar
Wall, S., Gross, J., Kessler, E., Villez, K. & Bruckmaier, R. Blood-derived proteins in milk at start of lactation: Indicators of active or passive transfer. J. Dairy Sci. 98, 7748–7756. https://doi.org/10.3168/jds.2015-9440 (2015).
Article CAS PubMed Google Scholar
Clark, D., Mayland, H. & Lamb, R. Mineral analysis of forages with near infrared reflectance spectroscopy. Agron. J. 79, 485–490 (1987).
Article CAS Google Scholar
Büning-Pfaue, H. Analysis of water in food by near infrared spectroscopy. J. Food Chem. 82, 107–115 (2003).
Article Google Scholar
Gonzales Ronquillo, M. & Angelez Hernandez, J. C. Antibiotic and synthetic growth promoters in animal diets: Review of impact and analytical methods. Food Control 72, 255–267 (2017).
Article Google Scholar
Goi, A., Manuelian, C., Currò, S. & Marchi, M. Prediction of mineral composition in commercial extruded dry dog food by near-infrared reflectance spectroscopy. Animals 9, 640. https://doi.org/10.3390/ani9090640 (2019).
Article PubMed Central Google Scholar
Zamberlin, S., Antunac, N., Havranek, J. & Samaržija, D. Mineral elements in milk and dairy products. Mljekarstvo 62, 111–125 (2012).
CAS Google Scholar
Parkash, S. & Jenness, R. Status of zinc in cow’s milk. J. Dairy Sci. 50, 127–134. https://doi.org/10.3168/jds.S0022-0302(67)87376-4 (1967).
Article CAS PubMed Google Scholar
De Marchi, M. et al. Prediction of sodium content in commercial processed meat products using near infrared spectroscopy. Meat Sci. 125, 61–65. https://doi.org/10.1016/j.meatsci.2016.11.014 (2017).
Article ADS CAS PubMed Google Scholar
Chang, C.-W., Laird, D. A., Mausbach, M. J. & Hurburgh, C. R. Near-infrared reflectance spectroscopy-principal components regression analyses of soil properties. Soil Sci. Soc. Am. J. 65, 480–490. https://doi.org/10.2136/sssaj2001.652480x (2001).
Article ADS CAS Google Scholar
Owen, A. Uses of Derivative Spectroscopy, Uv-Visible Spectroscopy, Application Note. (Agilent Te, 1995).
Shah, N. & Gemperline, P. A program for calculating Mahalanobis Distances using principal component analysis. TrAC Trends Anal. Chem. 8, 357–361 (1989).
Article CAS Google Scholar
Minuti, A. et al. Experimental acute rumen acidosis in sheep: Consequences on clinical, rumen, and gastrointestinal permeability conditions and blood chemistry. J. Anim. Sci. 92, 3966–3977. https://doi.org/10.2527/jas.2014-7594 (2014).
Article CAS PubMed Google Scholar
Bradley, P., Priebat, D., Christensen, R. & Rothstein, G. Measurement of cutaneous inflammation: Estimation of neutrophil content with an enzyme marker. J. Investig. Dermatol. 78, 206–209. https://doi.org/10.1111/1523-1747.ep12506462 (1982).
Article CAS PubMed Google Scholar
Hanasand, M. et al. Improved detection of advanced oxidation protein products in plasma. Clin. Chim. Acta 413, 901–906. https://doi.org/10.1016/j.cca.2012.01.038 (2012).
Article CAS PubMed Google Scholar
Breiman, L. Random Forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Article MATH Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 21. https://doi.org/10.3389/fnbot.2013.00021 (2013).
Article PubMed PubMed Central Google Scholar
Wolpert, D. Stacked generalization. Neural Netw. 5, 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1 (1992).
Article Google Scholar
Mevik, B.-H. & Wehrens, R. The pls Package: Principal component and partial least squares regression in R. J. Stat. Softw. 18, 91213 (2007).
Article Google Scholar

Download references

Acknowledgements

This research was part of the PROH-DAIRY project funded by the Ministero degli Affari Esteri e della Cooperazione Internazionale (MAECI). We would also like to thank the study farm, the Aziende Agricole Fugazza (Gragnanino, PC, Italy), for allowing us to carry out the sampling.

Author information

Authors and Affiliations

Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), University of Padua, 35020, Legnaro (PD), Italy
Diana Giannuzzi, Lucio Flavio Macedo Mota, Sara Pegolo, Luigi Gallo, Stefano Schiavon, Franco Tagliapietra & Alessio Cecchinato
Afimilk Ltd., 1514800, Kibbutz Afikim, Israel
Gil Katz & David Fainboym
Department of Animal Science, Food and Nutrition (DIANA) and the Romeo and Enrica Invernizzi Research Center for Sustainable Dairy Production (CREI), Faculty of Agricultural, Food and Environmental Sciences, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
Andrea Minuti & Erminio Trevisi

Authors

Diana Giannuzzi
View author publications
You can also search for this author in PubMed Google Scholar
Lucio Flavio Macedo Mota
View author publications
You can also search for this author in PubMed Google Scholar
Sara Pegolo
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Gallo
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Schiavon
View author publications
You can also search for this author in PubMed Google Scholar
Franco Tagliapietra
View author publications
You can also search for this author in PubMed Google Scholar
Gil Katz
View author publications
You can also search for this author in PubMed Google Scholar
David Fainboym
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Minuti
View author publications
You can also search for this author in PubMed Google Scholar
Erminio Trevisi
View author publications
You can also search for this author in PubMed Google Scholar
Alessio Cecchinato
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.C. designed the study and supervised the project. A.M. and E.T. collected the blood samples, performed the analysis, and aided in interpreting the results. D.G. and L.F.M.M. chose the methods, performed data analysis and data visualization. D.G., D.F., G.K., and A.C. interpreted the results. D.G. and L.F.M.M. drafted the first version of the paper. A.C., D.G., D.F., E.T., F.T., G.K., L.G., S.P. and S.S. reviewed the manuscript.

Corresponding author

Correspondence to Diana Giannuzzi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Giannuzzi, D., Mota, L.F.M., Pegolo, S. et al. In-line near-infrared analysis of milk coupled with machine learning methods for the daily prediction of blood metabolic profile in dairy cattle. Sci Rep 12, 8058 (2022). https://doi.org/10.1038/s41598-022-11799-0

Download citation

Received: 25 October 2021
Accepted: 12 April 2022
Published: 16 May 2022
DOI: https://doi.org/10.1038/s41598-022-11799-0

This article is cited by

Integrating on-farm and genomic information improves the predictive ability of milk infrared prediction of blood indicators of metabolic disorders in dairy cows
- Lucio F. M. Mota
- Diana Giannuzzi
- Alessio Cecchinato
Genetics Selection Evolution (2023)
Comparison of machine learning models for bluetongue risk prediction: a seroprevalence study on small ruminants
- Hagar F. Gouda
- Fardos A. M. Hassan
- Sherif A. Moawed
BMC Veterinary Research (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Relationship of the blood metabolome to subsequent carcass traits at slaughter in feedlot Wagyu crossbred steers

Differences in the serum metabolome profile of dairy cows according to the BHB concentration revealed by proton nuclear magnetic resonance spectroscopy (1H-NMR)

Changes in the blood metabolome of Wagyu crossbred steers with time in the feedlot and relationships with marbling

Introduction

Results and discussion

Blood metabolic profile of cows

Model performances

Relationships between light emitting diodes and blood traits

Predictive ability, meaning and biological link with milk of all blood traits

Methods

Study design and field data

Blood sampling

NIR spectra collection and preprocessing

Reference blood metabolic profile analysis and preprocessing

Prediction analysis and cross-validation scenarios

Statistical analysis

Penalized regression

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Integrating on-farm and genomic information improves the predictive ability of milk infrared prediction of blood indicators of metabolic disorders in dairy cows

Comparison of machine learning models for bluetongue risk prediction: a seroprevalence study on small ruminants

Comments

Search

Quick links