Optimization of Parameter Selection for Partial Least Squares Model Development

Zhao, Na; Wu, Zhi-sheng; Zhang, Qiao; Shi, Xin-yuan; Ma, Qun; Qiao, Yan-jiang

doi:10.1038/srep11647

Download PDF

Article
Open access
Published: 13 July 2015

Optimization of Parameter Selection for Partial Least Squares Model Development

Na Zhao^1,2,3,
Zhi-sheng Wu^1,2,3,
Qiao Zhang^1,2,3,
Xin-yuan Shi^1,2,3,
Qun Ma^1,2,3 &
…
Yan-jiang Qiao^1,2,3

Scientific Reports volume 5, Article number: 11647 (2015) Cite this article

10k Accesses
57 Citations
1 Altmetric
Metrics details

Subjects

Abstract

In multivariate calibration using a spectral dataset, it is difficult to optimize nonsystematic parameters in a quantitative model, i.e., spectral pretreatment, latent factors and variable selection. In this study, we describe a novel and systematic approach that uses a processing trajectory to select three parameters including different spectral pretreatments, variable importance in the projection (VIP) for variable selection and latent factors in the Partial Least-Square (PLS) model. The root mean square errors of calibration (RMSEC), the root mean square errors of prediction (RMSEP), the ratio of standard error of prediction to standard deviation (RPD) and the determination coefficient of calibration (R_cal²) and validation (R_pre²) were simultaneously assessed to optimize the best modeling path. We used three different near-infrared (NIR) datasets, which illustrated that there was more than one modeling path to ensure good modeling. The PLS model optimizes modeling parameters step-by-step, but the robust model described here demonstrates better efficiency than other published papers.

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Article Open access 12 April 2024

Plant responses to changing rainfall frequency and intensity

Article 09 April 2024

Predicting and improving complex beer flavor through machine learning

Article Open access 26 March 2024

Introduction

Near-infrared (NIR) Spectroscopy combined with Partial Least-Square (PLS) has gained wide acceptance because it is rapid, nondestructive and environmentally friendly. It has been successfully applied to different fields including agriculture^1,2, food stuffs³, petrochemistry⁴ and pharmacy^5,6,7. In addition, NIR technology offers sufficient accuracy and precision for solid and liquid systems without any sample pretreatment.

However, establishing a reliable and robust PLS model is a vital procedure for successful application of NIR. There are many parameters that should be optimized in the PLS model. These include spectral pretreatment, variable selection and latent factors. The NIR spectra contain unexpected chemical interferences related to sampling error, instrument error, etc.

In the process of establishing a PLS model, spectral pretreatment methods have been applied to extract relevant information and reduce the effect of noise and baseline drift^8,9. Variable selection is then also used to identify highly informative features and eliminate useless variables from the original spectral dataset^10,11. The latent factors that explain the spectral matrix are sorted in decreasing order according to their contribution to the spectral features. It is crucial to determine the latent factors and avoid over-fitting or under-fitting^12,13.

Most published work about PLS models offer few details about the model development. It is assumed that there is a univariate used to select these parameters one by one according the root mean square error of cross validation (RMSECV) and the root mean square errors of prediction (RMSEP). The inter-influence among the three parameters is rarely considered in model development. This is risky because the modeling path is not necessarily the best approach to step-by-step optimization.

Wenlong Li et al. established quantitative methods based on NIR to determine the active compounds of Lonicerae Japonicae Flos¹⁴. Ioan Tomuta et al. applied NIR for the determination of active pharmaceutical ingredients (API) and pharmaceutical properties including the crushing strength and disintegration time of meloxicam tablets¹⁵. Nádia Reis et al. developed a quantitative method for multiple adulterants in roasted coffee using diffuse reflectance infrared Fourier transform spectroscopy¹⁶. Yajie Xi et al. developed a NIR method to determinate the trace dimethyl fumarate in milk¹⁷. In the above reports, spectral pretreatment methods were compared according to the predictive performance of the calibration models. Using the optimal pretreatment method, latent factors were selected with a cross validation method. The optimum wavelength range was then tested and selected based on these pretreatment and latent variables. Although quantitative models could be obtained, they did not investigate the trajectory routes of PLS models that consider the combination of different parameters.

Based on this consideration, we propose a novel and systematic approach to improve the efficiency and accuracy in the development of PLS models. The tracking procedure of PLS modeling provided a systematic profile that combined different spectral pretreatment methods, latent factors and the variable importance in the projection (VIP) variable selection. The model performance was assessed using the root mean square errors of calibration (RMSEC), RMSEP, the determination coefficient of calibration (R_cal²) and validation (R_pre²) as well as the ratio of standard error of prediction to standard deviation (RPD)^18,19. To demonstrate the advantage of this novel approach, three different NIR spectral datasets were analyzed including two standard and open source datasets. The proposed procedure was used to predict water, baicalin and API in corn, Yinhuang granules and pharmaceutical tablets, respectively. These specific NIR applications were selected because they are important to quality control.

Result

Near-infrared spectral features

The NIR spectra (1,100–2,498 nm) of typical corn samples are shown in Fig. 1a. There were several broad peaks located near 1,190, 1,450 and 1,940 nm. These represent the characteristic peaks regarding the water in the corn spectral dataset. The strong absorption bands of water appeared around 1,400–1,450 nm and 1,900–1,950 nm due to the first overtone of OH stretching and combination of the OH stretching band with OH bending.

On the other hand, the NIR spectra of Yinhuang granules are shown in Fig. 1b. There was a weak absorption in the second overtones region (SCOT, 10,000–7,100 cm⁻¹) of the fundamental CH stretching bands. There was much fluctuation in the region of first combination-overtone (FCOT, 7,100–4,900 cm⁻¹) and combination region (CR, 4,900–4,000 cm⁻¹). The NIR spectra of pharmaceutical tablets are shown in Fig. 1c. As seen in the raw spectra, there were many clear absorption bands of the API from 900 to 1,630 nm.

There was also severe spectral overlap and baseline drift in the spectra of corn, Yinhuang granules and pharmaceutical tablets. Light scattering affected the raw spectra as well; this depended on the physical conditions such as particle size, density and variation among lots.

Reference method of three spectral datasets

The response variable was the relative content of water in the corn (%, w/w). A detailed description of sample conditions can be found at http://www.eigenvector.com/data/Corn/index.html. The other standard dataset of pharmaceutical tablets included 655 samples. The assay values of the API (%, w/w) were used as a reference value.

Baicalin in Yinhuang granules was quantitated with a calibration curve based on the concentration range from 25.4 to 50.8 μg mL⁻¹ of baicalin using consecutive injections of different concentrations. The regression equation was y = 20.25x + 35.10 (r = 1.0000) with y being the peak area in mAUs and x being the concentration of injection (μg mL⁻¹). We concluded that high performance liquid chromatography (HPLC) was suitable for quantitation. Baicalin concentrations varied 1.61% to 6.66% (mg mg⁻¹). The reference values of baicalin were accurate and could be used in NIR models.

The selection of sample sets using three datasets

The Kennard-Stone (K-S) algorithm selected the calibration and validation sets. The corn samples included 53 samples for calibration set and 27 samples for external validation. In addition, the Yinhuang granule dataset used 48 and 24 samples in the calibration and validation sets, respectively. The three different subsets of pharmaceutical tablets were used as the calibration set (155), validation set (460) and test set (40). The statistics for water, baicalin and API contents in the calibration, validation and test sets are summarized in Table 1.

Table 1 The statistic of water, baicalin and API contents in the calibration and validation sets.

Full size table

Processing trajectory of PLS model

Using the corn dataset as an example, the calibration spectra were preprocessed with different methods including Standard Normal Variate (SNV) and Savitzky-Golay smoothing with 9 points (SG(9)), as well as SG(9) combined with the derivative spectra. The latent factor was set from 1 to 10 to avoid over-fitting. VIP was then used to select variables with different latent factors. Finally, the process routines from PLS model development and validation were selected (Fig. 2). The parameters for PLS models in water, baicalin and API are shown in Table. 1s-3s. There are various trends in the model evaluation indexes. In Fig. 2a, we see that the RMSEC and RMSEP decreased and the R_cal², R_pre² and RPD increased with increasing latent factor coupled with different pretreatment methods.

The model for Yinhuang granules dataset is shown in Fig. 2b. This was different from the corn dataset model. The Yinhuang granules dataset model used first derivative spectra (1D) combined with SG(9) and SNV to preprocess the spectra. This was superior to other spectra pretreatment methods in which the various trends of the model evaluation indexes were unclear. The model for pharmaceutical tablets is shown in Fig. 2c. Changes in model evaluation indexes were not obvious when the latent factor increased to a certain value.

However, this result showed that there was more than one modeling path that can ensure a successful model. There were two fair PLS models with RPD between 2.5 to 3 (Fig. 3a) including: 1) a combinational method of SG(9) spectral pretreatment, VIP and 10 factors and 2) a combinational method using the second derivative spectra (2D) combined with SG(9) spectral pretreatment, VIP and 10 factors. Most of PLS models were fair and there were also some good model paths with RPD values greater than 3 (Fig. 3b). In Fig. 3c, there were many models with good performance that adopted a processing trajectory.

In the previous modeling process routine, the parameters included spectral pretreatment methods, variable selection and latent factors were optimized one at a time. Table 4s shows that this is a good approach to path modeling versus step-by-step parameter optimization. The optimal nonsystematic parameters of the water PLS model were the raw spectra and VIP-selecting variables under 7 factors; the model performance was poor. However, processing trajectory showed that two fair model could be obtained through combinational methods using SG(9) pretreatment, VIP and 10 factors as well as a combinational method of 2D + SG(9) pretreatment, VIP and 10 factors. The best nonsystematic parameter combination for the baicalin PLS model was SNV pretreatment and VIP-selecting variables under 4 factors. This gave good model performance. However, the processing trajectory showed eight good models with different systematic parameter combinations. The best parameter combination of the baicalin PLS model was a SNV spectra pretreatment with VIP-selecting variables under 3 factors.

The parameter combination for the API model used raw spectra with VIP-selecting variables under 4 factors. These were optimized one by one. The model performance was very good. However, there were 18 very good models and 5 excellent models with RPD values above 4 that were obtained by processing trajectory. The best parameter combination for the PLS models were SG(9) spectra pretreatment and VIP selecting variables under 10 factors.

The best models were obtained through a step-by-step optimization and processing trajectory that was tested with a test set of 40 independent samples. The RMSEP of the two models were 0.7904% and 0.5681%, respectively. This demonstrated that the model obtained through the processing trajectory was better than the model optimized step-by-step. This illustrates that the best systematic optimal model parameters were obtained via the processing trajectory.

Development and validation of calibration models

The model validity was characterized with RMSEC, RMSEP, R_cal², R_pre² and RPD. Taking the calibration model of the corn dataset as an example, Fig. 2a showed that the PLS model of water with SG(9) pretreatment and VIP-selecting variables under 10 factors had the best performance. The RMSEC and R_cal² of the calibration set were 0.1042% and 0.9321, respectively. The RMSEP was 0.1256% – quite close to RMSEC. The RPD and R_pre² of the validation set were 2.6387 and 0.8554, respectively. These results demonstrated that the PLS model of water also had a good predictive performance.

Similarly, we developed the PLS model of baicalin with SNV spectral pretreatment and VIP-selecting variables under 3 factors. The RMSEC and R_cal² of the calibration set were 0.5609% and 0.8250, respectively. The RMSEP, R_pre² and RPD of the baicalin validation set were 0.3524%, 0.9066 and 3.2723, respectively. The model of API with SG(9) pretreatment and VIP-selecting variables under 10 factors was also established. The RMSEC and R_cal² of the calibration set were 1.0048% and 0.9706. The RMSEP, R_pre² and RPD of the API validation set were 0.9581%, 0.9493 and 4.4581, respectively.

Figure 4 presents data for the PLS models using three datasets. The prediction values were quite close to the wet analysis. The parameters for water, baicalin and API models indicated that NIR could be used for the determination of water, baicalin and API of corn, Yinhuang granules and pharmaceutical tablets, respectively.

Discussion

We proposed the use of processing trajectory to develop and optimize multivariate calibration models. The PLS models were established with different spectral pretreatments and VIP variable selection methods with different latent factors. Based chemometric indicators (RMSEP, RMSEC, R_cal², R_pre² and RPD), different PLS models were used to assessed the water, baicalin and API in corn, Yinhuang granules and pharmaceutical tablets, respectively. The present work demonstrates the feasibility and advantages of processing trajectory in the development and optimization of multivariate calibration models. In conclusion, a systematic procedure for model optimization based on the processing trajectory shows excellent results to develop a robust model. The proposed approach should be integrated into PLS software to improve on the available PLS models.

Methods

Datasets

Corn spectral dataset

NIR diffuse spectra of corn were obtained from the website as standard data (http://www.eigenvector.com/data/Corn/index.html)²⁰. The dataset consists of 80 corn samples. The NIR spectra of corn samples were measured with three spectrometers; the moisture content was included (%, w/w). The spectra were measured with a mp5 NIR spectrometer and the water values were used as reference value. The spectral acquisition range was 1,100–2,498 nm at 2 nm intervals resulting in a total of 700 variables per sample.

Yinhuang granules spectral dataset

The dataset consists of diffuse reflectance NIR spectra with 72 Yinhuang granule samples. The response variable was the relative content of active baicalin in the granules (%, w/w). The baicalin content was measured with HPLC as described in the Ch.P. (2010 Edition, Volume I). We used an Agilent 1100 HPLC system (Agilent Technologies, USA) with a vacuum degasser, a quaternary pump, an auto sampler, a thermostatic column compartment and a diode array detector (DAD). Separation was performed on an ODS column (150 mm, 4.6 mm, 5 mm, Waters, USA) with isocratic elution of the mobile phase consisting of methanol, water and phosphoric acid (50:50:0.2, v/v) at a flow rate of 1.0 mL/min. The column temperature was 30 °C and the detection wavelength was 274 nm.

The NIR spectra were collected in an integrating sphere diffuse mode with an Antaris Nicolet FT-NIR system (Thermo Fisher Scientific Inc., USA). Each sample spectrum was the result of 32 scans from 10,000 to 4,000 cm⁻¹ (a total of 1557 wavenumber variables per sample) at ambient temperature using 8 cm⁻¹ resolution. Every sample was scanned three times and the final spectrum used for each sample was an average of the three results. All NIR spectra were collected and archived using the Thermo Scientific Result software.

Pharmaceutical tablets spectral dataset

The dataset for the pharmaceutical tablets was available at http://www.eigenvector.com/data/tablets/index.html. It contains 1,308 spectra of 655 pharmaceutical tablets measured on two similar instruments (Foss/NIR Systems Multitab Spectrometers). The spectral region was from 600 to 1,898 nm with 2 nm increments. The data of each instrument were organized into three different subsets. The assay value of the active ingredient, tablet weight and tablet hardness were provided. In this work, we used the spectra of the first instrument and the assay values of the API (%, w/w). There were three different sets of 155, 460 and 40 spectra that were used as calibration, validation and test sets in this study, respectively.

Multivariate data analyses

The spectra pretreatment and model development were performed with Unscrambler 9.7 software package (CAMO Software AS, Norway). The VIP-based variable selection methods were implemented with custom routines in MATLAB (MATLAB, The MathWorks, Massachussetts).

Summary of the proposed procedure

The procedure we used to track and evaluate modeling processes with different spectral pretreatment methods, latent factors and VIP variable selection is summarized in Fig. 2 and is detailed in the following section.

Step 1. The modeling parameters and their levels were defined, which contained spectral pretreatment methods, latent factors and variable selection methods.

Step 2. The evaluation indexes of model were established to include RMSEC, R_cal², RMSEP and R_pre². Important parameters for the RPD were also included to assure the model assessment.

Step 3. The calibration and validation data sets were selected to ensure that both datasets were representative of the experimental design.

Step 4. We ran the PLS models established for all systematic parameters optimization. The results for the evaluation indexes of each model were registered.

Step 5. The results were analyzed and the trajectory routes for PLS modeling were defined.

Step 6. PLS model was established at the best systematic optimization of parameters. When necessary, the model was further refined to exclude samples with higher spectral and concentration residuals.

Step 7. The model was applied to routine analysis and was continuously monitored to check for needed updates.

Multivariate calibration

The PLS was used to build calibration models. First, the K-S algorithm was used to split the data set into calibration and validation sets (2:1) such that the samples could be selected to represent the entire experimental domain. We used spectral pretreatment to decrease baseline shifts and remove the scattering effect created by diffuse reflectance and overlapping peaks. These caused detrimental effects on the signal-to-noise ratio. We used SNV, SG(9) and SG(9) combined with derivative spectra to remedy this. Meanwhile, VIP was utilized to select variables for identifying highly informative features²¹. The performance of the regression models was evaluated in terms of the R_cal² and RMSEC. The prediction ability of the external validation was assessed by RMSEP, R_pre² and RPD.

Generally, the optimal number of latent factors was determined from the result of 10-fold cross validation tests after spectral pretreatment. The VIP variable selection algorithms were then used to select the characteristic variables. The ideal methods for spectral pretreatment, the number of latent factors and models were selected based on the RMSEC, RMSEP, R_cal², R_pre², RPD, etc. Unlike the previous method, this approach uses the global optimal strategy and trajectory routes of all possible parameter combinations.

Additional Information

How to cite this article: Zhao, N. et al. Optimization of Parameter Selection for Partial Least Squares Model Development. Sci. Rep. 5, 11647; doi: 10.1038/srep11647 (2015).

References

Sileoni, V., van den Berg, F., Marconi, O., Perretti, G. & Fantozzi, P. Internal and external validation strategies for the evaluation of long-term effects in NIR calibration models. J. Agric. Food Chem. 59, 1541–1547 (2011).
Article CAS Google Scholar
Devos, O., Downey, G. & Duponchel, L. Simultaneous data pre-processing and SVM classification model selection based on a parallel genetic algorithm applied to spectroscopic data of olive oils. Food Chem. 148, 124–130 (2014).
Article CAS Google Scholar
Cozzolino, D. & Murray, I. A. review on the application of infrared technologies to determine and monitor composition and other quality characteristics in raw fish, fish products and seafood. Appl. Spectrosc. Rev. 47. 207–218 (2012).
Article ADS Google Scholar
Reboucas, M. V., Santos, J. B., Pimentel, M. F. & Teixeira, L. S. A novel approach for development of a multivariate calibration model using a Doehlert experimental design: Application for prediction of key gasoline properties by Near-infrared Spectroscopy. Chemometr. Intell. Lab. 107, 185–193 (2011).
Article CAS Google Scholar
Wu, Z., et al. NIR spectroscopy as a process analytical technology (PAT) tool for monitoring and understanding of a hydrolysis process. Bioresource. Technol. 137, 394–399 (2013).
Article CAS Google Scholar
Märk, J., Karner, M., Andre, M., Rueland, J. & Huck, C. W. Online process control of a pharmaceutical intermediate in a fluidized-bed drier environment using near-infrared spectroscopy. Anal. Chem. 82, 4209–4215 (2010).
Article Google Scholar
Xu, B., Wu, Z., Lin, Z., Sui, C., Shi, X. & Qiao, Y. NIR analysis for batch process of ethanol precipitation coupled with a new calibration model updating strategy. Anal. Chim. Acta 720, 22–28 (2012).
Article CAS Google Scholar
Fernández-Cabanás, V. M., Garrido-Varo, A., Olmo, J. G., Pedro, E. D. & Dardenne, P. Optimisation of the spectral pre-treatments used for Iberian pig fat NIR calibrations. Chemometr. Intell. Lab. 87, 104–112 (2007).
Article Google Scholar
Wu, Z. et al. Multivariate detection limits of on-line NIR model for extraction process of chlorogenic acid from Lonicera japonica. J. Pharmaceut. Biomed. 77, 16–20 (2013).
Article CAS Google Scholar
Wu, Z. et al. A novel model selection strategy using total error concept. Talanta 107, 248–254 (2013).
Article CAS Google Scholar
Lee, H. W., Bawn, A. & Yoon, S. Reproducibility, complementary measure of predictability for robustness improvement of multivariate calibration models via variable selections. Anal. Chim. Acta 757, 11–18 (2012).
Article CAS Google Scholar
Sileoni, V., Marconi, O., Perretti, G. & Fantozzi, P. Evaluation of different validation strategies and long term effects in NIR calibration models. Food Chem. 141, 2639–2648 (2013).
Article CAS Google Scholar
Wu, Z., Du, M., Xu, B., Lin, Z., Shi, X. & Qiao, Y. Absorption characteristics and quantitative contribution of overtones and combination of NIR: Method development and validation. J. Mol. Struct. 1019, 97–102 (2012).
Article ADS CAS Google Scholar
Li, W., Cheng, Z., Wang, Y. & Qu, H. Quality control of Lonicerae Japonicae Flos using near infrared spectroscopy and chemometrics. J. Pharmaceut. Biomed. 72, 33–39 (2013).
Article CAS Google Scholar
Tomuta, I., Iovanov, R., Bodoki, E. & Vonica, L. Development and validation of NIR-chemometric methods for chemical and pharmaceutical characterization of meloxicam tablets. Drug Dev. Ind. Pharm. 40, 549–559 (2013).
Article Google Scholar
Reis, N., Franca, A. S. & Oliveira, L. S. Quantitative evaluation of multiple adulterants in roasted coffee by Diffuse Reflectance Infrared Fourier Transform Spectroscopy (DRIFTS) and chemometrics. Talanta 115, 563–568 (2013).
Article CAS Google Scholar
Xie, Y. J., Wang, Z., Hu, W. P. & Xu, S. Fast determination of trace dimethyl fumarate in milk with near infrared spectroscopy following fluidized bed enrichment. Anal. Bioanal. Chem. 404, 3189–3194 (2012).
Article CAS Google Scholar
Esbensen, K. H., Geladi, P. & Larsen, A. The RPD myth…. NIR news. 25, 24–28 (2014).
Article Google Scholar
Williams, P. Tutorial: The RPD statistic: a tutorial note. NIR news. 25, 22–26 (2014).
Article Google Scholar
Shan, R., Cai, W. & Shao, X. Variable selection based on locally linear embedding mapping for near-infrared spectral analysis. Chemometr. Intell. Lab. 131, 31–36 (2014).
Article CAS Google Scholar
Sills, D. L. & Gossett, J. M. Using FTIR spectroscopy to model alkaline pretreatment and enzymatic saccharification of six lignocellulosic biomasses. Biotechnol. Bioeng. 109, 894–903 (2012).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the Beijing Municipal Government for the university affliated with the Party Central Committee, BUCM Found for Excellent Young Scholars, National Natural Science Foundation of China (81303218) and Doctoral Fund of Ministry of Education of China (20130013120006). The authors thank Eigenvector for providing the corn and pharmaceutical tablets NIR datasets. The authors thank the Key Laboratory of TCM-Information Engineering of State Administration of Traditional Chinese Medicine, (Beijing, China) for the assistance in data processing.

Author information

Authors and Affiliations

Beijing University of Chinese Medicine, Beijing, 100102, China
Na Zhao, Zhi-sheng Wu, Qiao Zhang, Xin-yuan Shi, Qun Ma & Yan-jiang Qiao
Beijing Key Laboratory for Basic and Development Research on Chinese Medicine, Beijing, 100102, China
Na Zhao, Zhi-sheng Wu, Qiao Zhang, Xin-yuan Shi, Qun Ma & Yan-jiang Qiao
Key Laboratory of TCM-information Engineer of State Administration of TCM, Beijing, 100102, China
Na Zhao, Zhi-sheng Wu, Qiao Zhang, Xin-yuan Shi, Qun Ma & Yan-jiang Qiao

Authors

Na Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhi-sheng Wu
View author publications
You can also search for this author in PubMed Google Scholar
Qiao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xin-yuan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Qun Ma
View author publications
You can also search for this author in PubMed Google Scholar
Yan-jiang Qiao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.Y.J. and W.Z.S. conceived this study and designed the experiments. Z.N. performed the experiments with the help of Z.Q. and M.Q. Z.N. analyzed the data and wrote the manuscript. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Zhao, N., Wu, Zs., Zhang, Q. et al. Optimization of Parameter Selection for Partial Least Squares Model Development. Sci Rep 5, 11647 (2015). https://doi.org/10.1038/srep11647

Download citation

Received: 29 January 2015
Accepted: 28 May 2015
Published: 13 July 2015
DOI: https://doi.org/10.1038/srep11647

This article is cited by

FTIR spectral analysis combined with chemometrics in evaluation of composite mixtures of coconut testa flour and wheat flour
- Rasika Gunarathne
- Nazrim Marikkar
- Eresha Mendis
Journal of Food Measurement and Characterization (2022)
Near-Infrared Spectroscopy for Mapping of Human Meniscus Biochemical Constituents
- Juho Ala-Myllymäki
- Tommi Paakkonen
- Isaac O. Afara
Annals of Biomedical Engineering (2021)
A critical review of recent trends, and a future perspective of optical spectroscopy as PAT in biopharmaceutical downstream processing
- Laura Rolinger
- Matthias Rüdt
- Jürgen Hubbuch
Analytical and Bioanalytical Chemistry (2020)
Systematic discovery about NIR spectral assignment from chemical structural property to natural chemical compounds
- Lijuan Ma
- Yanfang Peng
- Zhisheng Wu
Scientific Reports (2019)
High-throughput analysis of chemical components and theoretical ethanol yield of dedicated bioenergy sorghum using dual-optimized partial least squares calibration models
- Meng Li
- Jun Wang
- Guang Hui Xie
Biotechnology for Biofuels (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.