Abstract
Building-integrated photovoltaics (BIPV) is an emerging technology in the solar energy field. It involves using luminescent solar concentrators to convert traditional windows into energy generators by utilizing light harvesting and conversion materials. This study investigates the application of machine learning (ML) to advance the fundamental understanding of optical material design. By leveraging accessible photoluminescent measurements, ML models estimate optical properties, streamlining the process of developing novel materials, offering a cost-effective and efficient alternative to traditional methods, and facilitating the selection of competitive materials. Regression and clustering methods were used to estimate the optical conversion efficiency and power conversion efficiency. The regression models achieved a Mean Absolute Error (MAE) of 10%, which demonstrates accuracy within a 10% range of possible values. Both regression and clustering models showed high agreement, with a minimal MAE of 7%, highlighting the efficacy of ML in predicting optical properties of luminescent materials for BIPV.
Similar content being viewed by others
Introduction
The need for the prediction of crystal structures based on chemical or material composition from produced or published experimental data is still a huge challenge1,2. Traditional methods of discovering new materials or materials with new properties, such as laboratory experiments have long development cycles and high costs3. Machine learning (ML) algorithms are a new promising tool to tackle this issue, opening the possibility of managing and extracting valuable insights from vast amounts of data4. The application of ML to materials science is recent5, with an emphasis on basic models in nature, being also possible to use ML as a simple fitting procedure for small low-dimensional datasets1.
Over the past decade, the progress of ML has greatly impacted the entire spectrum of physical sciences, including materials science6 and a reflection of that is the creation of the material genome initiative7 as a path for accelerating materials development and rationally designing materials through the use of data-driven methods. Other works report the use of ML linear regression methods to build force constant models for thermodynamic properties of materials used in physics and chemistry8 and the data-centric approach was used for polymer crystals9, pharmaceutical science10 and material constitutive modeling for metal forming processes11, with likely many more important developments to come shortly. Also, the importance of the increasing quantity of data recently collected (from experiments and simulations), for the advancement of the ML as a tool for phosphor design has already been discussed12. ML is thus poised to further enhance its role in materials science by playing a crucial role in identifying upcoming opportunities and challenges within this rapidly growing field13.
In an era overshadowed by climate change, environmental degradation, and the recent energy crisis, the fields of materials science and engineering have gained significant relevance. These fields offer crucial avenues to mitigate the repercussions of these challenges through the innovative capacity of materials, enabling them to predict, control, and enhance not only the behavior of materials but also their sustainable manufacturing, usage, and recyclability14. Photonics emerges as one of the main enabling technologies in the twenty-first century due to its fulminant growth and its potential to increase innovation in several industries such as photovoltaics that stands out recently in the field of building integrated photovoltaics to cope with energy needs and in-situ energy generation in buildings15,16.
Luminescent solar concentrators (LSCs)17,18 represent a promising strategy for converting passive glass windows into self-sustaining energy sources15. By harnessing solar energy and transforming it into low-energy photons capable of generating electricity in solar cells, LSCs offer a revolutionary approach to energy generation (Fig. 1a)15. Essentially, an LSC comprises a waveguide—either planar or fiber—infused with optically active centers. These centers absorb the solar photons and subsequently convert it into low-energy photons. Guided by total internal reflection, these photons are directed towards photovoltaic cells positioned along the edges of the waveguide and efficiently converted into electricity. Notably, LSCs enable the utilization of solar radiation through large-area devices, utilizing minimal photovoltaic material—only required at the window edges. Electrical output values of 10 W per window are envisaged for a surface area of 0.05 to 0.1 m2 with the current figures of merit, granting, for instance, the power of Internet of Things (IoT) devices19,20,21. This opened the door to the consideration of larger windows, typical for residential buildings, with dimensions on the order of square meters, which could be conceptualized as aggregates of LSC with smaller dimensions without compromising the electrical output.
The optimization of the LSC performance is a complex task as its operating principles account for multiple events, such as emission from the optically active center, Fresnel reflections, surface scattering, waveguide attenuation, transmitted radiation, re-absorption by neighbor centers, non-radiative relaxation, and emission within the escape cone15 (Fig. 1a). Challenges include the design of spectral converters able to shape the sunlight to cope with the mismatch between the solar irradiance on Earth and the photovoltaic cells’ absorption since typical low-cost silicon solar cell presents low performance in the UV spectral region. Moreover, for application in facades, LSCs must ensure that the visible component of radiation is not absorbed to maintain transparency, redirecting the remaining components to the spectral region aligned with the PV's maximum absorption. Thus, we propose to use ML algorithms for data-driven research in the field of optical materials for photon management with an emphasis on low-power excitation, namely diffuse sunlight conditions such as cloudy days, where the relative intensity of UV photons related to that of the visible/NIR is higher than that found in clear sky conditions22.
The objective here is to discuss the use of ML as a valuable resource for decision-making tools for device design without extensive experimental measurements, as recently sugested23. A key step is the identification of the photoluminescence figures-of-merit related to performance (e.g. absorption and emission spectral ranges, quantum yield, photostability) that enable the outcomes (e.g. families of materials, concentration, preparation methods) to be properly benchmarked24,25. The independent identification will permit to impact on fundamental research, as typically stability and optical performance are addressed by distinct research fields, whereas engineers and industry can combine both simultaneously, as the commercial application requires stability and high performance.
We propose the use of classical ML-based regression and clustering methods as relevant fitting procedures for the estimation of underlying optical properties of luminescent materials. It is demonstrated that the ηopt, and the power conversion efficiency (PCE) can be estimated from easily accessible, measurable optical features (i.e. absorption and emission spectra). The comparative study of the five most typically applied ML regression models, namely Gradient Boosting Regressor (GBR), Linear Regression, K- Nearest Neighbors Regressor, Random Forest Regressor, and XGBoost, demonstrate similar performance. After cleaning and removal of data outliers, the performance of the proposed regressors was further improved, demonstrating the high potential of a data-driven approach for the estimation of optical properties of new materials even for a small-size data set24,25.
Results and discussion
Data records
The dataset consists of several numerical optical features of the first 260 entries of dataset available elsewhere,24 and shown in Fig. 2. The selected numerical features are the (i) peak absorption wavelength (Ap), (ii) minimum absorption wavelength (Amin), (iii) maximum absorption wavelength (Amax) (iv) peak emission wavelength (Ep) (v) minimum emission wavelength (Emin), (vi) maximum emission wavelength (Emax), and (vii) absolute emission quantum yield (ηyield defined as the ratio between the number of absorbed and emitted photons by a sample)25. To illustrate the physical meaning of each experimental parameter, Fig. 1b illustrates the absorption and emission spectra of a selected material. The parameters related with the absorption and emission bands are particularly important since they define the self-absorption, quantified by the overlap between the absorption and emission spectra which lead to re-absorption losses (e.g. large Stokes-shift15,26 defined for organic molecules). This has been pointed out as one of the most critical aspects for the device performance15,16,27,28,29,30,31, although its quantification is available in few works25,27,28,29. Photostability is also a crucial parameter for real application of LSC devices. Nevertheless, as this is often not reported in the published works and without standard conditions to be performed or reported, this was not included as a feature in this study.
Two categorical features (mat0 and mat1) related to either the type of the optical center were also considered, namely organic dye (dye), lanthanide ions (Ln), quantum dot (QD), carbon dot (CD), nanoparticle (NP) or the processing feature (e.g. bulk, fiber, solution, film), respectively. The figures of merit that characterize the LSC performance are ηopt, given by the ratio between the output optical power (Pout) and the input optical power (Pin) and PCE that accounts for the percentage of the solar energy incident on the LSC that is converted into usable electricity (Eq. S1-S4 in Supplementary Information). Table 1 shows a sample of the dataset illustrating the general optical features and the performance optical features for reported LSCs according to the optically active center.
Regression stage
The analysis started with a quantitative assessment of the numerical distribution of each feature (Fig. 2) and a visual inspection of the mutual correlation between each pair of features and the correlation of the features with the estimated output variables (Figs. 3 and S1 in Supplementary Information), as detailed in the "Methods" section. Note that there is no substantial linear correlation between any of the input variables (the predictors) and the predicted outputs. A linear correlation was only observed between some of the features, namely Emax vs. Ep and ηopt vs. PCE (Fig. 3). The linear correlation between the optical features may be rationalized by attending to the typical Gaussian profile of the photoluminescence spectra, inducing the correlation between the Emax and Ep features23. The correlation between ηopt and PCE arises due to the fact that the larger number of incident photons are converted (quantified by ηopt), the larger the probability to generate electrons in the photovoltaic cell (quantified by PCE). Noticeably, the PCE is mainly dependent on the semiconductor type used to fabricate the photovoltaic cell and on its efficiency15. As silicon-based photovoltaic cells were used in 90% of the data25, the optical response is similar, and therefore, PCE linearly correlates well with ηopt.
We note that the performance in terms of ηopt is independent of the photovoltaic technology as it only quantifies the spectral conversion of the emitting layers, and thus this parameter can be predicted using ML algorithms, as recently shown using artificial neural networks23. Only the PCE values acquired with other semiconductor-based technologies (e.g. GaAs, Perovskite, CIGS—Copper Indium Gallium Selenide, CuInSe2, organic sensitive dye,) lie outside the linear correlation because those materials display a distinct responsivity compared with silicon. Nevertheless, it is shown that correlation between the real and predicted values is high even for the cases of the dataset where the photovoltaic technology is not silicon-based, proving that the algorithm will work also for these cases, considering the error.
Tables 2 and 3 list the estimated values for PCE and ηopt, respectively, considering 6, 7, or 9 predictors (input variables) for the scenarios where the models were trained with all available data with the outliers, or after the removal of the outliers (see details in the "Methods" section). The error distribution, in the shape of violin plots for the 5 regression models (details in the "Methods" section) provided with different input features are represented in Figs. S2–S5 in Supplementary Information. In each of the scenarios, the data set used to fit the regression models vary because the samples with missing values were discarded. For example, in the baseline model with 6 input features (Ap, Amin, Amax, Ep, Emin, Emax), only the rows with available values for PCE or ηopt, were considered. For the scenarios of 7 (Ap, Amin, Amax, Ep, Emin, Emax, ηyield) or 9 predictors (Ap, Amin, Amax, Ep, Emin, Emax, ηyield, mat0, mat1), only rows with available ηyield and PCE or ηopt were taken. These constraints are due to the supervised approach of learning, where the models are fitted with complete input–output information.
The error-related metrics mean absolute error (MAE) and mean square error (MSE), demonstrate that shallow learning models can estimate PCE and ηopt with a substantial level of accuracy, based only on the absorption and emission features (the scenario with 6 input variables). A marginal positive effect is observed when extra features such as ηyield and the characteristics of the materials (mat0 and mat1) are also included. These results point out the performance is mainly determined by the absorption spectra (overlap with AM1.5) and the emission spectra (overlap with the photovoltaic cell operation range).
Except for the linear regression (the case without outlier removal), the other regression methods converge to similar performance metrics that leverage confidence in the obtained results. Since PCE values range in (0–10) intervals (see Fig. 2k), MAE around 1 or less than 1, for most of the scenarios, corresponds to a maximum 10% error in the estimation of PCE which is rather an acceptable result considering that the estimation is based on standard photoluminescent readily accessible measurements. Similar conclusions can be drawn regarding the ηopt ranging in (0–50) interval (see Fig. 2j), and MAE less than 4, in most of the cases. Though the models demonstrate to be equally competitive, the Random Forest (RF) regressor (with removed outliers) slightly outperforms the other models. Similar results were reported, using RF to estimate the data uncertainty32.
We conclude that training an ensemble of decision trees parallel models on different replicas of the data is favorable to outputting more robust estimation in the presence of data uncertainty. In all models, the estimation accuracy improves by removing the outliers (Figs. S2–S5 in Supplementary Information). However, it should be noted that removing the outliers may hinder the models' ability to estimate high levels of PCE and ηopt because they were truncated. Note that the results in Tables 2 and 3 are concerning the test samples from the validation (10% of the complete dataset) which is a relatively small number of samples. Therefore, the explained variances measured by the Coefficient of determination (R2) metrics do not exhibit high values.
Clustering stage
The K-Means clustering algorithm was used for further validation of the regression models. The goal is to separate the data samples into coherent clusters and estimate the missing PCE and ηopt from the available measurements inside each cluster (see “Methods” section for details). Figure 4 illustrates the lack of unequivocal agreement among various clustering metrics regarding the optimal number of clusters. However, five clusters seem an average between their suggestions and a relevant number for the current dataset size. Hence, K-Means is set to K = 5 clusters. In the first step, each sample is assigned to the closest cluster based on the minimum distance between the sample and K cluster centroids. Next, for the samples whose PCE or ηopt values are unknown, the n closest samples (in our experiments n = 3) with available PCE and ηopt, were identified. The missing PCE and ηopt are then estimated as the median of the available measurements of the closest samples.
Regression vs. clustering
The concordance between the cluster-based estimations (unsupervised approach) and the regression models (supervised approach) is evaluated. Though the regression models demonstrate similar performance, we selected the K-Nearest Neighbor to compare with the K-Means clustering, due to its simplicity, no need for loss function optimization, and low amount of hyperparameters to choose.
Figure 5 depicts the MAE violin plots representing the mean absolute error between the estimation of PCE and ηopt. provided by the KNN regression models and the K-Means clustering. The MSE and R2 metrics for the same scenarios (with and without removed outliers) are also presented. Since PCE values range in (0–10) intervals (Fig. 2k), MAE equal to 0.65 (without removed outliers) or 0.60 (with removed outliers) means around 6% error between the K-means and the KNN estimations, which is overall an acceptable disagreement between the two approaches. Similarly, ηopt values are in the range of (0–50) interval (Fig. 2j), therefore MAE equal to 2.43 (without removed outliers) or 2.95 (with removed outliers) means less than 7% error which is also a very good agreement. Hence, the agreement between the two approaches (regression and clustering) is well demonstrated for the estimation of both PCE and ηopt.
The presented results show that the regression models are naturally a more reliable approach to estimating the targeted quantities particularly when the dataset is relatively small. If the statistical variability of the estimated variable is limited and well covered by the data, clustering is a simple way to validate the regression models and can also be used as a viable alternative. Further to that, our findings suggest that it is always plausible to find a way to augment the data to enhance the generalization capacity of the models.
In this work, the potential of ML techniques to foster materials development and optimization is explored. The research question we pose is how to reliably estimate the optical performance of luminescent materials for BIPV using only easily available information such as conventional photoluminescence and knowledge about the optical active center. We propose two stages of ML-based rational design. During the first stage, the optical performance features (PCE and ηopt) of the dataset of materials are estimated by applying regression models. At this stage, the photoluminescence measurements Ap, Amin, Amax, Ep, Emin, and Emax, are determined as the most suitable predictors. Our finding confirmed that once the most relevant features are properly identified, the choice of the ML model is less critical. The five regression methods exhibited similar performance that further enhanced the confidence in the proposed data-driven approach. The training and the validation of the regression models and the selection of their hyper-parameters followed the standard ML tuning process, namely K-fold cross-validation and the Grid search. In the second stage, we applied the K-means clustering technique to separate the materials into coherent groups and estimate the missing values for PCE and ηopt reinforcing the conclusions derived in the first stage. The major takeaway from this study is that the proposed rational design through the combination of supervised and unsupervised ML techniques is a promising way to speed up the development of new luminescent materials for sunlight harvesting and energy conversion. Combining the photoluminescence features used as inputs with the particularly versatile ML methods, the present work opens new perspectives for developing materials in a large variety of photonic applications beyond photovoltaics, where photoluminescence-related features are crucial (e.g. lighting and sensing).
Methods
Supervised learning with regression models
We aim to establish a reliable relationship between the experimental numerical features and the optical performance, therefore independent regression models were built for ηopt and PCE. The feature importance was also analyzed, and the most discriminative features were defined. Five regression models were constructed and tuned: Linear Regression, K-Nearest Neighbors, Random Forest, Gradient Boosting, and XGBoost (Supplementary Information for details).
The feature distribution in Fig. 2 shows the extreme values, which were not related to measurement errors, so they were considered outliers. Typically, the regression models are negatively influenced by the outliers, therefore two regression analyses were conducted building regression models with (i) all available values (without removal of outliers), and (ii) after filtering outliers using the interquartile range (IQR) method (Supplementary Information for details).
As the dataset contains 14%, 24%, and 41% of missing values for the optical properties ηyield, ηopt, and PCE, respectively25, to establish a baseline, we assessed the predictability of PCE and ηopt using only the six absorption and emission features without missing values (Ap, Amin, Amax, Ep, Emin, and Emax). To cope with the limited size of the dataset, we employed shallow learning models, and the dataset was split into 90% and 10% for training and testing, respectively. The split was adapted for the regression problem, as it divides the target variable into bins and then uses them for the train-test stratification process. This ensures that the resulting split has a data distribution as close as possible to the original dataset, about the target variable. Grid Search was implemented to set up the optimal model hyperparameters and K-fold Cross Validation (CV) was used for the training subset (the K-fold CV was also adapted for regression problems). Since the estimated properties are continuous variables, we chose three evaluation metrics: MAE, MSE, and R2. The MAE measures the average magnitude of the errors in a set of predictions and is calculated as the average absolute difference between the actual values and the predicted values and MSE quantifies the average squared differences between predicted and actual values. The smaller the MAE or the MSE, the better the model is at predicting the outcome. Finally, R2 is a statistical measure that represents how well a regression model fits the data. It measures how much of the variation in the dependent variable can be explained by the independent variables33,34,35.
Clustering approach
Clustering the dataset using only the six numerical input features (Ap, Amin, Amax, Ep, Emin, Emax) was used to analyze if there are patterns in these features that allow clustering the materials with enhanced optical performance (larger ηopt and PCE values).
The first step is to select the relevant number of clusters to understand the data. For that, we applied five different metrics to evaluate the cluster quality: Elbow, Calinski-Harabasz, Davies-Bouldin, Silhouette, and Bayesian information criterion (BIC). Each metric was computed from 1 to 20 clusters. The Elbow method is a heuristic approach to determine the most relevant number of clusters in a data set. It works by plotting the explained variation as a function of the number of clusters and picking the elbow of the curve as the number of clusters to use. The Calinski-Harabasz index rewards clustering in which the cluster centroids are far apart, and the cluster members are close to their respective centroids. The Davies-Bouldin index is defined as the average similarity measure of each cluster with its most similar cluster, where similarity is the ratio of within-cluster distances to between-cluster distances. The Silhouette score calculates how similar an object is to its cluster compared to other clusters. The Silhouette score ranges from − 1 to 1, where a score closer to 1 indicates that the object is well-matched to its cluster and poorly matched to neighboring clusters. The BIC is a metric for model selection among a finite set of models so that the model with the lowest BIC is preferred.
Data availability
The data that support the findings of this study are available at figShare (https://doi.org/10.6084/m9.figshare.24903387.v1).
Code availability
The code that supports the findings of this study is available at GitHub (https://github.com/mariolpantunes/sunlight-harvesting-energy-conversion).
References
Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. NPJ Comput. Mater. 5, 83 (2019).
Maddox, J. Crystals from first principles. Nature 335, 201 (1988).
Wang, A. Y. T. et al. Machine learning for materials scientists: An introductory guide toward best practices. Chem. Mater. 32, 4954–4965 (2020).
Dash, J. K., Sharma, M. & Dalapati, G. K. Chapter 12 - Machine learning approach for materials technologies. In Energy Saving Coating Materials: Design, Process, Implementation and Recent Developments (eds Dalapati, G. K. & Sharma, M.) 285–289 (Elsevier Inc., 2020).
Rajan, K. Materials informatics. Mater. Today 8, 38–45 (2005).
Cai, J. Z., Chu, X., Xu, K., Li, H. B. & Wei, J. Machine learning-driven new material discovery. Nanoscale Adv. 2, 3115–3130 (2020).
Mueller, T., Kusne, A. G. & Ramprasad, R. Machine learning in materials science: Recent progress and emerging applications. In Reviews in Computational Chemistry Vol. 29 (eds Parrill, A. L. & Lipkowitz, K. B.) 186–273 (Wiley, 2016).
Fransson, E., Eriksson, F. & Erhart, P. Efficient construction of linear models in materials modeling and applications to force constant expansions. NPJ Comput. Mater. 6, 135 (2020).
Schottle, M., Tran, T., Oberhofer, H. & Retsch, M. Machine learning enabled image analysis of time-temperature sensing colloidal arrays. Adv. Sci. 10, 2205512 (2023).
Caseiro, A. R. et al. Human umbilical cord blood plasma as an alternative to animal sera for mesenchymal stromal cells in vitro expansion—A multicomponent metabolomic analysis. PloS One 13, e0203936 (2018).
Lourenço, R., Andrade-Campos, A. & Georgieva, P. The use of machine-learning techniques in material constitutive modelling for metal forming processes. Metals 12, 427 (2022).
Behara, S., Rath, S. & Thomas, T. Machine learning (ML) as a tool for phosphor design: A perspective. Mater. Lett. 308, 131061 (2022).
Pilania, G. Machine learning in materials science: From explainable predictions to autonomous design. Comp. Mater. Sci. 193, 110360 (2021).
Ball, P. Materials innovation from quantum to global. Nat. Mater. 21, 962–967 (2022).
Ferreira, R. A. S., Correia, S. F. H., Monguzzi, A., Liu, X. & Meinardi, F. Spectral converters for photovoltaics—What’s ahead. Mater. Today 33, 105–121 (2020).
Meinardi, F., Bruni, F. & Brovelli, S. Luminescent solar concentrators for building-integrated photovoltaics. Nat. Rev. Mater. 2, 17072 (2017).
Weber, W. H. & Lambe, J. Luminescent greenhouse collector for solar radiation. Appl. Opt. 15, 2299–2300 (1976).
Goetzberger, A. & Greubel, W. Solar-energy conversion with fluorescent collectors. Appl. Phys. 14, 123–139 (1977).
Correia, S. F. H. et al. Bio-based solar energy harvesting for onsite mobile optical temperature sensing in smart cities. Adv. Sci. 9, 2104801 (2022).
Correia, S. F. H. et al. Autonomous power temperature sensor based on window-integrated transparent PV using sustainable luminescent carbon dots. Nanoscale Adv. 5, 3428–3438 (2023).
Frias, A. R. et al. Sustainable luminescent solar concentrators based on organic-inorganic hybrids modified with chlorophyll. J. Mater. Chem. A 6, 8712–8723 (2018).
Li, Y. L., Sun, Y. J. & Zhang, Y. C. Luminescent solar concentrators performing under different light conditions. Sol. Energy 188, 1248–1255 (2019).
André, P. S., Dias, L. M. S., Correia, S. F. H., Carneiro Neto, A. N. & Ferreira, R. A. S. Artificial neural networks for predicting optical conversion efficiency in luminescent solar concentrators. Sol. Energy 268, 112290 (2024).
Ferreira, R. A. S. et al. A comprehensive dataset of photonic features on spectral converters in energy harvesting and conversion applications. figshare. Dataset. https://doi.org/10.6084/m9.figshare.24903387.v1 (2023).
Ferreira, R. A. S. et al. A comprehensive dataset of photonic features on spectral converters for energy harvesting. Sci. Data 11, 50 (2024).
Meinardi, F. et al. Large-area luminescent solar concentrators based on “Stokes-shift-engineered” nanocrystals in a mass-polymerized PMMA matrix. Nat. Photonics 8, 392–399 (2014).
Yang, C. C. et al. Impact of Stokes shift on the performance of near-infrared harvesting transparent luminescent solar concentrators. Sci. Rep. 8, 16359 (2018).
Yang, C. H. et al. High-performance near-infrared harvesting transparent luminescent solar concentrators. Adv. Opt. Mater. 8, 1901536 (2020).
Richards, B. S. & Howard, I. A. Luminescent solar concentrators for building integrated photovoltaics: Opportunities and challenges. Energy Environ. Sci. 16, 3214–3239 (2023).
Mazzaro, R. & Vomiero, A. The renaissance of luminescent solar concentrators: the role of inorganic nanomaterials. Adv. Energy Mater. 8, 1801903 (2018).
de Bruin, T. A. & van Sark, W. G. J. H. M. Optimising absorption in luminescent solar concentrators constraint by average visible transmission and color rendering index. Front. Phys. 10, 856799 (2022).
Zviazhynski, B. & Conduit, G. Unveil the unseen: Exploit information hidden in noise. Appl. Intell. 53, 11966–11978 (2023).
Al-Jarrah, O. Y., Yoo, P. D., Muhaidat, S., Karagiannidis, G. K. & Taha, K. Efficient machine learning for big data: A review. Big Data Res. 2, 87–93 (2015).
Antony, P. J., Manujesh, P. & Jnanesh, N. A. In 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (REICT), 69–73 (2016).
Ferguson, A. L. Machine learning and data science in soft materials engineering. J. Phys. Condens. Matter 30, 043002 (2018).
Rondão, R. et al. High-performance near-infrared luminescent solar concentrators. ACS Appl. Mater. Interfaces 9, 12540–12546 (2017).
Correia, S. F. H. et al. Large-area tunable visible-to-near-infrared luminescent solar concentrators. Adv. Sustain. Syst. 2, 1800002 (2018).
Gallagher, S. J., Norton, B. & Eames, P. C. Quantum dot solar concentrators: Electrical conversion efficiencies and comparative concentrating factors of fabricated devices. Sol. Energy 81, 813–821 (2007).
Frias, A. R. et al. Sustainable liquid luminescent solar concentrators. Adv. Sustain. Syst. 3, 1800134 (2019).
Sadeghi, S. et al. Ecofriendly and efficient luminescent solar concentrators based on fluorescent proteins. ACS Appl. Mater. Interfaces 11, 8710–8716 (2019).
Carlos, C. P. A. et al. Environmentally friendly luminescent solar concentrators based on optically efficient and stable green fluorescent protein. Green Chem. 22, 4943–4951 (2020).
Acknowledgements
This work was supported by the European Union-NextGenerationEU, through the National Recovery and Resilience Plan of the Republic of Bulgaria, project N0 BG-RRP-2.004-0005. This work is financed by Portugal 2020 through the European Regional Development Fund (ERDF) in the frame of CENTRO2020 in the scope of the project PLANETa, CENTRO -01-0145-FEDER-181242 and within the scope of the project CICECO-Aveiro Institute of Materials (UIDB/50011/2020 & UIDP/50011/2020), Instituto de Telecomunicações (FCT Ref. UIDB/50008/2020-UIDP/50008/2020), SOLPOWINS—Solar-Powered Smart Windows for Sustainable Buildings (PTDC/CTM-REF/4304/2020), financed by national funds through the FCT/MEC and when appropriate co-financed by FEDER under the PT2020 Partnership through European Regional Development Fund (ERDF) in the frame of Operational Competitiveness and Internationalization Programme (POCI). SFHC thanks FCT (2022.03740.CEECIND/CP1716/CT0006).
Author information
Authors and Affiliations
Contributions
R.A.S.F. and P.S.A. contributed to the research plan determination, acquisition of all data, data analysis, result discussion, and manuscript writing. S.F.H.C. and L.F. contributed to data acquisition, data analysis, and manuscript writing. M.A. and P.G. defined the distribution of structures to serve as training data and made and trained the ML model.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ferreira, R.A.S., Correia, S.F.H., Fu, L. et al. Predicting the efficiency of luminescent solar concentrators for solar energy harvesting using machine learning. Sci Rep 14, 4160 (2024). https://doi.org/10.1038/s41598-024-54657-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-54657-x
Keywords
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.