Machine learning assisted prediction of the Young’s modulus of compositionally complex alloys

Khakurel, Hrishabh; Taufique, M. F. N.; Roy, Ankit; Balasubramanian, Ganesh; Ouyang, Gaoyuan; Cui, Jun; Johnson, Duane D.; Devanathan, Ram

doi:10.1038/s41598-021-96507-0

Download PDF

Article
Open access
Published: 25 August 2021

Machine learning assisted prediction of the Young’s modulus of compositionally complex alloys

Hrishabh Khakurel¹^na1,
M. F. N. Taufique²^na1,
Ankit Roy³,
Ganesh Balasubramanian³,
Gaoyuan Ouyang⁴,
Jun Cui^4,5,
Duane D. Johnson^4,5 &
…
Ram Devanathan²

Scientific Reports volume 11, Article number: 17149 (2021) Cite this article

8086 Accesses
48 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We identify compositionally complex alloys (CCAs) that offer exceptional mechanical properties for elevated temperature applications by employing machine learning (ML) in conjunction with rapid synthesis and testing of alloys for validation to accelerate alloy design. The advantages of this approach are scalability, rapidity, and reasonably accurate predictions. ML tools were implemented to predict Young’s modulus of refractory-based CCAs by employing different ML models. Our results, in conjunction with experimental validation, suggest that average valence electron concentration, the difference in atomic radius, a geometrical parameter λ and melting temperature of the alloys are the key features that determine the Young’s modulus of CCAs and refractory-based CCAs. The Gradient Boosting model provided the best predictive capabilities (mean absolute error of 6.15 GPa) among the models studied. Our approach integrates high-quality validation data from experiments, literature data for training machine-learning models, and feature selection based on physical insights. It opens a new avenue to optimize the desired materials property for different engineering applications.

Design of high bulk moduli high entropy alloys using machine learning

Article Open access 22 November 2023

High Entropy Alloys Mined From Binary Phase Diagrams

Article Open access 29 October 2019

Cluster-formula-embedded machine learning for design of multicomponent β-Ti alloys with low Young’s modulus

Article Open access 21 July 2020

Introduction

The conventional alloying method almost always starts with one or two principal metallic elements and advances by incorporation of different alloying elements to engineer desired mechanical and chemical properties^1,2,3. Therefore, the mechanical and chemical properties of the synthesized alloy remain controlled by the principal elements. For instance, Fe is the principal element in steels, Cu/Zn in brass, Ni/Co in superalloys and Ti in titanium alloys^4,5,6. About 15 years ago, Yeh and Cantor^7,8 introduced a novel alloy concept known as high entropy alloys (HEA) that consist of multiple-principal elements (N = 5 or more elements) in near equiatomic percentages. The increased complexity introduces higher configurational entropy (growing as k_BT NlnN, where T is the temperature) compared to conventional alloys. As the number of elements N increases, the number of pairs grows as ~ N² and raises the probability of favorable pair-driven formation enthalpy, which introduces a complex-chemistry effect (often referred to as a “cocktail effect”). The mixing of multi-principal elements generally introduces four core effects, such as, high mixing entropy, lattice distortions, slow diffusion, and a “cocktail” effect, which result in a simple microstructure and excellent mechanical properties^{9,10,11,12,13}. Further study revealed that several HEAs, such as the Mo_0.5AlNbTa_0.5TiZr system, did not overcome the enthalpic contributions due to comparatively lower configurational entropies and featured the formation of secondary phases instead of just solid solution phases. Therefore, a more preferred terminology for such alloy systems has emerged, with the more general naming and definition called CCAs^14,15 which is the naming convention used throughout this paper.

The number of elemental compositions is much higher in CCAs than that of traditional metallic alloys because CCAs comprise multiple-principal elements¹⁶. Moreover, a broader range of compositional space provides an opportunity to improve mechanical properties, such as Young’s modulus, yield strength, and hardness. However, it is extremely challenging to select the appropriate composition by trial-and-error experiment or intuition¹⁷. Atomistic modeling, such as molecular dynamics (MD), density functional theory (DFT), and thermodynamic modeling have been devoted to study phase stabilization, solidification, and crystallization kinetics of CCAs^{18,19,20,21,22,23,24,25}. These techniques are computationally expensive, challenging to apply to the study of large polycrystalline samples, time consuming, and hence cannot be used on a large scale to narrow down the search space. Moreover, the variety of microstructures gives rise to complex and computationally expensive calculations compared to traditional alloys and hence it is challenging to predict the chemistries and compositions for a target property.

Nowadays, data-driven research and more specifically ML, which is widely used in self driving cars²⁶, image classification²⁷, web-searches²⁸, and fraud detection²⁹, is also employed to solve different challenges in materials science³⁰. For instance, Zhang et al.¹⁹ found that atomic size difference (δ), mixing entropy ($\Delta S_{mix}$) and enthalpy ($\Delta H_{mix}$) are the most important features in phase selection of HEAs. Singh et al.^31,32,33 used high-throughput DFT to predict properties through the chemical ranges and revealed correlations with valence electron concentration (VEC), size-difference (bandwidth) and vacancies. Roy et al.³⁴ proposed that the average melting temperature (T_m) is the most important feature to predict the Young’s modulus of low, medium and high entropy alloys. Recent efforts utilizing ML³⁵ considered two additional features such as, Pauling electronegativity difference and difference in VEC and used a neural network (NN) to predict the phases that form in these CCAs. Thus, different features control each property of the alloy and the importance of features varies from property to property.

Here, we have employed different tree-based ensemble ML models, linear regression ML models, kernel-based ML models to predict the Young’s modulus of CCAs consisting of refractory elements. This work initially identified VEC, average melting temperature and difference in atomic radii as the most important physical properties that control the Young’s modulus of CCAs. The study compared the relative merits of different ML models for a training set of refractory alloy data that was gathered from published literature. The model prediction was then validated against the Young’s modulus measured for 32 new alloys synthesized and tested as part of this work. The findings offer considerable promise for alloy down selection based on ML models validated against high-quality experimental data of known provenance.

Methodology

Training data collection and feature selection

Data on Young's modulus for CCAs were collected from existing literature^34,36,37,38. Two different data sets were used for model training. The first data set contains 154 alloys with a mixture of refractory and non-refractory alloys. The second data set contains 96 refractory alloys of Mo, Nb, Ta, W, mixed with some other elements like Al, Cr and Ni. Both datasets are presented in Tables 1 and 2 in the supplementary section. The goal of using two different data sets (one with a mixture of refractory and non-refractory alloys and the other with only refractory alloys) was to examine the effect of the elemental composition of training data on the reliability of the prediction with respect to experimentally synthesized validation data.

For the features that were used to train the ML models, we calculated 11 feature values of these alloys. These features are listed in Table 1. Past studies have shown that all of these features have a direct effect on the Young’s modulus for any alloy. To obtain these features, we collected data on features identified from domain knowledge, such as Pauling electronegativity, VEC, lattice constant, melting temperature, mixing enthalpy and atomic radii. Then we used Python language scripts to calculate the features mentioned in Table 1.

Table 1 Features of alloys considered in this analysis.

Full size table

To see the association between the features, we examined the Pearson correlation coefficients (PCC). Figure 1 shows the PCC for the mixed alloys data set and for the refractory alloys data set. In the PCC “heatmap”, P = + 1 indicates a strong positive correlation and P = − 1 indicates a strong negative correlation. Figure 1 indicates the absence of any significant correlation amongst any pair of features except $\Delta$a and a_m from Fig. 1a. However, the ML models we considered here can deal with the multicollinearity, and hence this correlation will not have any significant impact on the predictions. Therefore we considered all the features in the model.

Validation data preparation and Young’s modulus measurement

An experimental data set was used to validate the final model predictions. The validation set consisted of 32 alloys in the Mo-based family of refractory CCAs, including Mo, Ta W, Ti, Zr, Al, Cr. The validation alloys used in the study were prepared at Ames Lab Materials Preparation Center in the form of thin metal plates/foils. The alloys (1.5 g each) with selected compositions were synthesized by arc melting using a 32-cavity arc melting system (MTI corp, SP-MAM32). The actual compositions of the alloys after arc melting were quantified by energy dispersive spectroscopy (EDS). The densities of the samples were measured by Archimedes measurement. The arc-melted buttons were then sliced by electrical-discharge machining into near-cylinder shapes (two parallel sides) with thicknesses of ~ 3 mm. The elastic modulus values were measured on the cylinders by the ultrasonic pulse-echo technique using a digital ultrasonic thickness gauge (Olympus, 38DL PLUS).

Machine learning models construction

To predict the Young's modulus, four tree based ensemble methods i.e. Gradient Boosting, Ada Boost, Extreme Gradient Boost (or XGBoost), Random Forest (RF), two linear models i.e. LASSO regression, Ridge regression, two kernel based methods i.e. Gaussian Process Regression and Support Vector Machine (SVM) models were used. These models were trained for the two sets of data separately. Once the data was collected and the feature values were selected for both data sets, the 8 ML models were trained on both the data sets. We obtained 16 models, 8 for the larger data set with both the refractory and non-refractory alloys and 8 for the smaller data set with only refractory alloys. Five-fold cross-validation was used to determine the errors. The cross-validation approach is better than the train-test split approach as it gives more robust estimation of the errors. There exist many good metrics to quantify the predictive strength of the model like root-mean-squared (RMS) error, mean-squared error, mean-absolute error (MAE), and the coefficient of determination R². We chose to use the MAE as our metric as it most closely represents the format of error as reported in most experimental measurements. Additionally, we also reported the R² values for the optimized models.

The errors were minimized by performing hyper-parameter optimization using the grid-search algorithm. This algorithm works by determining the test error for all possible combinations of the supplied hyper-parameter values. Out of all combinations, the one with the least error was selected for our model. Each of the algorithms has a different set of hyper-parameters. Once the best hyper-parameters were selected, the optimized model using those hyperparameters was used to make predictions for our validation set whose Young’s modulus had been experimentally measured. Finally, the uncertainty of the predictions i.e. standard deviations was calculated by Bootstrapping method by resampling 100 times for each case. All of the above-mentioned tasks like cross-validation and grid search were performed using the scikit-learn⁴⁵ library in Python. For our study, we employed all the ML models through the scikit-learn machine learning library for the Python language⁴⁶. The XGBoost model was implemented through the library created by Tianqi Chen⁴⁷.

Results and discussion

Model optimization

The ML models were first trained on both data sets. The hyper-parameters were optimized and then the training and validation error were calculated using five-fold cross-validation. We used these hyper-parameters to construct our final optimized models. The optimized hyperparameters are presented in the supplementary section (Table 3 in the supplementary section). These hyperparameters were used to predict the Young’s modulus for the unseen data i.e. the experimentally synthesized validation data set. The cross-validated MAE and R² values for all the models are presented in Table 2. From Table 2 it is clear that the performance of the Gradient Boosting model is superior to other models both in terms of accuracy (i.e., the MAE is lower and R² is higher than any other models) and robustness (i.e., the standard deviation of cross-validation is lower). Because of this excellent performance, we will discuss the feature importance and prediction of Young’s modulus generated by the Gradient Boosting model.

Table 2 Optimized hyperparameter and cross-validated MAE and R² for both data sets.

Full size table

In our data sets, tree-based ensemble type models perform better than other models to predict Young’s modulus. Ensemble type algorithm showed better performance in other studies to predict materials properties^34,48,49. Ensemble methods are meta algorithms that combine several base models to produce a better predictive model. To decrease variance, a bagging ensemble method can be used and to decrease bias a boosting ensemble method can be used. A boosting method converts weak learners to strong ones^50,51,52. Usually, decision stumps are used as the base weak learners, but this is not always the case. Most Boosting methods build models in a stage-wise fashion and they generalize the model by optimizing an arbitrary differentiable loss function. Boosting methods also help prevent the problem of over-fitting to some extent. Additionally, Boosting methods solve the problems of a non-linear relation between target properties and features and help to deal with the collinearity among the features. Furthermore, most boosting methods provide the feature importance associated with the model. Feature importance is important to conclude which features influence Young’s modulus the most. Boosting methods are affected by the presence of outliers. Hence, it is recommended to perform outlier analysis before training the data.

Feature importance

After training the models on both data sets containing refractory and non-refractory alloys using the optimized hyper-parameters, we determined the feature importance associated with the Gradient Boosting model. Feature importance is simply the score assigned to the features based on how useful they are at predicting a target variable. The feature importance for the larger data set containing both the refractory and non-refractory alloys, and smaller data set only with refractory alloys are presented in Fig. 2a,b, respectively. From feature importance, it is clear that the sequence of the features is not identical for both data sets. However, the smaller training data set showed better prediction accuracy as indicated in Table 2. Hence, we selected the important features generated from the smaller data set presented in Fig. 2b. In the next paragraph, we are going to explain the physical significance of some of the important features for the Young’s modulus of CCAs.

We found that VEC was the most important feature and had importance higher than 0.7. While it is not shown here, it is important to mention that other ML models i.e. XGBoost and RF showed good prediction capabilities and identified the VEC as the most important feature with an importance of more than 0.7. In the elastic limit and at a constant value of Poisson’s ratio, the Young’s modulus is related to the bulk modulus (Eq. 1) and hence we will explain the physics of the Young’s modulus dependence on VEC by exploring the physical relationship between bulk modulus and VEC^53,54.

$$K = \frac{E}{3(1 - 2v)}$$

(1)

Here, K, E and $v$ are the bulk modulus, Young’s modulus and Poisson’s ratio, respectively. Gilman et al.^53,54, reported that materials with higher valence electron density (VED) (valence electrons/unit volume) possess higher bulk modulus. As the number of valence electrons increases, the bulk modulus increases, and it decreases as the atomic size increases. The bulk modulus is determined predominantly by the resistance of the valence electrons to compression. In a metallic system, electrons behave like a dense gas, or liquid, with only a very small amount of viscosity. Hence, the greater the electron density, the more the resistance to compression, and the higher the bulk modulus and the Young’s modulus. For instance, osmium, possesses a VED 17% higher than for diamond and correspondingly exhibits a bulk modulus 4% greater as well^53,54. Though we considered VEC instead of VED in this work, it still follows the upward trend of Young’s modulus both for training and validation data sets with VEC as presented in Fig. 3a,b. Our calculated feature importance indicates that the melting point of alloys, which is an indirect metric of bond strength^34,55, has an impact on Young’s modulus, which generally increases with increasing melting temperature as presented in Fig. 3c,d. The geometrical parameter λ, which is a function of mixing entropy ($\Delta S_{mix}$) and the difference in atomic radii (δ) has a significant impact on Young’s modulus. The δ parameter has an impact on cohesive energy and Young’s modulus increases with increasing cohesive energy^56,57. In our case, we have seen that a lower value of δ results in higher Young’s modulus as presented in Fig. 3e,f. The difference in atomic radius influences the distribution of alloying elements and metallic bond energy. The electronegativity has an impact on the electron density of atoms and the larger value of electronegativity result in a higher Young’s modulus of metallic alloys⁵⁸. Additionally, larger electronegativity differences ($\Delta \chi$) and higher mixing enthalpy ($\Delta H_{mix}$) increases the probability of formation of intermetallic brittle phases, which have lower Young’s modulus. Therefore, these two parameters could play an important role to determine Young’s modulus of CCAs³⁴.

It is important to mention that Roy et al.³⁴ predicted Young’s modulus of low, medium and high entropy alloys composed of 5 elements by employing Gradient Boosting method and found that average melting temperature (T_m) was the most important feature without considering the impact of VEC. Corresponding MAE for their study was 23.59 GPa. In this study, we achieved significantly better performance (MAE = 6.15 GPa) by considering VEC in the feature sets. From the above discussion, we propose that VEC is the most important feature that determines the Young’s modulus of this refractory alloy system. Therefore, it is essential to include VEC as a key parameter in the design of new CCAs with tailored Young’s modulus.

Experimental validation

We finally used the trained Gradient Boosting model to predict Young's modulus of unseen CCAs, which are the experimentally synthesized 32 CCAs mostly composed of Mo–Ta–Ti–W–Zr elements. As the experimental validation alloys are all refractory alloys, we examined how the types of training sets have impact on the prediction of Young’s modulus. When we trained the Gradient Boosting model with larger data set containing both refractory and non-refractory alloys the predictions of the Young’s modulus were significantly off compared to experimentally measured Young’s modulus as presented in Fig. 4a. The predicted value consistently underestimated the experimental value. In contrast, we have achieved excellent predictions when we consider only the refractory alloys to train the Gradient Boosting model as presented in Fig. 4b. Only 2 predictions (alloy numbers 6 and 8) out of 32 alloys are outside of 68.3% confidence interval (± σ, where σ is the standard deviation of each prediction. Table 3 presents the actual value of experimental Young’s modulus, mean prediction of Young’s modulus with the percentage of error and standard deviation when the model was trained with refractory alloys. 26 of the alloys had errors ≤ 5% and a few of the predictions are almost identical compared to experimental values.

Table 3 Predicted Young's modulus with percentage of error and standard deviation from Gradient Boosting model trained with data containing refractory alloys.

Full size table

From Fig. 4 and Table 3 we conclude that the quality of the training data is very important to predict the target property accurately. We have a larger training set (154 alloys) with refractory and non-refractory alloys. On the other hand, we have a smaller training set (96 alloys) only with refractory alloys. Since the training set was more homogeneous for the smaller data set, we achieved better predictions. Moreover, the predicted Young’s modulus followed the trend with the experimental Young’s modulus with some exceptions as presented in Fig. 4b. Therefore, it is not only the size of the training data but also the quality and relevance of the training data that are important for better predictions.

Conclusion

We have presented an approach that uses ML with high throughput experimental synthesis and mechanical testing of alloys to predict the Young’s modulus of CCAs reliably. We conclude that among the eight ML models we used, Gradient Boosting had the best predictive strength. The prediction of Young’s modulus was influenced by the model chosen and by the composition of training data. Our experimental validation set was composed of refractory alloys, and when the models were trained with data containing only refractory alloys, the predictions were closer to the experimental values. This shows that when training ML models to predict characteristics of alloys, it is advantageous to include alloys of similar composition in the training data set. The valence electron concentration is the most important feature governing the Young’s modulus of refractory CCAs and can be used to rapidly screen alloys. Since feature importance also appears to be influenced by the choice of training data set, it is important to choose carefully the training data set based on the type of alloy being studied and validate against high-quality experimental data of known provenance. The integration of experimental synthesis and testing, machine learning, and physics-based interpretation demonstrated in this work holds considerable promise for alloy design and property prediction.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The codes that support the findings of this study are available from the corresponding author upon reasonable request.

References

Huang, S. C. et al. Mechanical properties of zirconium-based random alloys: Alloying elements and composition dependencies. Comput. Mater. Sci. 127, 60–66 (2017).
Article CAS Google Scholar
Inoue, A. et al. Marzouki, development and applications of highly functional Al-based materials by use of metastable phases. Mater. Res. 18, 1414–1425 (2015).
Article CAS Google Scholar
Abdelaziz, M. H., Paradis, M., Samuel, A. M., Doty, H. W. & Samuel, F. H. Effect of aluminum addition on the microstructure, tensile properties, and fractography of cast Mg-based alloys. Ann. Mater. Sci. Eng. 2, 1–10 (2017).
Google Scholar
Schinhammer, M., Hänzi, A. C., Löffler, J. F. & Uggowitzer, P. J. Design strategy for biodegradable Fe-based alloys for medical applications. Acta Biomater. 6, 1705–1713 (2010).
Article CAS PubMed Google Scholar
Long, H., Mao, S., Liu, Y., Zhang, Z. & Han, X. Microstructural and compositional design of Ni-based single crystalline superalloys—A review. J. Alloy. Compd. 743, 203–220 (2018).
Article CAS Google Scholar
Hayama, A. O. F. et al. Effects of composition and heat treatment on the mechanical behavior of Ti–Cu alloys. Mater. Des. 55, 1006–1013 (2014).
Article CAS Google Scholar
Yeh, J. W. et al. Nanostructured highentropy alloys with multiple principal elements: Novel alloy design concepts and outcomes. Adv. Eng. Mater. 6, 299–303 (2004).
Article CAS Google Scholar
Cantor, B., Chang, I. T. H., Knight, P. & Vincent, A. J. B. Microstructural development in equiatomic multicomponent alloys. Mater. Sci. Eng. A 375–377, 213–218 (2004).
Article CAS Google Scholar
Yim, D. & Kim, H. S. Fabrication of the high-entropy alloys and recent research trends: A review. Korean J. Met. Mater. 55, 671–683 (2017).
CAS Google Scholar
Ren, B. et al. Corrosion behavior of CuCrFeNiMn high entropy alloy system in 1 M sulfuric acid solution. Mater. Corros. 63, 828–834 (2012).
Article ADS CAS Google Scholar
Kang, Y. B., Shim, S. H., Lee, K. H. & Hong, S. I. Dislocation creep behavior of CoCrFeMnNi high entropy alloy at intermediate temperatures. Mater. Res. Lett. 6, 689–695 (2018).
Article CAS Google Scholar
Fu, Z. Q., MacDonald, B. E. & Monson, T. C. Influence of heat treatment on microstructure, mechanical behavior, and soft magnetic properties in an fcc-based Fe29Co28Ni29Cu7Ti7 high-entropy alloy. J. Mater. Res. 33, 2214–2222 (2018).
Article ADS CAS Google Scholar
Tikhonovsky, M. A., Salishchev, G. A., Yurchenko, N. Y., Stepanov, N. D. & Zherebtsov, S. V. Aging behavior of the HfNbTaTiZr high entropy alloy. Mater. Lett. 211, 87–90 (2018).
Article CAS Google Scholar
Qiu, Y. et al. A lightweight single-phase AlTiVCr compositionally complex alloy. Acta Mater. 123, 115–124 (2017).
Article ADS CAS Google Scholar
Jensen, J. K. et al. Characterization of the microstructure of the compositionally complex alloy Al1Mo0.5Nb1Ta0.5Ti1Zr1. Scr. Mater. 121, 1–4 (2016).
Article CAS Google Scholar
Ye, Y. F., Wang, Q., Lu, J., Liu, C. T. & Yang, Y. High-entropy alloy: Challenges and prospects. Mater. Today 19, 349–362 (2016).
Article CAS Google Scholar
Miracle, D. B. & Senkov, O. N. A critical review of high entropy alloys and related concepts. Acta Mater. 122, 448–511 (2017).
Article ADS CAS Google Scholar
Ma, D., Grabowski, B., Körmann, F., Neugebauer, J. & Raabe, D. Ab initio, thermodynamics of the CoCrFeMnNi high entropy alloy: Importance of entropy contributions beyond the configurational one. Acta Mater. 100, 90–97 (2015).
Article ADS CAS Google Scholar
Zhang, C., Zhang, F., Chen, S. & Cao, W. Computational thermodynamics aided high-entropy alloy design. J. Occup. Med. 64, 839–845 (2012).
CAS Google Scholar
Jiang, C. & Uberuaga, B. P. Efficient ab initio modeling of random multicomponent alloys. Phys. Rev. Lett. 116, 105501 (2016).
Article ADS PubMed CAS Google Scholar
Saal, J. E., Berglund, I. S., Sebastian, J. T., Liaw, P. K. & Olson, G. B. Equilibrium high entropy alloy phase stability from experiments and thermodynamic modeling. Scr. Mater. 146, 5–8 (2017).
Article CAS Google Scholar
Lederer, Y., Toher, C., Vecchio, K. S. & Curtarolo, S. The search for high entropy alloys: A high-throughput ab-initio approach. Acta Mater. 159, 364–383 (2018).
Article ADS CAS Google Scholar
Sanchez, J. M., Vicario, I., Albizuri, J., Guraya, T. & Garcia, J. C. Phase prediction, microstructure and highhardness of novel light-weight high entropy alloys. J. Mater. Res. Technol. 424, 1–9 (2018).
Google Scholar
Tapia, A. J. S. F., Yim, D., Kim, H. S. & Lee, B. J. An approach for screening single phase high-entropy alloys using an inhouse thermodynamic database. Intermetallics 101, 56–63 (2018).
Article CAS Google Scholar
Senkov, O. N., Miller, J. D., Miracle, D. B. & Woodward, C. Accelerated exploration of multiprincipal element alloys with solid solution phases. Nat. Commun. 6, 6529 (2015).
Article ADS CAS PubMed Google Scholar
Bojarski, M. et al. End to end learning for self-driving cars. Preprint at arXiv:1604.07316 (2016).
He, K., Zhang, X., Ren, S. & Sun, J. Delving deep into rectifiers: Surpassing humanlevel performance on ImageNet classification. In 2015 IEEE International Conference on Computer Vision (ICCV) (eds Bajcsy, R. & Hager, G.) 1026–1034 (IEEE, 2015).
Chapter Google Scholar
Pazzani, M. & Billsus, D. Learning and revising user profiles: The identification of interesting web sites. Mach. Learn. 27, 313–331 (1997).
Article Google Scholar
Chan, P. K. & Stolfo, S. J. Toward scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection. In KDD’98 Proc. Fourth International Conference on Knowledge Discovery and Data Mining (eds Agrawal, R. et al.) 164–168 (AAAI Press, 1998).
Google Scholar
Rickman, J. M., Balasubramanian, G., Marvel, C. J., Chan, H. M. & Burton, M.-T. Machine learning strategies for high-entropy alloys. J. Appl. Phys. 128, 221101 (2020).
Article ADS CAS Google Scholar
Singh, P. et al. Design of high-strength refractory complex solid-solution alloys. npj Comput. Mater. 4, 16 (2018).
Article ADS CAS Google Scholar
Singh, P., Smirnov, A. V., Alam, A. & Johnson, D. D. First-principles prediction of incipient order in arbitrary high-entropy alloys: Exemplified in Ti0.25CrFeNiAlx. Acta Mater. 189, 248–254 (2020).
Article ADS CAS Google Scholar
Singh, P. et al. Vacancy-mediated complex phase selection in high entropy alloys. Acta Mater. 194, 540–546 (2020).
Article ADS CAS Google Scholar
Roy, A., Babuska, T., Krick, B. & Balasubramanian, G. Machine learned feature identification for predicting phase and Young’s modulus of low-, medium- and high-entropy alloys. Scr. Mater. 185, 152–158 (2020).
Article CAS Google Scholar
Islam, N., Huang, W. & Zhuang, H. L. Machine learning for phase selection in multi-principal element alloys. Comput. Mater. Sci. 150, 230–235 (2018).
Article CAS Google Scholar
Senkov, O., Miracle, D., Chaput, K. & Couzinie, J. Development and exploration of refractory high entropy alloys—A review. J. Mater. Res. 33, 3092–3128 (2018).
Article ADS CAS Google Scholar
Li, W., Liu, P. & Liaw, P. K. Microstructures and properties of high-entropy alloy films and coatings: A review. Mater. Res. Lett. 6(4), 199–229 (2018).
Article CAS Google Scholar
Couzinié, J.-P., Senkov, O. N., Miracle, D. B. & Dirras, G. Comprehensive data compilation on the mechanical properties of refractory high-entropy alloys. Data Brief. 21, 1622–1641 (2018).
Article PubMed PubMed Central Google Scholar
Fang, S., Xiao, X., Xia, L., Li, W. & Dong, Y. Relationship between the widths of supercooled liquid regions and bond parameters of Mg-based bulk metallic glasses. J. Non-Cryst. Solids 321, 120–125 (2003).
Article ADS CAS Google Scholar
Guo, S., Ng, C., Lu, J. & Liu, C. T. Effect of valence electron concentration on stability of fcc or bcc phase in high entropy alloys. J. Appl. Phys. 109, 103505 (2011).
Article ADS CAS Google Scholar
Takeuchi, A. & Inoue, A. Classification of bulk metallic glasses by atomic size difference, heat of mixing and period of constituent elements and its application to characterization of the main alloying element. Mater. Trans. 46, 2817–2829 (2005).
Article CAS Google Scholar
Yang, X. & Zhang, Y. Prediction of high-entropy stabilized solid-solution in multi-component alloys. Mater. Chem. Phys. 132, 233–238 (2012).
Article CAS Google Scholar
Singh, A. K., Kumar, N., Dwivedi, A. & Subramaniam, A. A geometrical parameter for the formation of disordered solid solutions in multi-component alloys. Intermetallics 53, 112–119 (2014).
Article CAS Google Scholar
Senkov, O. N., Wilks, G. B., Miracle, D. B., Chuang, C. P. & Liaw, P. K. Refractory high-entropy alloys. Intermetallics 18, 1758–1765 (2010).
Article CAS Google Scholar
Breiman, L. Arcing The Edge. Technical Report 486. Statistics Department, University of California, Berkeley (1997).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Tianqi, C. & Carlos, G. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794 (2016).
Mamun, O. et al. A machine learning aided interpretable model for rupture strength prediction in Fe-based martensitic and austenitic alloys. Sci. Rep. 11, 5466 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Mamun, O. et al. Machine learning augmented predictive and generative model for rupture life in ferritic and austenitic steels. npj Mater. Degrad. 5, 20 (2021).
Article CAS Google Scholar
Schapire, R. E. The strength of weak learnability. Mach. Learn. 5, 197–227 (1990).
Article Google Scholar
Friedman, J. H. Greedy function approximation: A gradient boosting machine (PDF). Ann. Stat. 29, 1189–1232 (2001).
Article MATH Google Scholar
Freund, Y. & Schapire, R. E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139 (1997).
Article MathSciNet MATH Google Scholar
Gilman, J. J. Electronic Basis of the Strength of Materials, Chapter 12 (Cambridge University Press, 2003).
Google Scholar
Gilman, J. J., Cumberland, R. W. & Kaner, R. B. Design of hard crystals. Int. J. Refract. Met. Hard Mater. 24, 1–5 (2006).
Article CAS Google Scholar
Rickman, J. M. Data analytics and parallel-coordinate materials property charts. npj Comput. Mater. 4, 5 (2018).
Article ADS CAS Google Scholar
Roy, A. et al. Lattice distortion as an estimator of solid solution strengthening in high-entropy alloys. Mater. Charact. 172, 110877 (2021).
Article CAS Google Scholar
Pettifor, D. G. Electron theory of metals. In Physical Metallurgy Vol. 73 (eds Cahn, R. W. & Haasen, P.) (Elsevier, 1983).
Google Scholar
Li, K., Kang, C. & Xue, D. Electronegativity calculation of bulk modulus and band gap of ternary ZnO-based alloys. Mater. Res. Bull. 47, 2902–2905 (2012).
Article CAS Google Scholar

Download references

Acknowledgements

This effort was principally supported by the U.S. Department of Energy's (DOE) Office of Energy Efficiency and Renewable Energy (EERE) under the Advanced Manufacturing Office (Project WBS 2.1.0.19) through Ames Laboratory, which is operated for the U.S. DOE by Iowa State University under contract DE-AC02-07CH11358. HK was supported in part through the National Science Foundation (NSF) Mathematical Sciences Graduate Internship (MSGI) Program sponsored by the NSF Division of Mathematical Sciences. This program is administered by the Oak Ridge Institute for Science and Education (ORISE) through an interagency agreement between the U.S. Department of Energy (DOE) and NSF. ORISE is managed for DOE by ORAU. This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of its employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

Author information

These authors contributed equally: Hrishabh Khakurel and M. F. N. Taufique.

Authors and Affiliations

Department of Mathematics, The University of Texas at Arlington, Arlington, TX, 76019, USA
Hrishabh Khakurel
Pacific Northwest National Laboratory, Richland, WA, 99354, USA
M. F. N. Taufique & Ram Devanathan
Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, PA18015, USA
Ankit Roy & Ganesh Balasubramanian
Ames Laboratory, United States Department of Energy, Ames, IA, 50011, USA
Gaoyuan Ouyang, Jun Cui & Duane D. Johnson
Department of Materials Science and Engineering, Iowa State University, Ames, IA, 50011, USA
Jun Cui & Duane D. Johnson

Authors

Hrishabh Khakurel
View author publications
You can also search for this author in PubMed Google Scholar
M. F. N. Taufique
View author publications
You can also search for this author in PubMed Google Scholar
Ankit Roy
View author publications
You can also search for this author in PubMed Google Scholar
Ganesh Balasubramanian
View author publications
You can also search for this author in PubMed Google Scholar
Gaoyuan Ouyang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Cui
View author publications
You can also search for this author in PubMed Google Scholar
Duane D. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Ram Devanathan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.K. and M.F.N.T. contributed equally to this work as first authors. They performed the dataset construction, data analysis and wrote the manuscript. G.O. synthesized and characterized the validation data set. A.R, G.B., G.O., J.C., and D.J. oversaw results and discussion and reviewed the manuscript. R.D. provided technical expertise to CCA data, extracted data from research articles, oversaw the results and reviewed the manuscript.

Corresponding author

Correspondence to M. F. N. Taufique.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Khakurel, H., Taufique, M.F.N., Roy, A. et al. Machine learning assisted prediction of the Young’s modulus of compositionally complex alloys. Sci Rep 11, 17149 (2021). https://doi.org/10.1038/s41598-021-96507-0

Download citation

Received: 04 January 2021
Accepted: 05 July 2021
Published: 25 August 2021
DOI: https://doi.org/10.1038/s41598-021-96507-0

This article is cited by

Finite Element Analysis and Machine Learning Guided Design of Carbon Fiber Organosheet-Based Battery Enclosures for Crashworthiness
- Shadab Anwar Shaikh
- M. F. N. Taufique
- Ayoub Soulami
Applied Composite Materials (2024)
Design of refractory multi-principal-element alloys for high-temperature applications
- Gaoyuan Ouyang
- Prashant Singh
- Jun Cui
npj Computational Materials (2023)
First principles-based design of lightweight high entropy alloys
- Viacheslav Sorkin
- Zhi Gen Yu
- Yong-Wei Zhang
Scientific Reports (2023)
Probabilistic analysis of a cantilever beam subjected to random loads via probability density functions
- Juan-Carlos Cortés
- Elena López-Navarro
- María-Dolores Roselló
Computational and Applied Mathematics (2023)
Machine-learning-guided descriptor selection for predicting corrosion resistance in multi-principal element alloys
- Ankit Roy
- M. F. N. Taufique
- Ganesh Balasubramanian
npj Materials Degradation (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methodology

Training data collection and feature selection

Validation data preparation and Young’s modulus measurement

Machine learning models construction

Results and discussion

Model optimization

Feature importance

Experimental validation

Conclusion

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links