A predictive machine learning approach for microstructure optimization and materials design

Liu, Ruoqian; Kumar, Abhishek; Chen, Zhengzhang; Agrawal, Ankit; Sundararaghavan, Veera; Choudhary, Alok

doi:10.1038/srep11551

Download PDF

Article
Open access
Published: 23 June 2015

A predictive machine learning approach for microstructure optimization and materials design

Ruoqian Liu¹,
Abhishek Kumar²,
Zhengzhang Chen¹^nAff4,
Ankit Agrawal¹,
Veera Sundararaghavan³ &
…
Alok Choudhary¹

Scientific Reports volume 5, Article number: 11551 (2015) Cite this article

18k Accesses
162 Citations
5 Altmetric
Metrics details

Subjects

Abstract

This paper addresses an important materials engineering question: How can one identify the complete space (or as much of it as possible) of microstructures that are theoretically predicted to yield the desired combination of properties demanded by a selected application? We present a problem involving design of magnetoelastic Fe-Ga alloy microstructure for enhanced elastic, plastic and magnetostrictive properties. While theoretical models for computing properties given the microstructure are known for this alloy, inversion of these relationships to obtain microstructures that lead to desired properties is challenging, primarily due to the high dimensionality of microstructure space, multi-objective design requirement and non-uniqueness of solutions. These challenges render traditional search-based optimization methods incompetent in terms of both searching efficiency and result optimality. In this paper, a route to address these challenges using a machine learning methodology is proposed. A systematic framework consisting of random data generation, feature selection and classification algorithms is developed. Experiments with five design problems that involve identification of microstructures that satisfy both linear and nonlinear property constraints show that our framework outperforms traditional optimization methods with the average running time reduced by as much as 80% and with optimality that would not be achieved otherwise.

Self-supervised optimization of random material microstructures in the small-data regime

Article Open access 21 March 2022

A machine-learning-based alloy design platform that enables both forward and inverse predictions for thermo-mechanically controlled processed (TMCP) steel alloys

Article Open access 26 May 2021

Coupling physics in machine learning to predict properties of high-temperatures alloys

Article Open access 18 September 2020

Introduction

Material selection has traditionally been carried out with property cross-plots, a graphical representation of material–property–performance relationships popularized by Ashby¹. For example, density versus strength plot of different materials can be used by an aircraft engineer to find that for the same strength requirement, a titanium alloy weighs much less than a steel alloy. The disadvantage of chart- or plot-based selection is that only materials with known property are included. It is common knowledge that even within the selected alloy system, microstructural variability leads to a large range of material properties. We are currently moving towards a new paradigm of microstructure–sensitive design² where it is crucial to identify the very microstructure within an alloy system, out of innumerable candidates, that leads to a desired property or a combination of desired properties. Mathematical search-based designs explore the hypothetically infinite space and offers the freedom of engineering unknown microstructures. But the efficiency of mathematical search deteriorates quickly as the candidate space grows, with a microstructure represented by hundreds even thousands of dimensions traditional searches for microstructure design can be slow. In this paper, we explore advanced data oriented techniques to enhance the mathematical search by statistical heuristics, for the purpose of fast and accurate microstructure optimization for a recently discovered alloy, Galfenol.

Galfenol has been shown to exhibit magnetostrictive strains up to 400 ppm in single crystal form (more than 10 times that of α-Fe). When a magnetic field is applied to Galfenol single crystal, the boundaries between the magnetic domains shift and rotate, both of which cause a change in the material’s dimensions. This behavior, termed magnetostriction, has been used to transduce magnetic field to mechanical force in micro–scale sensors and actuators^3,4,5,6. While single crystals of Galfenol provide large magnetostriction, their preparation is expensive. Thus, development of polycrystalline Galfenol with favorable properties for various applications^7,8,9 is desirable. A combination of high stiffness, magnetostrictive strains and yield strength is optimal for use in a cantilever beam device that can be used to generate sonar waves (as actuators), measure vibrations (as sensors) or generate electricity (as energy harvesting devices).

While theoretical models for computing properties given the microstructure are known for this alloy, inversion of these relationships to obtain microstructures that lead to desired properties is challenging. The microstructure design of polycrystalline Galfenol can be performed by tailoring the distribution of various crystal orientations (‘the orientation distribution function (ODF)’) in the microstructure (Fig. 1a ¹⁰). The structural optimization is carried out along different crystallographic directions to attain favorable properties. The multiple crystallographic directions embedded in the multi-dimensional ODF are used as control variables and the theoretical functions for properties are the objective. The main challenge is to address the following three issues,

(a) High dimensionality: to search in a hypothetically unlimited space of all possible crystal orientation distributions and converge within a reasonable time.

(b) Multi-objective: to optimize under a requirement of multiple extremal properties that are often in conflict.

(c) Solution completeness: to identify the complete space (or as much of it as possible) of microstructures when more than one solution exists that produce the same optimal property.

Very few published works in literature discuss such design problems. Significant contributions in this area include¹¹ where the authors design an ODF that maximizes the deflection of a beam without plastically deforming it. In¹², the authors design a plate with a circular hole subjected to an in-plane tensile load so as to maximize the load carrying capacity while avoiding plastic deformation. These analyses employ a reduced spectral series representation of the texture that lies in a significantly smaller search space. However, much information in the texture is lost in the reduced representation and the optimization search is done rather manually, by numerically interpolating the microstructure space to find the location of best performance. Traditional optimization techniques used to search of the answer lead to an unique microstructural solution, rather than the complete space of optimal microstructures. Multiple solutions are favored in the sense that traditional low–cost manufacturing processes such as forming and heat treatment can only generate a limited set of microstructures^7,8,9 and a single design solution may not be economically feasible to manufacture¹⁰. Other optimization techniques that may lead to multiple solutions (such as combinatorial search methods and evolutionary methods) have been explored in materials selection and structural optimization design^13,14,15,16. However, these methods are often prohibited by the high dimensionality of search space (curse of dimensionality¹⁷).

Herein, we propose the employment of modern machine learning (ML) techniques as a tool to explore multiple design solutions and diminished searching time in high dimensional microstructure design problems, where the number of distinct design candidates is indeed infinite. Two crucial ML steps, namely, search path refinement and search space reduction, are designed to develop heuristics that tour the search force to a much smaller preferable space. As the diagram in Fig. 2 suggests, the ML method (bottom route) has these two steps (marked as 2 and 3) executed laterally, after a data preparation step (marked as 1) that precedes. The three steps supplement a traditional direct-search method (top route) by performing a search space preprocessing, before the actual search goes into action. Such a ML-based preprocessing is designed to locate critical regions of a search space with a small overhead, so that the search force can be consciously concentrated.

Results

Design Problems

We start with a spectral representation of the microstructure and its relevant statistics and present five polycrystalline alloy design problems, each with a different property objective to optimize. The properties are either a singular or a composite of the following: Young modulus (E), yield strength (Y) and magnetostrictive strain (m_s). For sensor applications where Galfenol is used in the form of compliant beams, E is inversely proportional to deflection, so a lower modulus results in higher deflection for the same applied stress. Higher yield strength Y will increase the load bearing capacity of the structure and it is our objective to maximize Y. A higher value of magnetostrictive strain (m_s) will enhance the capability of material to act as a sensor material.

Two additional composite functions are designed to express the need of a set of properties balanced for peak performance, fulfilling the requirement of multi-objective design. Composite function F₁ = Y ⋅ m_s/E is maximized to search for a microstructure with a low modulus and high value for strength and magnetostrictive strain. Another function F₂ is given by a combination of two desired properties, stiffness component = 274.94 GPa and magnetostrictive strain m_s = 99.53 ppm. A weighted Gaussian function of the form F₂ = 10exp(−3( − 274.94)² − 2(m_s − 99.53)²) is optimized such that the maximum value of F₂ = 10 is attained at the desired property values.

Property Calculation

The orientation distribution function (ODF), the probability density function for orientations, is applied for the quantification of crystallographic texture of Galfenol, as seen in Fig. 1. An axis-angle parametrization of the orientation space proposed by Rodrigues has been used. This is based on the unique association of an orientation with a rotation axis n and an angle of rotation θ about the axis such that r = . Relationship of r with the standard rotation matrix that maps the sample axes to the crystal axes is given in Section 2 of¹⁰. The ODF (in Fig. 1b) denoted by represents the volume density of crystals with orientation r. If the orientation-dependent property for a single crystal χ(r) is known, any polycrystal property can be expressed as an expectation value or average given by:

Polycrystal moduli calculation

Values of elastic parameters for BCC Galfenol crystal are C₁₁ = 213.0 GPa, C₁₂ = 174 GPa, C₄₄ = 120 GPa. The polycrystal stiffness, , is computed through a weighted average (over ) of the stiffness of individual crystals expressed in the sample reference frame. The elastic modulus (along x-axis) is computed through this polycrystal stiffness as¹⁸:

Magnetostrictive strain calculation

Magnetostrictive strain in Galfenol single crystals is specified using two independent parameters, λ₁₀₀ and λ₁₁₁, which characterize the changes in normal strain along the and direction resulting from the rotation of a magnetization state into these directions. The magnetostrictive strain tensor for a crystal with magnetization direction given by the unit vector m = (m_x, m_y, m_z) (in the crystal coordinate system) is then stated by the following expression:

We use the model from Armstrong¹⁹ which represents the free energy as a sum of internal and external energy terms. The following form of internal energy is taken:

The simple form for E_I used here ensures that a domain in the crystal has minimal and maximal energies when oriented, respectively, along the directions (easy direction) and the family (hard directions). Application of an external magnetic field leads to an energy change in energy proportional to the intensity of the magnetic field, H, the magnetization of the domain, M and the direction between them. The direction of the applied magnetic field is represented as n = (n_x, n_y, n_z) in the crystal coordinate system.

The energy contribution (per unit volume) associated with the interaction of externally applied stress tensor (σ in crystal coordinate frame) with magnetostrictive strains is given as:

In an ideal crystal without defects (at T = 0K), the domain would align in the direction of minimal energy. However, domain magnetization is expected to follow a Boltzmann–like distribution at higher temperatures due to an increase in entropy. The probability, P, that the magnetization direction is equal to m is given as:

The parameter Ω represents the spread of the magnetization direction from the ideal direction (of minimal energy). The magnetostriction strain tensor ε is obtained by averaging the strains over the probability density of magnetization in the crystal.

The above integral is calculated by sampling over all possible magnetization directions (ie. all points on a unit sphere). The strain tensor ε in the crystal coordinate system is rotated back to the sample coordinate frame ε_s(r) = R^TεR. The overall strain tensor is obtained by averaging ε_s(r) over the ODF using equation (1). The magnetostrictive strain that is optimized in our machine learning approach is the polycrystal averaged strain along the z–direction as measured with respect to an initial unstressed crystal.

For single crystal Galfenol, the various parameters used are as follows: K₁ = 3.6e4Jm⁻³, λ₁₀₀ = 170 ppm, λ₁₁₁ = −4.67 ppm, M = 1.83/μ₀Am⁻¹ (μ₀ = 4π × 10⁻⁷) and Ω = 625Jm⁻³ calibrated in⁶. In the examples, we apply the magnetic field of −500 Oe and a compressive pre-stress of 80 MPa, both along the [001] sample direction. At these values, the single crystals achieve a saturation magnetostrictive strain that compares well with the model in⁶. In the examples, the magnetostrictive strain (m_s) along the [001] sample direction is optimized. Note that magnetostrictive strain has significant anisotropy, with the saturation strains along [012] direction being 25% lower than that of the [001].

Yield strength calculation

The crystal plasticity model described in Method section and also in¹⁰ is used to calculate the yield strength at all nodal points in the fundamental region. The model adequately captures the macroscopic tensile mode stress-strain response at room temperature reported in²⁰ as shown in Fig. 3(b). To further validate the microstructural model, we compared the crystallographic textures seen in BCC iron rolling processes and textures predicted by our model. Fig. 3(a) shows that the model captures both α and γ texture that arise from rolling of BCC metals (experimental result from²¹). The strength (Y(r)) at orientation r is found as the offset z–stress resulting from an applied z–strain of 0.2% under the following velocity gradient¹⁸:

The overall yield strength is obtained by averaging Y(r) over the ODF using equation (1).

The single crystal properties for elastic modulus, magnetostrictive strains and the yield strength obtained from the above analyses can be visualized on the ODF mesh in Rodrigues space. The plots shown in Fig. 4(left) depict the surface contours with internal slices of the ODF shown alongside in Fig. 4(right). The single crystals with maximum and minimum properties and their locations can be seen directly from these plots. For example, the single crystals with maximum magnetostrictive strains are all located along the z–axis of the Rodrigues space as seen in Fig. 4(a) (right). This corresponds to the z–axis fiber in which the crystal direction of easy magnetization ([001]) is aligned along the measurement axis (sample z–axis).

Optimization Result Analysis

With the five design problems, the ML-based optimization approach, essential procedures shown in Fig. 2 (bottom route) and each step further explained in the Method section, develops heuristics that guide the search into most promising areas. To demonstrate its superiority at optimality, time efficiency and solution completeness, we compare with three baseline methods, all of them evaluated using three criteria: 1) the optimality, or goodness or property, determined by the sheer value of the property obtained; 2) the efficiency, judged by the time taken for obtaining the result under the same computational environment, the lower the better; and 3) the completeness of solutions, in cases where there exist multiple solutions of structures that produce the same best property, is reflected by the number of distinct answers generated, normalized by the most answers found thus far.

The three baseline methods to compare with are: 1) an exhaustive search (eSearch) containing 1 million random searches, 2) a generalized pattern search²², which can be considered as a smarter-than-random guided search (gSearch), 3) a traditional optimization algorithm, specifically the linear programming (LP) is used for linear problems and genetic algorithms (GA) for nonlinear problems. In the exhaustive search, to ensure the constraint satisfaction the Random Interval (details in Method section under random data generation) method is used to generate 1 million realizations of i.i.d. ODF values. The guided search, however, should require far less iterations and we randomize 100 initial starting positions of ODF and for each iteration, the search goes until a local optima is found.

Table 1 shows optimal values of each design function obtained by the aforementioned methods. As we can deduce from this comparison, for linear properties as Y and m_s, LP always provides a valid solution. It is the nonlinear problems as E, F₁ and F₂ that pose a challenge for traditional optimization methods. Exhaustive search gives largely unstable results; one almost has to rely on pure luck to bet on a fair result. Guided search has a tendency to get stuck at local optima, which can sometimes be even worse than the naive exhaustive search (although the running time is much less). Surprisingly, GA has worked poorly on nonlinear problems (we also tried GA on linear problems and the answer is never as good as LP). In fact, for nonlinear problems the guided search is often a better choice than GA. ML methods have been proven successful achieving the best answers throughout all problems, linear and nonlinear.

Table 1 Summary of optimal values for all design problems and methods.

Full size table

We use visualization bars, shown in Fig. 7, to illustrate the performance of different methods in a standardized manner in terms of all three criteria. Figure 7(left) describes the normalized level of optimality achieved by each method (shown by vertical color sticks) for each problem (each grey box). The highest level (rightmost) is defined by the best solution found among five methods and the lowest level (leftmost) represents the worst answer among the five. The position in between is at linear scale. Therefore how rightward the positions of color sticks represent how good the method is regarding finding the optimum answer.

The minimum of Young’s modulus (E^opt) obtained is 85.9878 GPa. Although the property is nonlinear with respect to the ODF values, multiple ODF solutions were not found. The optimal ODF was found to be a single-crystal with (011)[100] orientation, shown in Fig. 5(a). As is evident from the plots, the extremal ODF corresponds to a unique orientation (and symmetric equivalents, if any). Although the ODF ideally is a Dirac delta function at this orientation, a finite ODF value is plotted due to the use of a finite element discretization. In addition, the use of finite elements implies that this is not the exact minimum unless there is a node exactly at the location of the optimal orientation. However, as¹⁰ pointed out, one can achieve the property bounds accurately with finer discretizations of the Rodrigues space.

Since yield stress (Y^opt) is a linear property, the extremal values correspond to single crystals. In our analysis, the maximum value was found to be 353.11 MPa which is equal for two single crystals, close to the orientation as shown in Fig. 5(b). Similar analysis to find the ODF with maximal magnetostrictive strain () revealed a single-crystal at the orientation with field induced strain of 1.5498e−04. Note that crystal z–axis is the direction of easy magnetization and of the largest saturation magnetostriction in single crystal galfenol and the optimal crystal, as expected, has this direction aligned with the direction of strain measurement. Figure 5(c) shows the corresponding ODF.

The optimum microstructure for objective function F₁ was also a single-crystal. For problem F₂, however, we get 26 different ODFs which give the same maximum value of . ODF plots for all these solutions are shown in Fig. 6. Most distinctively, these 26 solutions are all polycrystal, taking values on every one of the crystalline orientations. In this sense, the ML method manages to discover complex and irregular compositions of microstructure in the search for extreme values.

A running time comparison is made in Table 2. For these problems LP can finish within 1 second and each round of GA normally takes 2 to 3 seconds. LP and GA (with 100 restarts and takes about 200 seconds) are shown in Fig. 7(middle) although not included in Table 2. eSearch conducts 1 million random searches along with 1 million times of function evaluation and gSearch takes 100 iterations each taking a different random initial point. Fig. 7(middle) illustrates the normalized level of time efficiency produced by each method for each problem. The highest level (rightmost) is defined by the one using the least time in producing its final answer and the lowest level (leftmost) represents the longest time taken among the five. Since the time varies among methods from a couple of seconds tens of thousands of seconds, the position in between is in a logarithmic scale. eSearch, as imagined, takes the longest time to finish since it explores randomly the space without a proper guidance. ML takes less than 5% of time consumed by eSearch. Compared with gSearch, ML reduced respectively 84.94%, 88.05%, 83.67%, 72.62% and 78.82% of the running time. The average reduction is 81.62%. The 4-th step of ML framework is the exact same gSearch with reduced variables; this implies that the reduced time comes from the reduced searching space.

Table 2 Summary of running times (s) for all design problems and methods.

Full size table

Last but not least, the completeness of solution is an important gauge of optimization design. Two of our design problems, the optimization of Y and F₂, appear to have multiple solutions that lead to identical optimum answers. The completeness of solutions for each method regarding these two problems is shown in Fig. 7(right). The linear problem has three solutions found by ML that lead to the same Y^opt, but only one is found by LP. eSearch and gSearch each also found one solution but according to Table 1 neither of them generates property as good as Y^opt. For the nonlinear problem F₂, eSearch and GA each obtained one solution and neither is as good as . gSearch was able to produce , but it only found 9 solutions. Only ML is able to find 26 solutions, the most complete as far as we know.

Discussion

The selection of materials and geometry to minimize (or maximize) some given property has been a common problem in material science. The optimum answer can be obtained by an exhaustive search among numerous choices in the searching domain bounded by some boundary conditions and constraints. More intelligently and less laboriously, the search can be guided by heuristics and how the heuristic is designed is discussed by various researchers in the area of searching algorithms and artificial intelligence^23,24. However, searches that are exhaustive or follow simple heuristics suffer from the high dimensionality used in the structure representation and they often return incomplete solution sets.

Machine learning based dimension reduction techniques are known to be powerful in analyzing variable relationships in large datasets containing high variable dimensions. It is often used for data representation for the sake of reducing storage space as well as computational efforts. Regarding optimization problems, ML methods can be considered to reduce the search space by limiting the number of design variables in search and finding patterns to form a superior searching strategy. The analysis that ML is able to make to a reasonable collection of variable-objective data instances could provide valuable insights towards the variable relations and answer questions as what variables to search first, what areas are the most promising, what direction would possibly lead to quick convergence, etc.

In this work, we applied the technique to optimize the magnetoelastic properties of Iron–Gallium alloy (Galfenol). The orientation space of BCC Galfenol was discretized using finite elements to obtain a compact representation of design variables. We searched for microstructures with optimal properties, such as low elastic modulus and high magnetostrictive strains, while also exploring possible non–unique solutions for these design cases. We can deduce from the experimental results that linear problems with unique solutions are better solved with LP, which is both fast and accurate. However, multiple solutions are not explored when using LP techniques. GA performed incompetently across all problems despite its advantage in running time efficiency, in that it was never able to produce the best known answers in our problems. ML method was able to achieve a fine balance between accuracy and efficiency and more importantly, it is able to find the complete set of solutions that none of the other methods can.

Methods

BCC crystal plasticity model

A rate-independent single-crystal plasticity model developed in Kothari and Anand²⁵ is used to compute the effect of macroscopic strain on the polycrystal. For a material with α = 1,…,N slip systems defined by ortho-normal vector pairs (m^α, n^α) denoting the slip direction and slip plane normal respectively, the constitutive equations relate the following basic fields: the deformation gradient F which can be decomposed into elastic and plastic parts as F = F^eF^p, the Cauchy stress T and the slip resistances s^α > 0. In the constitutive equations, the Green elastic strain measure defined on the relaxed configuration (plastically deformed, unstressed configuration) is utilized. The conjugate stress measure is then defined as and the stress–strain relation is given by where is the fourth-order elasticity tensor. It is assumed that deformation takes place through dislocation glide and the evolution of the plastic flow is given by

where is the Schmid tensor and is the plastic shearing rate on the α^th slip system. The resolved stress on the α^th slip system is given by . The resolved shear stress attains a critical value s^α on the systems where slip occurs ( = s^α, if ). Further, the resolved shear stress does not exceed s^α on the inactive systems with . The hardening law for the slip resistance s^α is taken as,

The slip system hardening model is given as:

where h^β is a single slip hardening rate, q is the latent-hardening ratio and δ^αβ is the Kronecker delta function. The parameter q is taken to be 1.0 for coplanar slip systems and 1.4 for non-coplanar slip systems. For the single-slip hardening rate, the following specific form is adopted:

where h_o, a and s_s are slip hardening parameters taken to be identical for all slip systems, with values h_o = 500 MPa, s_s = 350 MPa and a = 2.25and for BCC Galfenol single crystals. The initial value of slip system resistance is calibrated as s_o = 180 MPa. Plastic deformation due to crystallographic slip is assumed to occur in the direction and the possible slip planes are of the {110}, {112} and {123} type.

Random data generation

Data that are expected to be seen during the search are simulated and collected before the actual search. For the purpose of supervised learning the training set is in a form of {X^(j),y^(j)}, where the X^(j) and y^(j) are the j-th training example of the feature vector and the corresponding class label. In our design problem, X = [x₁,x₂,…,x_D] ⊆ R^D is an instance of structure representation (e.g. the multidimensional ODF) and y^(j) ∈ {−1,1} is an indicator of how good a property generated by the structure is. Given a property function f(X), supposedly to be maximized, the class label y^(j) = 1 is assigned to input vector X^(j) if the function value f(X^(j)) is ‘sufficiently high’. For some algorithms to work well, data from the opposing class might be needed, which indicates sufficiently low property and are assigned a class label of −1. Data quality of the training set is ensured by enforcing randomness and polarization, with the four randomization methods, namely, Random Intervals, Random k Intervals, Random Every k and Best-First Assignment, developed to address randomness from different angles and generate samples under the constraints in our problems: and X ≥ 0. While one of them (RI) gives the chance of taking values to every variable equally, the others tend to think that a limited number (k) of variables are of greater importance than the others, only we don’t know which k. After the randomized data generation, a data polarization procedure takes portions of samples from two extreme ends, on top of which the feature learning is devised.

(a) Random Intervals (RI). We consider the unit length 1 is divided into D random intervals, or making D − 1 random cuts between the interval [0,1], where D is the dimension of X, or ODF. Then the length of intervals are randomly assigned to each feature x_i.

(b) Random k Intervals (RkI). This is similar to RI but each time, only k intervals are generated and assigned to k random dimensions. k is iterated from 1 to D − 1 with an increasing number of samples generated with regards to k (roughly a linear relationship) and then down sampled to 1000 for each iteration, except when k = 1, D samples exist and are all used.

(c) Random Every k (REk). Randomly generate k values at a time, continue only when the sum s of the current k values does not exceed the threshold 1. Update the threshold to the remainder 1 − s and repeat the process until the remainder is sufficiently small. Assign the generated values to a random set of features. k is fixed to be 5 in the experiment.

(d) Best-First Assignment (BFA). Randomly pick a feature and assign to it a random value u, 0 ≤ u ≤ 1. Distribute the remainder 1 − u evenly to all other variables so that the constraint is met. Compute the objective function and obtain the function value. Repeat n times and continue with whichever gives the best function value. Fix that selected variable and repeat the process to select another variable and go on until no variable is left.

In terms of operation time, REk and BFA are the slowest since they either operate under a probability of getting valid feature values or involve multiple evaluations of the objective function. In generating random samples for F₂ with a MATLAB implementation, RI takes an average of 0.99 ms per sample, RkI 0.42 ms, REk 12.5 ms and BFA 12.7 ms. We let RI generate 50,000 data instances and terminate REk and BFA when their generated samples reach 10,000. Thus in total we obtained around 145,000 samples. We then sort according to the objective value and keep the top and bottom 25% of data instances, with one of the extremes desired and the other undesired, each with 36,250 samples.

Search path refinement via feature selection

The motivation of introducing feature ranking into optimization is to obtain a specialized search path in the form of a sorted order of variables prior to the start of search, so as to improve the searching efficiency. Four supervised feature ranking methods, χ², Information Gain, F-score²⁶ and SVM-weight (SVM²⁷) are employed. They work either through calculating a coefficient (also called filter methods) to characterize the relevance of each feature with the class target, or through building classifiers (also called wrapper methods) with each variable and evaluating the performance. Filter methods are generally faster. The final feature ranking is decided by a voting (majority wins) of the four result sets and it determines the order of search, that is, the order of variables whose values get updated. In the search to follow, at each iteration only one variable gets updated (by, say, gradient descent) and its value fixated when the objective function stops improving. In this way, prior to the search a search path is determined, which yields greater efficiency. To retain a degree of openness, we conduct multiple searches by shuffling the top 10% of the ranked list, which proves to be critical in obtaining multiple answers for a problem in later experiments.

Search space reduction via classification schemes

For the purpose of reducing the search region of each variable, in this subroutine we build a rule-based classification tree to learn the most promising region of values for each variable. As in the path refinement activity, the data category with desired function values is represented by the class ‘1’ and the contradictory class is labeled as ‘−1’. This creates a two-class classification problem. We use rule-based classifiers, such as decision trees, because they are easily traversed and thresholds are clearly attained. After a tree is constructed, we look for the leaf nodes with “−1” since our purpose is to minimize E. We traverse from the root to each of the “−1” leaf nodes and write down the rules generated along the path. The number of samples covered by the rule and the accuracy of the rule should also be considered. For the E-minimize example the most supported rule is: IF x₁₆ ≤ 0.17565 AND x₂₆ ≤ 0.20504 AND x₁₄ ≤ 0.13302 AND x₅₃ ≥ 0.064713 AND x₃₇ ≤ 0.004566 THEN “y = −1”. Therefore, the searching regions for these variables are modified to x₁₆ ∈ [0,0.17565], x₂₆ ∈ [0,0.20504], x₁₄ ∈ [0,0.13302], x₅₃ ∈ [0.064713,1] and x₃₇ ∈ [0,0.004566]. The searching effort is thus reduced to a more concentrated area on these variables. Compared to the original region of [0,1], relatively 83%, 80%, 87%, 6% and 99% of the search region has been reduced.

Enhanced optimization

A gradient-based line search²⁸ is conducted on a ordered list of variables, finding, one variable at a time, the value of it that optimizes the function from a reduced value space. Top-rank shuffling and multi-starting strategies are incorporated so that on each run, the algorithm starts from a randomly generated initial solution in the search space, with a slightly shuffled pre-planned searching order of variables. Optimization becomes a much promising endeavor when the search space is reduced and a pre-planned searching path is deployed.

Additional Information

How to cite this article: Liu, R. et al. A predictive machine learning approach for microstructure optimization and materials design. Sci. Rep. 5, 11551; doi: 10.1038/srep11551 (2015).

References

Ashby, M. F. & Cebon, D. Materials selection in mechanical design. Le Journal de Physique IV 3, C7–1 (1993).
Google Scholar
McDowell, D. L. & Olson G. B. Concurrent design of hierarchical materials and structures. In: Scientific Modeling and Simulations, 207–240 (Springer, 2009).
Downey, P. R. & Flatau, A. B. Magnetoelastic bending of Galfenol for sensor applications. J. Appl. Phys. 97, 10R505 (2005).
Article Google Scholar
Datta, S. & Flatau, A. B. Magnetostrictive vibration sensor based on irongallium alloy. Proc. Mater. Res. Soc. 888, 0888–V04 (2005).
Article Google Scholar
Mudivarthi, C., Datta, S., Atulasimha J. & Flatau, A. B. A bidirectionally coupled magnetoelastic model and its validation using a Galfenol unimorph sensor. Smart Mater. Struct. 17, 035005 (2008).
Article ADS Google Scholar
Atulasimha, J., Flatau, A. B. & Summers E. Characterization and energy-based model of the magnetomechanical behavior of polycrystalline irongallium alloys. Smart Mater. Struct. 16, 126576 (2007).
Article Google Scholar
Na, S. M. & Flatau, A. B. Secondary recrystallization, crystallographic texture and magnetostriction in rolled Fe-Ga based alloys. J. Appl. Phys. 101, 09N518 (2007).
Article Google Scholar
Cheng, L. M., Nolting, A. E., Voyzelle, B. & Galvani, C. Deformation behavior of polycrystalline Galfenol at elevated temperatures. In: The 14th International Symposium on: Smart Structures and Materials & Nondestructive Evaluation and Health Monitoring. San Diego, CA, March 18-22, 65262N (2007).
Srisukhumbowornchai, N. & Guruswamy, S. Crystallographic Textures in Cold-Rolled and Annealed Fe-Ga And Fe-Al Alloys. Metall. Mater. Trans. A 35, 2963–2970 (2004).
Article Google Scholar
Sundararaghavan, V. & Zabaras, N. Linear analysis of texture-property relationships using process-based representations of Rodrigues space. Acta Mater. 55, 1573–1587 (2007).
Article CAS Google Scholar
Adams, B. L. et al. Microstructure sensitive design of a compliant beam. J. Mech. Phys. Solids 49, 1639–1663 (2001).
Article ADS Google Scholar
Kalidindi, S. R., Houskamp, J. R., Lyons, M. & Adams, B. L. Microstructure sensitive design of an orthotropic plate subjected to tensile load. Int. J. Plasticity 20, 1561–1575 (2004).
Article Google Scholar
Hoonkyung, L., Woon Ih, C. & Jisoon I. Combinatorial search for optimal hydrogen-storage nanomaterials based on polymers. Phys. Rev. Lett. 97, 056104 (2006).
Article Google Scholar
Sun, T. X. Combinatorial search for advanced luminescence materials. Biotechnol. Bioeng. 61, 193–201 (1999).
Article CAS Google Scholar
Lee, K. S. & Geem, Z. W. A new structural optimization method based on the harmony search algorithm. Comput. Struct. 82, 781–798 (2004).
Article Google Scholar
Adeli, H. & Cheng, N. T. Integrated genetic algorithm for optimization of space structures. J. Aerospace Eng. 6, 315–328 (1993).
Article ADS Google Scholar
Bellman R. Dynamic Programming. Princeton, NJ: (Princeton University Press, 1957).
Sundararaghavan, V. & Zabaras, N. On the synergy between texture classification and deformation process sequence selection for the control of texture-dependent properties. Acta Mater. 53, 1015–1027 (2005).
Article CAS Google Scholar
Armstrong, W. D. Nonlinear behavior of magnetostrictive particle actuated composite materials. J. Appl. Phys. 87, 3027–3031 (2000).
Article CAS ADS Google Scholar
Li, J. H. et al. Ductility, texture and large magnetostriction of Fe–Ga-based sheets. Scripta Mater. 63, 246–249 (2010).
Article CAS Google Scholar
Bate, P. S. & Quinta da Fonseca, J. Texture development in the cold rolling of IF steel. Mat. Sci. Eng. A-Struct. 380, 365–377 (2004).
Article Google Scholar
Audet, C. & Dennis Jr. J. E. Analysis of generalized pattern searches. SIAM J. Optimiz. 13, 889–903 (2002).
Article MathSciNet Google Scholar
Kirkpatrick, S., Gelatt, C. & Vecchi, M. Optimization by simulated annealing. Science 220, 671–680 (13 May 1983).
Article CAS ADS MathSciNet Google Scholar
Glover, F. Heuristic for integer programming using surrogate constraints. Decision Sci. 8, 156–166 (1977).
Article Google Scholar
Anand, L. & Kothari, M. A computational procedure for rate-independent crystal plasticity. J. Mech. Phys. Solids 44, 525–558 (1996).
Article CAS ADS MathSciNet Google Scholar
Chen, Y. W. & Lin, C. J. Combining SVMs with various feature selection strategies In: Feature Extraction, Foundations and Applications. (Springer, 2006).
Vapnik, V., Golowich, S. E. & Smola, A. Support vector method for function approximation, regression estimation and signal processing. Adv. Neur. In 9, 281–287 (1997).
Google Scholar
Moré, J. J & Thuente, D. J. Line search algorithms with guaranteed sufficient decrease. ACM T. Math. Software 20, 286–307 (1994).
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work is supported by the NSF Grant IIS-1343639, CCF-1409601, the U.S. Department of Energy (DOE) DE-SC0007456, Department of Commerce (DOC), National Institute of Standards and Technology (NIST) Award No. 70NANB14H012 and the Air Force Office of Scientific Research (AFOSR), Department of Defense (DOD) Award No. FA9550-12-1-0458.

Author information

Zhengzhang Chen
Present address: NEC Laboratories America, Inc., 4 Independence Way, Suite 200, Princeton, NJ, USA

Authors and Affiliations

EECS Department, Northwestern University, Evanston, IL, USA
Ruoqian Liu, Zhengzhang Chen, Ankit Agrawal & Alok Choudhary
Oak Ridge National Lab, Oak Ridge, TN, USA
Abhishek Kumar
Aerospace Engineering, University of Michigan, Ann Arbor, MI, USA
Veera Sundararaghavan

Authors

Ruoqian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Zhengzhang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ankit Agrawal
View author publications
You can also search for this author in PubMed Google Scholar
Veera Sundararaghavan
View author publications
You can also search for this author in PubMed Google Scholar
Alok Choudhary
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.L. wrote the manuscript text of Introduction, the first and third subsection of Results (“Design Problems” and “Result Analysis”), Discussion and the last four subsections of Methods. A.K. and V.S. wrote the second subsection in Results (“Property Calculation”) and the first subsection of Methods (“BCC crystal plasticity model”). A.K. and V.S. prepared Figs. 1, 3, 4, 5 and 6. R.L. made Figures 2, 7, with great help from Z.C., A.A. and A.C. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Liu, R., Kumar, A., Chen, Z. et al. A predictive machine learning approach for microstructure optimization and materials design. Sci Rep 5, 11551 (2015). https://doi.org/10.1038/srep11551

Download citation

Received: 07 November 2014
Accepted: 27 April 2015
Published: 23 June 2015
DOI: https://doi.org/10.1038/srep11551

This article is cited by

Prediction of contact resistance of electrical contact wear using different machine learning algorithms
- Zhen-bing Cai
- Chun-lin Li
- Zhi-nan Zhang
Friction (2024)
Improving deep learning model performance under parametric constraints for materials informatics applications
- Vishu Gupta
- Alec Peltekian
- Ankit Agrawal
Scientific Reports (2023)
Deep learning approach for chemistry and processing history prediction from materials microstructure
- Amir Abbas Kazemzadeh Farizhandi
- Omar Betancourt
- Mahmood Mamivand
Scientific Reports (2022)
Moving closer to experimental level materials property prediction using AI
- Dipendra Jha
- Vishu Gupta
- Ankit Agrawal
Scientific Reports (2022)
Reaching the Full Potential of Machine Learning in Mitigating Environmental Impacts of Functional Materials
- Ying He
- Guohong Liu
- Xiliang Yan
Reviews of Environmental Contamination and Toxicology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.