Abstract
This paper addresses an important materials engineering question: How can one identify the complete space (or as much of it as possible) of microstructures that are theoretically predicted to yield the desired combination of properties demanded by a selected application? We present a problem involving the design of a magnetoelastic Fe-Ga alloy microstructure for enhanced elastic, plastic and magnetostrictive properties. While theoretical models for computing properties given the microstructure are known for this alloy, inversion of these relationships to obtain microstructures that lead to desired properties is challenging, primarily due to the high dimensionality of microstructure space, the multiobjective design requirement and the nonuniqueness of solutions. These challenges render traditional search-based optimization methods inadequate in terms of both search efficiency and result optimality. In this paper, a route to address these challenges using a machine learning methodology is proposed. A systematic framework consisting of random data generation, feature selection and classification algorithms is developed. Experiments with five design problems that involve identification of microstructures that satisfy both linear and nonlinear property constraints show that our framework outperforms traditional optimization methods, with the average running time reduced by as much as 80% and with optimality that would not be achieved otherwise.
Introduction
Material selection has traditionally been carried out with property cross-plots, a graphical representation of material–property–performance relationships popularized by Ashby^{1}. For example, a density versus strength plot of different materials can be used by an aircraft engineer to find that, for the same strength requirement, a titanium alloy weighs much less than a steel alloy. The disadvantage of chart- or plot-based selection is that only materials with known properties are included. It is common knowledge that even within the selected alloy system, microstructural variability leads to a large range of material properties. We are currently moving towards a new paradigm of microstructure-sensitive design^{2}, where it is crucial to identify the very microstructure within an alloy system, out of innumerable candidates, that leads to a desired property or a combination of desired properties. Mathematical search-based designs explore this hypothetically infinite space and offer the freedom of engineering unknown microstructures. But the efficiency of mathematical search deteriorates quickly as the candidate space grows: with a microstructure represented by hundreds or even thousands of dimensions, traditional searches for microstructure design can be slow. In this paper, we explore advanced data-oriented techniques that enhance the mathematical search with statistical heuristics, for the purpose of fast and accurate microstructure optimization for a recently discovered alloy, Galfenol.
Galfenol has been shown to exhibit magnetostrictive strains up to 400 ppm in single crystal form (more than 10 times that of α-Fe). When a magnetic field is applied to a Galfenol single crystal, the boundaries between the magnetic domains shift and rotate, both of which cause a change in the material’s dimensions. This behavior, termed magnetostriction, has been used to transduce magnetic field to mechanical force in micro-scale sensors and actuators^{3,4,5,6}. While single crystals of Galfenol provide large magnetostriction, their preparation is expensive. Thus, development of polycrystalline Galfenol with favorable properties for various applications^{7,8,9} is desirable. A combination of high stiffness, magnetostrictive strains and yield strength is optimal for use in a cantilever beam device that can be used to generate sonar waves (as actuators), measure vibrations (as sensors) or generate electricity (as energy harvesting devices).
While theoretical models for computing properties given the microstructure are known for this alloy, inversion of these relationships to obtain microstructures that lead to desired properties is challenging. The microstructure design of polycrystalline Galfenol can be performed by tailoring the distribution of various crystal orientations (‘the orientation distribution function (ODF)’) in the microstructure (Fig. 1a ^{10}). The structural optimization is carried out along different crystallographic directions to attain favorable properties. The multiple crystallographic directions embedded in the multidimensional ODF are used as control variables and the theoretical functions for properties are the objectives. The main challenge is to address the following three issues:
(a) High dimensionality: to search in a hypothetically unlimited space of all possible crystal orientation distributions and converge within a reasonable time.
(b) Multiobjective: to optimize under a requirement of multiple extremal properties that are often in conflict.
(c) Solution completeness: to identify the complete space (or as much of it as possible) of microstructures when more than one solution produces the same optimal property.
Very few published works in the literature discuss such design problems. Significant contributions in this area include^{11}, where the authors design an ODF that maximizes the deflection of a beam without plastically deforming it. In^{12}, the authors design a plate with a circular hole subjected to an in-plane tensile load so as to maximize the load carrying capacity while avoiding plastic deformation. These analyses employ a reduced spectral series representation of the texture that lies in a significantly smaller search space. However, much information in the texture is lost in the reduced representation and the optimization search is done rather manually, by numerically interpolating the microstructure space to find the location of best performance. Traditional optimization techniques used to search for the answer lead to a unique microstructural solution, rather than the complete space of optimal microstructures. Multiple solutions are favored because traditional low-cost manufacturing processes such as forming and heat treatment can only generate a limited set of microstructures^{7,8,9} and a single design solution may not be economically feasible to manufacture^{10}. Other optimization techniques that may lead to multiple solutions (such as combinatorial search methods and evolutionary methods) have been explored in materials selection and structural optimization design^{13,14,15,16}. However, these methods are often rendered impractical by the high dimensionality of the search space (the curse of dimensionality^{17}).
Herein, we propose the employment of modern machine learning (ML) techniques as a tool to explore multiple design solutions and to reduce search time in high dimensional microstructure design problems, where the number of distinct design candidates is indeed infinite. Two crucial ML steps, namely search path refinement and search space reduction, are designed to develop heuristics that steer the search effort toward a much smaller, preferable space. As the diagram in Fig. 2 suggests, the ML method (bottom route) has these two steps (marked as 2 and 3) executed side by side, after a preceding data preparation step (marked as 1). The three steps supplement a traditional direct-search method (top route) by preprocessing the search space before the actual search goes into action. Such ML-based preprocessing is designed to locate critical regions of a search space with a small overhead, so that the search effort can be deliberately concentrated.
Results
Design Problems
We start with a spectral representation of the microstructure and its relevant statistics and present five polycrystalline alloy design problems, each with a different property objective to optimize. The properties are either one of, or a composite of, the following: Young’s modulus (E), yield strength (Y) and magnetostrictive strain (m_{s}). For sensor applications where Galfenol is used in the form of compliant beams, E is inversely proportional to deflection, so a lower modulus results in higher deflection for the same applied stress. A higher yield strength Y increases the load bearing capacity of the structure, so our objective is to maximize Y. A higher value of magnetostrictive strain (m_{s}) enhances the capability of the material to act as a sensor material.
Two additional composite functions are designed to express the need of a set of properties balanced for peak performance, fulfilling the requirement of multiobjective design. Composite function F_{1} = Y ⋅ m_{s}/E is maximized to search for a microstructure with a low modulus and high value for strength and magnetostrictive strain. Another function F_{2} is given by a combination of two desired properties, stiffness component = 274.94 GPa and magnetostrictive strain m_{s} = 99.53 ppm. A weighted Gaussian function of the form F_{2} = 10exp(−3( − 274.94)^{2} − 2(m_{s} − 99.53)^{2}) is optimized such that the maximum value of F_{2} = 10 is attained at the desired property values.
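The two composite objectives described above can be sketched as plain functions. This is a minimal illustration, not the authors' implementation: the function and argument names are hypothetical, and the stiffness argument stands for whichever stiffness component the text targets at 274.94 GPa (its symbol is missing in this copy).

```python
import math


def composite_f1(yield_strength, ms, modulus):
    """F1 = Y * m_s / E: rewards high strength and strain, low modulus."""
    return yield_strength * ms / modulus


def composite_f2(stiffness, ms, target_stiffness=274.94, target_ms=99.53):
    """Weighted Gaussian F2 = 10 exp(-3 (c - c*)^2 - 2 (m_s - m_s*)^2).

    `stiffness` is in GPa and `ms` in ppm, matching the targets in the text.
    """
    return 10.0 * math.exp(
        -3.0 * (stiffness - target_stiffness) ** 2
        - 2.0 * (ms - target_ms) ** 2
    )


# The peak value 10 is reached only at the target property pair.
peak = composite_f2(274.94, 99.53)
off_target = composite_f2(275.94, 99.53)  # 1 GPa away from the stiffness target
```

Because the exponent is quadratic with weights 3 and 2, a deviation of only 1 GPa in stiffness already scales F2 by exp(−3), so the objective is sharply peaked around the target.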
Property Calculation
The orientation distribution function (ODF), the probability density function for orientations, is applied for the quantification of crystallographic texture of Galfenol, as seen in Fig. 1. An axis-angle parametrization of the orientation space proposed by Rodrigues has been used. This is based on the unique association of an orientation with a rotation axis n and an angle of rotation θ about the axis such that r = n tan(θ/2). The relationship of r with the standard rotation matrix that maps the sample axes to the crystal axes is given in Section 2 of^{10}. The ODF (in Fig. 1b) denoted by represents the volume density of crystals with orientation r. If the orientation-dependent property for a single crystal χ(r) is known, any polycrystal property can be expressed as an expectation value or average given by:
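As an illustration of the expectation-value averaging just described, the sketch below evaluates a discretized ODF average. It assumes the ODF is stored as nodal values with known quadrature weights, so that the average becomes a weighted sum and normalization means the weighted ODF values sum to one. The names `polycrystal_average`, `chi`, `odf` and `weights` are hypothetical and do not come from the paper.

```python
import numpy as np


def polycrystal_average(chi, odf, weights):
    """Discrete expectation of a single-crystal property over an ODF.

    chi     : single-crystal property at each orientation node
    odf     : ODF value at each node (volume density of crystals)
    weights : quadrature weight of each node (assumed known from the mesh)
    """
    chi = np.asarray(chi, dtype=float)
    odf = np.asarray(odf, dtype=float)
    weights = np.asarray(weights, dtype=float)

    # A valid ODF is non-negative and integrates to one over the region.
    if np.any(odf < 0.0):
        raise ValueError("ODF values must be non-negative")
    if not np.isclose(np.dot(weights, odf), 1.0):
        raise ValueError("ODF must satisfy sum(weights * odf) == 1")

    return float(np.dot(weights * chi, odf))


# Two-node toy example: all crystals sit at the first orientation.
single = polycrystal_average(chi=[10.0, 20.0], odf=[2.0, 0.0], weights=[0.5, 0.5])
uniform = polycrystal_average(chi=[10.0, 20.0], odf=[1.0, 1.0], weights=[0.5, 0.5])
```

In this form any property obtained by such averaging is linear in the nodal ODF values, which is why the text later treats Y and m_{s} as linear problems.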
Polycrystal moduli calculation
Values of elastic parameters for BCC Galfenol crystal are C_{11} = 213.0 GPa, C_{12} = 174 GPa, C_{44} = 120 GPa. The polycrystal stiffness, , is computed through a weighted average (over ) of the stiffness of individual crystals expressed in the sample reference frame. The elastic modulus (along the x-axis) is computed through this polycrystal stiffness as^{18}:
Magnetostrictive strain calculation
Magnetostrictive strain in Galfenol single crystals is specified using two independent parameters, λ_{100} and λ_{111}, which characterize the changes in normal strain along the ⟨100⟩ and ⟨111⟩ directions resulting from the rotation of a magnetization state into these directions. The magnetostrictive strain tensor for a crystal with magnetization direction given by the unit vector m = (m_{x}, m_{y}, m_{z}) (in the crystal coordinate system) is then stated by the following expression:
We use the model from Armstrong^{19} which represents the free energy as a sum of internal and external energy terms. The following form of internal energy is taken:
The simple form for E_{I} used here ensures that a domain in the crystal has minimal and maximal energies when oriented, respectively, along the ⟨100⟩ directions (easy directions) and the ⟨111⟩ family (hard directions). Application of an external magnetic field leads to a change in energy proportional to the intensity of the magnetic field, H, the magnetization of the domain, M, and the cosine of the angle between them. The direction of the applied magnetic field is represented as n = (n_{x}, n_{y}, n_{z}) in the crystal coordinate system.
The energy contribution (per unit volume) associated with the interaction of externally applied stress tensor (σ in crystal coordinate frame) with magnetostrictive strains is given as:
In an ideal crystal without defects (at T = 0K), the domain would align in the direction of minimal energy. However, domain magnetization is expected to follow a Boltzmann–like distribution at higher temperatures due to an increase in entropy. The probability, P, that the magnetization direction is equal to m is given as:
The parameter Ω represents the spread of the magnetization direction from the ideal direction (of minimal energy). The magnetostriction strain tensor ε is obtained by averaging the strains over the probability density of magnetization in the crystal.
The above integral is calculated by sampling over all possible magnetization directions (i.e., all points on a unit sphere). The strain tensor ε in the crystal coordinate system is rotated back to the sample coordinate frame as ε_{s}(r) = R^{T}εR. The overall strain tensor is obtained by averaging ε_{s}(r) over the ODF using equation (1). The magnetostrictive strain that is optimized in our machine learning approach is the polycrystal averaged strain along the z–direction as measured with respect to an initial unstressed crystal.
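The sampling-based averaging over magnetization directions can be sketched as follows. This is an illustrative sketch only: it assumes the Boltzmann-like weight takes the form exp(−E(m)/Ω) normalized over the sphere, uses a Fibonacci lattice as one convenient way to spread sample points over the unit sphere, and takes the energy and the averaged quantity as user-supplied callables. All function names are hypothetical.

```python
import numpy as np


def fibonacci_sphere(n):
    """Return n roughly evenly spaced unit vectors on the sphere."""
    i = np.arange(n) + 0.5
    z = 1.0 - 2.0 * i / n
    phi = np.pi * (1.0 + 5.0 ** 0.5) * i
    rho = np.sqrt(1.0 - z * z)
    return np.column_stack((rho * np.cos(phi), rho * np.sin(phi), z))


def boltzmann_average(energy, quantity, omega, n=2000):
    """Average quantity(m) with weights proportional to exp(-energy(m)/omega).

    energy, quantity : callables taking a unit vector m (assumed forms)
    omega            : spread parameter, same units as the energy
    """
    directions = fibonacci_sphere(n)
    e = np.array([energy(m) for m in directions])
    # Shift by the minimum energy so the exponentials cannot overflow.
    w = np.exp(-(e - e.min()) / omega)
    w /= w.sum()
    q = np.array([quantity(m) for m in directions])
    return np.tensordot(w, q, axes=1)


# Toy check: an energy that favours +z pulls the mean direction toward +z,
# while a flat energy gives a mean direction near zero.
aligned = boltzmann_average(lambda m: -m[2], lambda m: m, omega=0.01)
flat = boltzmann_average(lambda m: 0.0, lambda m: m, omega=1.0)
```

A small Ω concentrates the weight near the minimum-energy direction, which matches the role of Ω as the spread of the magnetization direction about the ideal direction.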
For single crystal Galfenol, the various parameters used are as follows: K_{1} = 3.6 × 10^{4} J m^{−3}, λ_{100} = 170 ppm, λ_{111} = −4.67 ppm, M = 1.83/μ_{0} A m^{−1} (μ_{0} = 4π × 10^{−7}) and Ω = 625 J m^{−3}, calibrated in^{6}. In the examples, we apply a magnetic field of −500 Oe and a compressive prestress of 80 MPa, both along the [001] sample direction. At these values, the single crystals achieve a saturation magnetostrictive strain that compares well with the model in^{6}. In the examples, the magnetostrictive strain (m_{s}) along the [001] sample direction is optimized. Note that magnetostrictive strain has significant anisotropy, with the saturation strain along the [012] direction being 25% lower than that along [001].
Yield strength calculation
The crystal plasticity model described in the Methods section and also in^{10} is used to calculate the yield strength at all nodal points in the fundamental region. The model adequately captures the macroscopic tensile mode stress-strain response at room temperature reported in^{20}, as shown in Fig. 3(b). To further validate the microstructural model, we compared the crystallographic textures seen in BCC iron rolling processes with the textures predicted by our model. Fig. 3(a) shows that the model captures both the α and γ textures that arise from rolling of BCC metals (experimental result from^{21}). The strength (Y(r)) at orientation r is found as the offset z–stress resulting from an applied z–strain of 0.2% under the following velocity gradient^{18}:
The overall yield strength is obtained by averaging Y(r) over the ODF using equation (1).
The single crystal properties for elastic modulus, magnetostrictive strains and the yield strength obtained from the above analyses can be visualized on the ODF mesh in Rodrigues space. The plots shown in Fig. 4(left) depict the surface contours with internal slices of the ODF shown alongside in Fig. 4(right). The single crystals with maximum and minimum properties and their locations can be seen directly from these plots. For example, the single crystals with maximum magnetostrictive strains are all located along the z–axis of the Rodrigues space as seen in Fig. 4(a) (right). This corresponds to the z–axis fiber in which the crystal direction of easy magnetization ([001]) is aligned along the measurement axis (sample z–axis).
Optimization Result Analysis
For the five design problems, the ML-based optimization approach (its essential procedure is shown in the bottom route of Fig. 2, and each step is explained further in the Methods section) develops heuristics that guide the search into the most promising areas. To assess its optimality, time efficiency and solution completeness, we compare it with three baseline methods. All methods are evaluated using three criteria: 1) optimality, or goodness of the property, determined by the value of the property obtained; 2) efficiency, judged by the time taken to obtain the result under the same computational environment (the lower the better); and 3) completeness of solutions, which, in cases where multiple structures produce the same best property, is reflected by the number of distinct answers generated, normalized by the largest number of answers found thus far.
The three baseline methods are: 1) an exhaustive search (eSearch) containing 1 million random searches; 2) a generalized pattern search^{22}, which can be considered a smarter-than-random guided search (gSearch); and 3) a traditional optimization algorithm, specifically linear programming (LP) for linear problems and genetic algorithms (GA) for nonlinear problems. In the exhaustive search, to ensure constraint satisfaction, the Random Intervals method (details in the Methods section under random data generation) is used to generate 1 million realizations of i.i.d. ODF values. The guided search should require far fewer iterations; we randomize 100 initial starting positions of the ODF and, from each starting position, the search proceeds until a local optimum is found.
Table 1 shows the optimal values of each design function obtained by the aforementioned methods. As this comparison shows, for linear properties such as Y and m_{s}, LP always provides a valid solution. It is the nonlinear problems, such as E, F_{1} and F_{2}, that pose a challenge for traditional optimization methods. Exhaustive search gives largely unstable results; one almost has to rely on pure luck to obtain a fair result. Guided search has a tendency to get stuck at local optima, which can sometimes be even worse than the result of the naive exhaustive search (although the running time is much shorter). Surprisingly, GA worked poorly on nonlinear problems (we also tried GA on linear problems and the answer was never as good as that of LP). In fact, for nonlinear problems the guided search is often a better choice than GA. The ML method achieved the best answers across all problems, linear and nonlinear.
We use visualization bars, shown in Fig. 7, to illustrate the performance of the different methods in a standardized manner in terms of all three criteria. Figure 7(left) describes the normalized level of optimality achieved by each method (shown by vertical color sticks) for each problem (each grey box). The highest level (rightmost) is defined by the best solution found among the five methods and the lowest level (leftmost) represents the worst answer among the five. Positions in between are on a linear scale. Therefore, the further right a color stick lies, the better the method is at finding the optimum answer.
The minimum of Young’s modulus (E^{opt}) obtained is 85.9878 GPa. Although the property is nonlinear with respect to the ODF values, multiple ODF solutions were not found. The optimal ODF was found to be a single crystal with (011)[100] orientation, shown in Fig. 5(a). As is evident from the plots, the extremal ODF corresponds to a unique orientation (and symmetric equivalents, if any). Although the ODF ideally is a Dirac delta function at this orientation, a finite ODF value is plotted due to the use of a finite element discretization. In addition, the use of finite elements implies that this is not the exact minimum unless there is a node exactly at the location of the optimal orientation. However, as pointed out in^{10}, one can achieve the property bounds accurately with finer discretizations of the Rodrigues space.
Since yield stress (Y^{opt}) is a linear property, the extremal values correspond to single crystals. In our analysis, the maximum value was found to be 353.11 MPa, which is attained equally by two single crystals, close to the orientation as shown in Fig. 5(b). A similar analysis to find the ODF with maximal magnetostrictive strain () revealed a single crystal at the orientation with field-induced strain of 1.5498e−04. Note that the crystal z–axis is the direction of easy magnetization and of the largest saturation magnetostriction in single-crystal Galfenol, and the optimal crystal, as expected, has this direction aligned with the direction of strain measurement. Figure 5(c) shows the corresponding ODF.
The optimum microstructure for objective function F_{1} was also a single crystal. For problem F_{2}, however, we obtain 26 different ODFs which give the same maximum value of . ODF plots for all these solutions are shown in Fig. 6. Most distinctively, these 26 solutions are all polycrystalline, taking values on every one of the crystalline orientations. In this sense, the ML method manages to discover complex and irregular compositions of microstructure in the search for extreme values.
A running time comparison is made in Table 2. For these problems LP can finish within 1 second and each round of GA normally takes 2 to 3 seconds. LP and GA (the latter with 100 restarts, taking about 200 seconds) are shown in Fig. 7(middle) although not included in Table 2. eSearch conducts 1 million random searches along with 1 million function evaluations, and gSearch performs 100 runs, each from a different random initial point. Fig. 7(middle) illustrates the normalized level of time efficiency achieved by each method for each problem. The highest level (rightmost) is defined by the method using the least time to produce its final answer and the lowest level (leftmost) represents the longest time taken among the five. Since the time varies among methods from a couple of seconds to tens of thousands of seconds, positions in between are on a logarithmic scale. eSearch, as expected, takes the longest time to finish since it explores the space randomly without proper guidance. ML takes less than 5% of the time consumed by eSearch. Compared with gSearch, ML reduced the running time by 84.94%, 88.05%, 83.67%, 72.62% and 78.82%, respectively, for the five problems. The average reduction is 81.62%. The fourth step of the ML framework is exactly the same gSearch applied to the reduced variables; this implies that the time saved comes from the reduced search space.
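The quoted average follows directly from the five per-problem reductions, as this short arithmetic check shows (the variable names are illustrative only):

```python
# Per-problem reduction in running time of ML relative to gSearch, in percent.
reductions = [84.94, 88.05, 83.67, 72.62, 78.82]

average_reduction = sum(reductions) / len(reductions)
print(round(average_reduction, 2))  # 81.62
```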
Last but not least, the completeness of solutions is an important gauge of optimization design. Two of our design problems, the optimization of Y and F_{2}, appear to have multiple solutions that lead to identical optimum answers. The completeness of solutions for each method on these two problems is shown in Fig. 7(right). For the linear problem, ML found three solutions that lead to the same Y^{opt}, but LP found only one. eSearch and gSearch each also found one solution but, according to Table 1, neither of them yields a property value as good as Y^{opt}. For the nonlinear problem F_{2}, eSearch and GA each obtained one solution and neither is as good as . gSearch was able to produce , but it only found 9 solutions. Only ML is able to find 26 solutions, the most complete set as far as we know.
Discussion
The selection of materials and geometry to minimize (or maximize) some given property has been a common problem in material science. The optimum answer can be obtained by an exhaustive search among numerous choices in the searching domain bounded by some boundary conditions and constraints. More intelligently and less laboriously, the search can be guided by heuristics and how the heuristic is designed is discussed by various researchers in the area of searching algorithms and artificial intelligence^{23,24}. However, searches that are exhaustive or follow simple heuristics suffer from the high dimensionality used in the structure representation and they often return incomplete solution sets.
Machine learning based dimension reduction techniques are known to be powerful in analyzing variable relationships in large datasets with high variable dimensions. They are often used for data representation, to reduce storage space as well as computational effort. In optimization problems, ML methods can be used to reduce the search space by limiting the number of design variables in the search and by finding patterns that form a superior search strategy. The analysis that ML performs on a reasonable collection of variable-objective data instances can provide valuable insights into the variable relations and answer questions such as which variables to search first, which areas are the most promising and which direction is likely to lead to quick convergence.
In this work, we applied the technique to optimize the magnetoelastic properties of an Iron–Gallium alloy (Galfenol). The orientation space of BCC Galfenol was discretized using finite elements to obtain a compact representation of design variables. We searched for microstructures with optimal properties, such as low elastic modulus and high magnetostrictive strains, while also exploring possible non-unique solutions for these design cases. We can deduce from the experimental results that linear problems with unique solutions are better solved with LP, which is both fast and accurate. However, multiple solutions are not explored when using LP techniques. GA performed poorly across all problems despite its advantage in running time, in that it was never able to produce the best known answers in our problems. The ML method achieved a fine balance between accuracy and efficiency and, more importantly, it was able to find the complete set of solutions, which none of the other methods could.
Methods
BCC crystal plasticity model
A rate-independent single-crystal plasticity model developed in Kothari and Anand^{25} is used to compute the effect of macroscopic strain on the polycrystal. For a material with α = 1,…,N slip systems defined by orthonormal vector pairs (m^{α}, n^{α}) denoting the slip direction and slip plane normal respectively, the constitutive equations relate the following basic fields: the deformation gradient F which can be decomposed into elastic and plastic parts as F = F^{e}F^{p}, the Cauchy stress T and the slip resistances s^{α} > 0. In the constitutive equations, the Green elastic strain measure defined on the relaxed configuration (plastically deformed, unstressed configuration) is utilized. The conjugate stress measure is then defined as and the stress–strain relation is given by where is the fourth-order elasticity tensor. It is assumed that deformation takes place through dislocation glide and the evolution of the plastic flow is given by
where is the Schmid tensor and is the plastic shearing rate on the α^{th} slip system. The resolved stress on the α^{th} slip system is given by . The resolved shear stress attains a critical value s^{α} on the systems where slip occurs ( = s^{α}, if ). Further, the resolved shear stress does not exceed s^{α} on the inactive systems with . The hardening law for the slip resistance s^{α} is taken as,
The slip system hardening model is given as:
where h^{β} is a single slip hardening rate, q is the latent-hardening ratio and δ^{αβ} is the Kronecker delta function. The parameter q is taken to be 1.0 for coplanar slip systems and 1.4 for non-coplanar slip systems. For the single-slip hardening rate, the following specific form is adopted:
where h_{o}, a and s_{s} are slip hardening parameters taken to be identical for all slip systems, with values h_{o} = 500 MPa, s_{s} = 350 MPa and a = 2.25 for BCC Galfenol single crystals. The initial value of slip system resistance is calibrated as s_{o} = 180 MPa. Plastic deformation due to crystallographic slip is assumed to occur in the ⟨111⟩ direction and the possible slip planes are of the {110}, {112} and {123} type.
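For orientation, hardening laws of the kind described in this subsection are commonly written in the form below in the crystal plasticity literature. The block is a reconstruction under that assumption, not a copy of the source equations: the symbols h^{αβ} (hardening moduli) and the plastic shearing rate on system β are assumed notation, and the expressions should be checked against ref. 25 before use.

```latex
% Assumed standard form; verify against the cited model (ref. 25).
\begin{align}
  \dot{s}^{\alpha} &= \sum_{\beta} h^{\alpha\beta}\,\dot{\gamma}^{\beta},
      \qquad s^{\alpha}(0) = s_{o}, \\
  h^{\alpha\beta} &= \left[\, q + (1 - q)\,\delta^{\alpha\beta} \right] h^{\beta}, \\
  h^{\beta} &= h_{o}\left( 1 - \frac{s^{\beta}}{s_{s}} \right)^{a}.
\end{align}
```

With this form the self-hardening term (α = β) reduces to h^{β}, while latent hardening of other systems is scaled by q, which is consistent with the description of q as the latent-hardening ratio.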
Random data generation
Data that are expected to be seen during the search are simulated and collected before the actual search. For the purpose of supervised learning the training set is in the form {X^{(j)},y^{(j)}}, where X^{(j)} and y^{(j)} are the feature vector and the corresponding class label of the jth training example. In our design problem, X = [x_{1},x_{2},…,x_{D}] ∈ R^{D} is an instance of the structure representation (e.g. the multidimensional ODF) and y^{(j)} ∈ {−1,1} is an indicator of how good a property generated by the structure is. Given a property function f(X) that is to be maximized, the class label y^{(j)} = 1 is assigned to input vector X^{(j)} if the function value f(X^{(j)}) is ‘sufficiently high’. For some algorithms to work well, data from the opposing class might be needed; these samples have a sufficiently low property value and are assigned a class label of −1. Data quality of the training set is ensured by enforcing randomness and polarization. Four randomization methods, namely Random Intervals, Random k Intervals, Random Every k and Best-First Assignment, are developed to address randomness from different angles and generate samples under the constraints in our problems: and X ≥ 0. While one of them (RI) gives every variable an equal chance of taking values, the others assume that a limited number (k) of variables are of greater importance than the rest, although which k variables is not known in advance. After the randomized data generation, a data polarization procedure takes portions of samples from the two extreme ends, on which the feature learning is based.
(a) Random Intervals (RI). We divide the unit length 1 into D random intervals by making D − 1 random cuts in the interval [0,1], where D is the dimension of X, i.e. of the ODF. The interval lengths are then randomly assigned to the features x_{i}.
(b) Random k Intervals (RkI). This is similar to RI but each time, only k intervals are generated and assigned to k random dimensions. k is iterated from 1 to D − 1, with the number of samples generated increasing with k (roughly linearly) and then downsampled to 1000 for each iteration, except for k = 1, where D samples exist and are all used.
(c) Random Every k (REk). Randomly generate k values at a time, continue only when the sum s of the current k values does not exceed the threshold 1. Update the threshold to the remainder 1 − s and repeat the process until the remainder is sufficiently small. Assign the generated values to a random set of features. k is fixed to be 5 in the experiment.
(d) Best-First Assignment (BFA). Randomly pick a feature and assign to it a random value u, 0 ≤ u ≤ 1. Distribute the remainder 1 − u evenly to all other variables so that the constraint is met. Compute the objective function and obtain the function value. Repeat n times and continue with whichever gives the best function value. Fix that selected variable and repeat the process to select another variable and go on until no variable is left.
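The first two sampling schemes in the list above can be sketched as follows. This is a simplified illustration: it assumes the normalization constraint is an unweighted sum of the variables equal to one (the actual constraint in the paper may carry mesh-dependent weights), and the function names and the dimension used in the example are hypothetical.

```python
import numpy as np


def random_intervals(dim, rng):
    """RI: cut [0, 1] at dim - 1 random points and use the interval lengths."""
    cuts = np.sort(rng.random(dim - 1))
    lengths = np.diff(np.concatenate(([0.0], cuts, [1.0])))
    # Assign the interval lengths to the features in random order.
    return rng.permutation(lengths)


def random_k_intervals(dim, k, rng):
    """RkI: only k randomly chosen features receive non-zero values."""
    x = np.zeros(dim)
    chosen = rng.choice(dim, size=k, replace=False)
    x[chosen] = random_intervals(k, rng)
    return x


rng = np.random.default_rng(0)
sample_ri = random_intervals(10, rng)        # dense sample, all features used
sample_rki = random_k_intervals(10, 3, rng)  # sparse sample, 3 features used
```

Both samplers return non-negative vectors that sum to one by construction, so no rejection step is needed; this is consistent with RI and RkI being the fastest of the four schemes in the timings reported below.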
In terms of operation time, REk and BFA are the slowest since they either operate under a probability of getting valid feature values or involve multiple evaluations of the objective function. In generating random samples for F_{2} with a MATLAB implementation, RI takes an average of 0.99 ms per sample, RkI 0.42 ms, REk 12.5 ms and BFA 12.7 ms. We let RI generate 50,000 data instances and terminate REk and BFA when their generated samples reach 10,000. Thus in total we obtained around 145,000 samples. We then sort according to the objective value and keep the top and bottom 25% of data instances, with one of the extremes desired and the other undesired, each with 36,250 samples.
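The polarization step described above (sort by objective value, keep the two extreme quarters, label them) can be sketched as below. The function name `polarize` and its arguments are hypothetical, and the labelling convention (+1 for the high-value end, −1 for the low-value end) follows the definition of the class labels given earlier in this section.

```python
import numpy as np


def polarize(samples, values, fraction=0.25):
    """Keep the top and bottom `fraction` of samples ranked by objective value.

    Returns (X, y) where y = +1 marks the high-value end and y = -1 the
    low-value end; the middle of the distribution is discarded.
    """
    samples = np.asarray(samples, dtype=float)
    values = np.asarray(values, dtype=float)
    n_keep = int(len(values) * fraction)
    if n_keep < 1:
        raise ValueError("fraction too small for the number of samples")

    order = np.argsort(values)            # ascending objective value
    low, high = order[:n_keep], order[-n_keep:]
    X = np.vstack((samples[high], samples[low]))
    y = np.concatenate((np.ones(n_keep, dtype=int),
                        -np.ones(n_keep, dtype=int)))
    return X, y


# Toy data: eight one-dimensional samples whose objective equals their value.
toy = np.arange(8, dtype=float).reshape(-1, 1)
X_toy, y_toy = polarize(toy, toy[:, 0])

# Class size for the sample count quoted in the text.
per_class = int(145_000 * 0.25)
```

For the roughly 145,000 samples mentioned above, a 25% cut at each end gives 36,250 samples per class, matching the figure in the text.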
Search path refinement via feature selection
The motivation for introducing feature ranking into optimization is to obtain a specialized search path, in the form of a sorted order of variables, prior to the start of the search, so as to improve search efficiency. Four supervised feature ranking methods, χ^{2}, Information Gain, F-score^{26} and SVM-weight (SVM^{27}), are employed. They work either by calculating a coefficient that characterizes the relevance of each feature to the class target (filter methods), or by building classifiers with each variable and evaluating their performance (wrapper methods). Filter methods are generally faster. The final feature ranking is decided by a vote (majority wins) over the four result sets, and it determines the order of the search, that is, the order in which variables get their values updated. In the search that follows, only one variable is updated at each iteration (by, say, gradient descent) and its value is fixed when the objective function stops improving. In this way, a search path is determined prior to the search, which yields greater efficiency. To retain a degree of openness, we conduct multiple searches by shuffling the top 10% of the ranked list, which proves to be critical for obtaining multiple answers to a problem in later experiments.
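As one concrete example of a filter-type ranking, the sketch below computes an F-score per feature and turns it into a search order. It assumes the commonly used two-class F-score (squared distances of the class means from the overall mean, divided by the sum of the within-class sample variances); the voting across the four ranking methods is not reproduced here, and the function names are hypothetical.

```python
import numpy as np


def f_score(X, y):
    """Two-class F-score of every feature (larger means more discriminative)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    pos, neg = X[y == 1], X[y == -1]
    mean_all = X.mean(axis=0)

    between = (pos.mean(axis=0) - mean_all) ** 2 \
        + (neg.mean(axis=0) - mean_all) ** 2
    within = pos.var(axis=0, ddof=1) + neg.var(axis=0, ddof=1)
    # Guard against division by zero for features that never vary.
    return between / np.maximum(within, 1e-12)


def search_order(X, y):
    """Feature indices sorted from most to least discriminative."""
    return np.argsort(-f_score(X, y))


# Toy data: feature 0 separates the classes, feature 1 does not.
X_demo = np.array([[0.9, 0.1], [0.8, 0.5], [0.1, 0.2], [0.2, 0.4]])
y_demo = np.array([1, 1, -1, -1])
scores = f_score(X_demo, y_demo)
order = search_order(X_demo, y_demo)
```

In this toy case the first feature receives a much larger score than the second, so it would be searched first.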
Search space reduction via classification schemes
To reduce the search region of each variable, in this subroutine we build a rule-based classification tree to learn the most promising region of values for each variable. As in the path refinement activity, the data category with desired function values is represented by the class ‘1’ and the contradictory class is labeled as ‘−1’. This creates a two-class classification problem. We use rule-based classifiers, such as decision trees, because they are easily traversed and their thresholds can be read off directly. After a tree is constructed, we look for the leaf nodes with “−1” since our purpose is to minimize E. We traverse from the root to each of the “−1” leaf nodes and write down the rules generated along the path. The number of samples covered by a rule and the accuracy of the rule should also be considered. For the E-minimize example the most supported rule is: IF x_{16} ≤ 0.17565 AND x_{26} ≤ 0.20504 AND x_{14} ≤ 0.13302 AND x_{53} ≥ 0.064713 AND x_{37} ≤ 0.004566 THEN “y = −1”. Therefore, the search regions for these variables are modified to x_{16} ∈ [0,0.17565], x_{26} ∈ [0,0.20504], x_{14} ∈ [0,0.13302], x_{53} ∈ [0.064713,1] and x_{37} ∈ [0,0.004566]. The search effort is thus concentrated on a smaller area for these variables. Compared to the original region of [0,1], the search region is reduced by roughly 83%, 80%, 87%, 6% and 99%, respectively.
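Turning a rule into per-variable bounds is simple interval arithmetic, sketched below for the E-minimize rule quoted above. The tuple encoding of the rule and the helpers `apply_rule` and `reduction` are assumptions made for illustration; the thresholds are the ones given in the text.

```python
def apply_rule(rule, bounds=None, default=(0.0, 1.0)):
    """Tighten per-variable search intervals using (variable, op, threshold) conditions."""
    bounds = dict(bounds or {})
    for var, op, threshold in rule:
        lo, hi = bounds.get(var, default)
        if op == "<=":
            hi = min(hi, threshold)
        elif op == ">=":
            lo = max(lo, threshold)
        else:
            raise ValueError(f"unsupported operator: {op}")
        bounds[var] = (lo, hi)
    return bounds

def reduction(bounds, default=(0.0, 1.0)):
    """Fraction of the original interval removed for each constrained variable."""
    width = default[1] - default[0]
    return {v: 1.0 - (hi - lo) / width for v, (lo, hi) in bounds.items()}

# The most supported rule from the text, keyed by variable index.
rule = [
    (16, "<=", 0.17565),
    (26, "<=", 0.20504),
    (14, "<=", 0.13302),
    (53, ">=", 0.064713),
    (37, "<=", 0.004566),
]
bounds = apply_rule(rule)
shrink = reduction(bounds)
```

Variables that do not appear in any rule keep the full [0,1] interval, so only the five constrained coordinates are searched over a smaller range.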
Enhanced optimization
A gradient-based line search^{28} is conducted on an ordered list of variables, finding, one variable at a time, the value within the reduced value space that optimizes the function. Top-rank shuffling and multi-starting strategies are incorporated so that on each run the algorithm starts from a randomly generated initial solution in the search space, with a slightly shuffled pre-planned search order of variables. Optimization becomes a much more promising endeavor when the search space is reduced and a pre-planned search path is deployed.
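The overall loop can be sketched as below. Note the substitution: a derivative-free golden-section search replaces the gradient-based line search^{28} so that the example needs no gradient code, and it assumes the objective is unimodal along each coordinate. The function names, the separable toy objective and the bounds are hypothetical.

```python
import math
import random

def golden_section(f, lo, hi, tol=1e-6):
    """Minimize a unimodal 1-D function on [lo, hi]."""
    inv_phi = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while b - a > tol:
        if f(c) < f(d):
            b, d = d, c              # minimum lies in [a, d]
            c = b - inv_phi * (b - a)
        else:
            a, c = c, d              # minimum lies in [c, b]
            d = a + inv_phi * (b - a)
    return (a + b) / 2

def coordinate_search(objective, order, bounds, n_vars, rng, default=(0.0, 1.0)):
    """Random start, then optimize one variable at a time in the given order."""
    x = [rng.uniform(*bounds.get(i, default)) for i in range(n_vars)]
    for i in order:
        lo, hi = bounds.get(i, default)

        def along_axis(v, i=i):
            trial = x[:]
            trial[i] = v
            return objective(trial)

        x[i] = golden_section(along_axis, lo, hi)  # fix the value, move on
    return x, objective(x)

# Toy problem: separable quadratic with a known minimizer.
targets = [0.1, 0.5, 0.9]

def objective(x):
    return sum((xi - ti) ** 2 for xi, ti in zip(x, targets))

bounds = {0: (0.0, 0.2)}   # reduced region for variable 0 only
order = [2, 0, 1]          # pre-planned search path
runs = [coordinate_search(objective, order, bounds, 3, random.Random(s)) for s in range(3)]
best_x, best_val = min(runs, key=lambda r: r[1])
```

Multi-starting is represented here by the three seeded runs; in a fuller version each run would also receive its own shuffled copy of the search order.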
Additional Information
How to cite this article: Liu, R. et al. A predictive machine learning approach for microstructure optimization and materials design. Sci. Rep. 5, 11551; doi: 10.1038/srep11551 (2015).
References
Ashby, M. F. & Cebon, D. Materials selection in mechanical design. Le Journal de Physique IV 3, C7–1 (1993).
McDowell, D. L. & Olson, G. B. Concurrent design of hierarchical materials and structures. In: Scientific Modeling and Simulations, 207–240 (Springer, 2009).
Downey, P. R. & Flatau, A. B. Magnetoelastic bending of Galfenol for sensor applications. J. Appl. Phys. 97, 10R505 (2005).
Datta, S. & Flatau, A. B. Magnetostrictive vibration sensor based on iron-gallium alloy. Proc. Mater. Res. Soc. 888, 0888–V04 (2005).
Mudivarthi, C., Datta, S., Atulasimha, J. & Flatau, A. B. A bidirectionally coupled magnetoelastic model and its validation using a Galfenol unimorph sensor. Smart Mater. Struct. 17, 035005 (2008).
Atulasimha, J., Flatau, A. B. & Summers, E. Characterization and energy-based model of the magnetomechanical behavior of polycrystalline iron-gallium alloys. Smart Mater. Struct. 16, 1265–1276 (2007).
Na, S. M. & Flatau, A. B. Secondary recrystallization, crystallographic texture and magnetostriction in rolled Fe–Ga based alloys. J. Appl. Phys. 101, 09N518 (2007).
Cheng, L. M., Nolting, A. E., Voyzelle, B. & Galvani, C. Deformation behavior of polycrystalline Galfenol at elevated temperatures. In: The 14th International Symposium on: Smart Structures and Materials & Nondestructive Evaluation and Health Monitoring. San Diego, CA, March 18–22, 65262N (2007).
Srisukhumbowornchai, N. & Guruswamy, S. Crystallographic textures in cold-rolled and annealed Fe–Ga and Fe–Al alloys. Metall. Mater. Trans. A 35, 2963–2970 (2004).
Sundararaghavan, V. & Zabaras, N. Linear analysis of texture-property relationships using process-based representations of Rodrigues space. Acta Mater. 55, 1573–1587 (2007).
Adams, B. L. et al. Microstructure sensitive design of a compliant beam. J. Mech. Phys. Solids 49, 1639–1663 (2001).
Kalidindi, S. R., Houskamp, J. R., Lyons, M. & Adams, B. L. Microstructure sensitive design of an orthotropic plate subjected to tensile load. Int. J. Plasticity 20, 1561–1575 (2004).
Hoonkyung, L., Woon Ih, C. & Jisoon I. Combinatorial search for optimal hydrogen-storage nanomaterials based on polymers. Phys. Rev. Lett. 97, 056104 (2006).
Sun, T. X. Combinatorial search for advanced luminescence materials. Biotechnol. Bioeng. 61, 193–201 (1999).
Lee, K. S. & Geem, Z. W. A new structural optimization method based on the harmony search algorithm. Comput. Struct. 82, 781–798 (2004).
Adeli, H. & Cheng, N. T. Integrated genetic algorithm for optimization of space structures. J. Aerospace Eng. 6, 315–328 (1993).
Bellman, R. Dynamic Programming (Princeton University Press, Princeton, NJ, 1957).
Sundararaghavan, V. & Zabaras, N. On the synergy between texture classification and deformation process sequence selection for the control of texture-dependent properties. Acta Mater. 53, 1015–1027 (2005).
Armstrong, W. D. Nonlinear behavior of magnetostrictive particle actuated composite materials. J. Appl. Phys. 87, 3027–3031 (2000).
Li, J. H. et al. Ductility, texture and large magnetostriction of Fe–Ga-based sheets. Scripta Mater. 63, 246–249 (2010).
Bate, P. S. & Quinta da Fonseca, J. Texture development in the cold rolling of IF steel. Mat. Sci. Eng. A-Struct. 380, 365–377 (2004).
Audet, C. & Dennis Jr., J. E. Analysis of generalized pattern searches. SIAM J. Optimiz. 13, 889–903 (2002).
Kirkpatrick, S., Gelatt, C. & Vecchi, M. Optimization by simulated annealing. Science 220, 671–680 (1983).
Glover, F. Heuristic for integer programming using surrogate constraints. Decision Sci. 8, 156–166 (1977).
Anand, L. & Kothari, M. A computational procedure for rate-independent crystal plasticity. J. Mech. Phys. Solids 44, 525–558 (1996).
Chen, Y. W. & Lin, C. J. Combining SVMs with various feature selection strategies. In: Feature Extraction, Foundations and Applications (Springer, 2006).
Vapnik, V., Golowich, S. E. & Smola, A. Support vector method for function approximation, regression estimation and signal processing. Adv. Neur. In. 9, 281–287 (1997).
Moré, J. J. & Thuente, D. J. Line search algorithms with guaranteed sufficient decrease. ACM T. Math. Software 20, 286–307 (1994).
Acknowledgements
This work is supported by NSF Grants IIS-1343639 and CCF-1409601, the U.S. Department of Energy (DOE) Award No. DE-SC0007456, the Department of Commerce (DOC), National Institute of Standards and Technology (NIST) Award No. 70NANB14H012 and the Air Force Office of Scientific Research (AFOSR), Department of Defense (DOD) Award No. FA9550-12-1-0458.
Author information
Contributions
R.L. wrote the manuscript text of the Introduction, the first and third subsections of Results (“Design Problems” and “Result Analysis”), the Discussion and the last four subsections of Methods. A.K. and V.S. wrote the second subsection of Results (“Property Calculation”) and the first subsection of Methods (“BCC crystal plasticity model”). A.K. and V.S. prepared Figs. 1, 3, 4, 5 and 6. R.L. made Figs. 2 and 7, with great help from Z.C., A.A. and A.C. All authors reviewed the manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Liu, R., Kumar, A., Chen, Z. et al. A predictive machine learning approach for microstructure optimization and materials design. Sci Rep 5, 11551 (2015). https://doi.org/10.1038/srep11551