Discovering plasticity models without stress data

Flaschel, Moritz; Kumar, Siddhant; De Lorenzis, Laura

doi:10.1038/s41524-022-00752-4

Download PDF

Article
Open access
Published: 28 April 2022

Discovering plasticity models without stress data

npj Computational Materials volume 8, Article number: 91 (2022) Cite this article

5625 Accesses
31 Citations
11 Altmetric
Metrics details

Subjects

Abstract

We propose an approach for data-driven automated discovery of material laws, which we call EUCLID (Efficient Unsupervised Constitutive Law Identification and Discovery), and we apply it here to the discovery of plasticity models, including arbitrarily shaped yield surfaces and isotropic and/or kinematic hardening laws. The approach is unsupervised, i.e., it requires no stress data but only full-field displacement and global force data; it delivers interpretable models, i.e., models that are embodied by parsimonious mathematical expressions discovered through sparse regression of a potentially large catalog of candidate functions; it is one-shot, i.e., discovery only needs one experiment. The material model library is constructed by expanding the yield function with a Fourier series, whereas isotropic and kinematic hardening is introduced by assuming a yield function dependency on internal history variables that evolve with the plastic deformation. For selecting the most relevant Fourier modes and identifying the hardening behavior, EUCLID employs physics knowledge, i.e., the optimization problem that governs the discovery enforces the equilibrium constraints in the bulk and at the loaded boundary of the domain. Sparsity promoting regularization is deployed to generate a set of solutions out of which a solution with low cost and high parsimony is automatically selected. Through virtual experiments, we demonstrate the ability of EUCLID to accurately discover several plastic yield surfaces and hardening mechanisms of different complexity.

Materials informatics for the screening of multi-principal elements and high-entropy alloys

Article Open access 13 June 2019

Self-supervised optimization of random material microstructures in the small-data regime

Article Open access 21 March 2022

Machine learning enabled autonomous microstructural characterization in 3D samples

Article Open access 06 January 2020

Introduction

Data-driven and machine-learning-based methods are currently pushing forward the frontiers of material modeling. What started with simple regression on uniaxial tensile data has rapidly expanded into high-dimensional and big-data-based surrogate modeling of basically all types of materials of technical interest, including metals, polymers, composites, and more. While conventional material modeling was based on the a priori assumption of a constitutive law of which the unknown parameters were identified through best-fitting with experimental (or, within multiscale settings, lower-scale computational) results, current data-driven and machine-learning methods give up the usage of an analytical constitutive law altogether. In doing so, they avoid the modeling errors arising due to, e.g., the largely experience-based modeling assumptions and the choice of experimental (or computational) tests being too restrictive to describe the true physics.

Despite the achieved progress, currently available methods are still problematic due to their data-hungry and black-box nature. The state-of-the-art techniques^{1,2,3,4,5,6,7,8,9} that either bypass (directly use data as look-up tables in a model-free fashion) or surrogate (encode in, e.g., artificial neural networks (ANNs) or Gaussian processes) material models are rooted in a supervised learning or curve-fitting setting. Hence, they need a large amount of data consisting of input–output, i.e., strain-stress pairs. Since experimental stress data are only obtainable in the simplest situations, e.g., uniaxial tensile or bending tests, the comprehensive observation of strain-stress relations relying on these tests is nearly impossible. Additionally, stress tensors are challenging to measure experimentally, while force measurements only provide incomplete data in the form of boundary-averaged projections of stress tensors. Multiscale simulations can generate training data sets with tensorial stress–strain pairs, but their computational cost is still too expensive to probe the entire high-dimensional stress–strain space. Recognition of this issue is very recent. Within the constitutive-model-free paradigm, it motivated the development of the data-driven identification method^10,11, which formulates the inverse problem associated with the approach in ref. ³. Likewise recognizing the issue of limited stress data availability, a mathematical framework is proposed in ref. ¹² to calculate stress fields from deformation fields under the assumption of the alignment of the principal directions of stress and strain or strain rate. Within the stream of research on surrogating constitutive models with ANNs, recent attempts to use only displacement and global force data have been performed^13,14,15, but are limited to very simple cases (constitutive models of known form with unknown parameters or unknown constitutive models but for one-dimensional cases). At last, the uninterpretability of stress–strain relations in both paradigms is a standing challenge: it implies significant difficulties in enforcing or verifying the satisfaction of physics constraints and it hinders the extrapolation power.

From the perspectives of both the labeled data requirement and uninterpretability, the treatment of path-dependent material behavior, such as plasticity, is even more challenging as the stress state at a material point is not solely defined by its strain state, but is additionally dependent on the history of that material point, which is traditionally described using internal variables. Also in this context, the idea of constitutive-model-free approaches is to bypass the formulation of a path-dependent constitutive law and hence any assumptions on the material behavior, by solving forward problems that are directly informed by the given data^{16,17,18,19,20,21}. The other stream of approaches describes the path-dependent constitutive behavior based on ANNs^{7,22,23,24,25,26,27}, support vector machines²⁸, symbolic regression²⁹, or use the information gained from the data to correct material models known from traditional theories³⁰. Being supervised, all these methods require for the training process a tremendous amount of labeled data in form of stress–strain paths, i.e. stress–strain couples at each time over each possible loading history. In other words, it is no longer sufficient to sample data from the stress–strain space; instead, stress–strain increments for several - theoretically infinite - loading histories are required. This makes the coverage of the sampling space basically impossible and the supervised learning task unfeasible. This conclusion applies regardless of whether the data are meant to come from experiments (in which case, as mentioned earlier, already path-independent sampling is highly problematic) or from multiscale simulations (whose computational cost, which is already prohibitive in the path-independent case, becomes now orders of magnitude larger).

In this light, we propose EUCLID (Efficient Unsupervised Constitutive Law Identification and Discovery), an unsupervised discovery framework that bridges the advantages of data-driven and traditional modeling approaches. EUCLID does not require the a priori choice of a material model and thus is flexible to describe a variety of different material behaviors, and it only relies on unlabeled data, i.e., full-field displacements (obtained e.g., via Digital Image Correlation (DIC)) and global reaction forces but no stress data, generated by a single experiment. EUCLID was recently successfully demonstrated for hyperelastic material model discovery^31,32,33 and is here extended to the significantly more challenging problem of path-dependent elastoplasticity. The idea is to formulate a large library of interpretable candidate material models (also referred to as features) and to automatically discover the most relevant features in the library based on the given unlabeled data using only physics constraints (as opposed to stress labels). The inspiration for this approach comes from the dynamics community, where sparse regression from a library of candidate features has been used to discover the nonlinear dynamics of physical systems³⁴, albeit in a strictly supervised setting.

Our objective here is to discover fully general and evolving plastic yield surfaces, which characterize the material behavior of three-dimensional elastoplastic solids, purely based on two-dimensional displacement field and reaction force measurements from only one experimental test on an arbitrarily shaped specimen. The material model library is constructed by expanding the yield function with a Fourier series containing a potentially large number of terms. Yield surface growth and translation, i.e., isotropic and kinematic hardening, are introduced by making the yield function dependent on the accumulated plastic multiplier and the back stress, which are internal history variables that evolve with the plastic deformation. For selecting the most relevant Fourier modes and identifying the hardening mechanisms, EUCLID employs physics knowledge, i.e., the optimization problem that governs the discovery is formulated based on the balance of linear momentum, compensating for the unavailability of stress data. The step-by-step schematic of EUCLID is illustrated in Fig. 1 and described below.

**Fig. 1: Step-by-step schematic of EUCLID.**

Results

Material model library

In this work on path-dependent material behavior, we focus on homogeneous, isotropic materials for which linear elastic behavior is followed by associated, pressure-insensitive plastic behavior with isotropic and/or kinematic hardening. We also assume small strains and plane stress conditions. In the theory of elastoplasticity, the infinitesimal strain tensor, which is obtained from the spatial gradient of the displacement field u, is additively split into an elastic contribution and a plastic contribution ϵ = ϵ_e + ϵ_p, with the plastic strain acting as internal (or history) variable. The elastic properties of the material are characterized by the stiffness tensor ${\mathbb{C}}$, which determines the linear relation between the elastic strain and the Cauchy stress tensor ${{{\boldsymbol{\sigma }}}}={\mathbb{C}}\!:{{{{\boldsymbol{\epsilon }}}}}_{{{{\rm{e}}}}}$, whereas the plastic properties are described through the yield function f(σ, γ, σ^back), which is here assumed to be dependent on the stress tensor, the accumulated plastic multiplier γ, and the back stress tensor σ^back. The zero level set f = 0 defines the yield surface, i.e., the material deforms elastically if f < 0, and plastic yielding occurs at f = 0. Further, the yield function governs the evolution of the plastic strain through the plastic evolution law

$${\dot{{{{\boldsymbol{\epsilon }}}}}}_{p}=\dot{\gamma }\frac{\partial f({{{\boldsymbol{\sigma }}}},\gamma ,{{{{\boldsymbol{\sigma }}}}}^{{{{\rm{back}}}}})}{\partial {{{\boldsymbol{\sigma }}}}},$$

(1)

where the superposed dot denotes the derivative with respect to time. The plastic multiplier and the yield function need to fulfill the Kuhn-Tucker loading and unloading conditions as well as the consistency condition (see Supplementary Methods). For details on the theory of elastoplasticity, the reader is referred to^35,36.

The construction of a suitable material model library, i.e., a large catalog of potential candidate models, builds the basis for the unsupervised discovery framework. As the elastic material properties can be identified independently from the plastic material properties in a preprocessing step (for example, based on the full-field measurements of the first load steps, see refs. ^31,37), the elastic stiffness tensor is assumed to be known here and the main focus lies on the discovery of the yield function (see Table 1). The material model library is constructed by choosing a Fourier series ansatz (See refs. ^38,39 for other Fourier-type expansions of the yield surface).

$$f(r,\alpha ,\gamma )=\sqrt{\frac{3}{2}}r-{H}^{{{{\rm{iso}}}}}(\gamma )\mathop{\sum }\limits_{i=0}^{{n}_{f}}{\theta }_{i}\cos (3i\alpha ),$$

(2)

where θ_i are the unknown components of the material parameter vector θ and (n_f + 1) is the number of features in the library. The Lode radius r and the Lode angle α are invariants of the relative stress tensor σ^rel = σ−σ^back, and are related to the relative principal stresses σ_i, i.e., the eigenvalues of σ^rel, through

$$r=\sqrt{{\pi }_{1}^{2}+{\pi }_{2}^{2}},\quad \alpha ={{{\rm{atan}}}}2({\pi }_{2},{\pi }_{1}),\quad {{{\rm{with}}}}\quad {\pi }_{1}=\sqrt{\frac{2}{3}}{\sigma }_{1}-\sqrt{\frac{1}{6}}{\sigma }_{2}-\sqrt{\frac{1}{6}}{\sigma }_{3},\quad {\pi }_{2}=\sqrt{\frac{1}{2}}{\sigma }_{2}-\sqrt{\frac{1}{2}}{\sigma }_{3},$$

(3)

where ${{{\rm{atan}}}}2(\cdot ,\cdot )$ is the four-quadrant inverse tangent and the eigenvalues are taken in increasing order, i.e., σ₁ ≤ σ₂ ≤ σ₃. Sine terms as well as certain cosine terms are excluded from the library Equation (2) to fulfill isotropy requirements (see Supplementary Methods).

Table 1 Yield functions of the (true) hidden and discovered material models (γ = 0, σ in mm, parameters in kN mm⁻²).

Full size table

Isotropic hardening is considered in Equation (2) through the nonlinear isotropic hardening function H^iso(γ), whereas kinematic hardening is incorporated by letting the back stress evolve nonlinearly with the plastic deformation. We here assume Voce isotropic hardening^40,41 and Armstrong-Frederick kinematic hardening⁴² by defining

$$\begin{array}{ll}{H}^{{{{\rm{iso}}}}}(\gamma )&=1+{H}_{1}^{{{{\rm{iso}}}}}\gamma +{H}_{2}^{{{{\rm{iso}}}}}\left(1-\exp (-{H}_{3}^{{{{\rm{iso}}}}}\gamma )\right),\\ {\dot{{{{\boldsymbol{\sigma }}}}}}^{{{{\rm{back}}}}}&={H}_{1}^{{{{\rm{kin}}}}}\dot{{{{{\boldsymbol{\epsilon }}}}}_{{{{\rm{p}}}}}}-{H}_{2}^{{{{\rm{kin}}}}}\dot{\gamma }{{{{\boldsymbol{\sigma }}}}}^{{{{\rm{back}}}}},\end{array}$$

(4)

where ${{{\bf{H}}}}=[{H}_{1}^{{{{\rm{iso}}}}}\,{H}_{2}^{{{{\rm{iso}}}}}\,{H}_{3}^{{{{\rm{iso}}}}}\,{H}_{1}^{{{{\rm{kin}}}}}\,{H}_{1}^{{{{\rm{kin}}}}}]$ is a vector containing the unknown hardening parameters that are here assumed to be non-negative.

By choosing different combinations of active features in the Fourier series, Equation (2) can be used to describe smooth yield surfaces of arbitrary shape. The representation of the yield function as a closed-form mathematical expression (in contrast to black-box models or constitutive-model-free approaches) facilitates physical interpretation of the material model. E.g., it becomes straightforward to verify whether the yield surface is convex or whether the material behavior is tension-compression symmetric (see Supplementary Methods). The closed-form description also enables interpretable constraints on the parameters θ based on physical requirements. E.g., assuming a vanishing stress state and that no hardening has occurred (σ = 0, γ = 0, σ^back = 0), the material is expected to behave elastically (f < 0),

$$f({{{\boldsymbol{\sigma }}}}={{{\boldsymbol{0}}}},\gamma =0,{{{{\boldsymbol{\sigma }}}}}^{{{{\rm{back}}}}}={{{\boldsymbol{0}}}})=f(r=0,\alpha ,\gamma =0) \,<\, 0\ \Rightarrow \ {\theta }_{0} > \mathop{\sum }\limits_{i=1}^{{n}_{f}}| {\theta }_{i}| .$$

(5)

The numerical implementation of the model library presented in this section, either for forward finite element simulations (used to generate the artificial data) or for the inverse discovery algorithm (EUCLID), requires the formulation of a stress update procedure. Given the strain ϵ^t at the current time step t, the history variables ${{{{\bf{h}}}}}^{t-1}=\{{{{{\boldsymbol{\epsilon }}}}}_{{{{\rm{p}}}}}^{t-1},{\gamma }^{t-1},{({{{{\boldsymbol{\sigma }}}}}^{{{{\rm{back}}}}})}^{t-1}\}$ of the previous time step (All history variables are assumed to vanish at t = 0, i.e., H⁰ = {0, 0, 0}.) and the material parameters θ and H, the current stress σ^t(ϵ^t, h^t−1, θ, H) is calculated via a classical elastic predictor-plastic corrector return mapping algorithm (see Supplementary Methods).

Optimization problem

To compensate for the unavailability of stress data, we employ physics knowledge to identify which features in the feature library should be active and to find the values of the corresponding active parameters within θ and H. Under quasi-static loading of a two-dimensional domain Ω with boundary ∂Ω, the linear momentum balance (∇ ⋅ σ = 0) in its weak formulation is given by

$${\int}_{\!{{\Omega }}}{{{{\boldsymbol{\sigma }}}}}^{t}({{{{\boldsymbol{\epsilon }}}}}^{t},{{{{\bf{h}}}}}^{t-1},{{{\boldsymbol{\theta }}}},{{{\bf{H}}}}):\nabla {{{\bf{v}}}}{{{\rm{d}}}}A={\int}_{\!{\partial {{\Omega }}}}{\hat{{{{\bf{t}}}}}}^{t}\cdot {{{\bf{v}}}}{{{\rm{d}}}}s,$$

(6)

for all admissible test functions v, where ${\hat{{{{\bf{t}}}}}}^{t}$ denotes the boundary tractions. The available data consists of displacement measurements {u^a,t: a = 1, …, n_n; t = 1, …, n_t} at n_n points and n_β net reaction force measurements $\{{\hat{R}}^{\beta,t}:\beta =1,\ldots ,{n}_{\beta };t=1,\ldots ,{n}_{t}\}$ on some boundary segments, both for n_t time steps. Our objective is to determine θ and H such that Equation (6) is satisfied by the data. We emphasize that, although a two-dimensional inverse problem is formulated here, the discovered plasticity models are valid for three-dimensional stress states. This means that we exploit a two-dimensional data set to automatically find plasticity models applicable to three-dimensional solids.

We create a mesh connecting the points, each point being associated with finite element shape functions {N^a(X): a = 1, …, n_n} such that the strain field is obtained as ϵ^t(X) = ∑_asym( ∇ N^a(X) ⊗ u^a,t), and consequently the stress field σ^t(X, ϵ^t, h^t−1, θ, H) is determined for each time step via the elastoplastic constitutive law described by θ and H. Using the same set of shape functions for the test functions, the nodal internal forces are computed as

$${{{{\bf{F}}}}}^{a,t}({{{\boldsymbol{\theta }}}},{{{\bf{h}}}})={\int}_{\!{{\Omega }}}{{{{\boldsymbol{\sigma }}}}}^{t}({{{\bf{X}}}},{{{{\boldsymbol{\epsilon }}}}}^{t},{{{{\bf{h}}}}}^{t-1},{{{\boldsymbol{\theta }}}},{{{\bf{H}}}})\nabla {N}^{a}({{{\bf{X}}}}){{{\rm{d}}}}A.$$

(7)

Equilibrium dictates that the internal forces corresponding to all free degrees of freedom (grouped in the set ${{{{\mathcal{D}}}}}^{{{{\rm{free}}}}}$) be zero at each time step, which naturally leads to the cost function

$${C}^{{{{\rm{free}}}}}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})=\mathop{\sum }\limits_{t=1}^{{n}_{t}}\mathop{\sum}\limits_{(a,i)\in {{{{\mathcal{D}}}}}^{{{{\rm{free}}}}}}{\left|{F}_{i}^{a,t}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})\right|}^{2}.$$

(8)

At the same time, for each set of constrained degrees of freedom ${{{{\mathcal{D}}}}}^{{{{\rm{disp}}}},\beta }$, equilibrium requires that the sum of the internal forces be balanced at each time step by the corresponding reaction force ${\hat{R}}^{\beta ,t}$,

$${C}^{{{{\rm{disp}}}}}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})=\mathop{\sum }\limits_{t=1}^{{n}_{t}}\mathop{\sum }\limits_{\beta =1}^{{n}_{\beta }}{\left|{\hat{R}}^{\beta ,t}-\mathop{\sum}\limits_{(a,i)\in {{{{\mathcal{D}}}}}^{{{{\rm{disp}}}},\beta }}{F}_{i}^{a,t}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})\right|}^{2}.$$

(9)

We combine the two costs in a single cost function

$$C({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})={C}^{{{{\rm{free}}}}}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})+{\lambda }_{r}{C}^{{{{\rm{disp}}}}}({{{\boldsymbol{\theta }}}},{{{\bf{H}}}}),$$

(10)

with the balancing hyperparameter λ_r > 0. For details, see Supplementary Methods.

The key difference between EUCLID and traditional (supervised or unsupervised) parameter identification methods is the fact that the form of the material model is not known a priori. In the context of yield surface discovery, the number and combination of active features in Equation (2) is unknown. Directly minimizing the cost function in Equation (10) would result in a dense solution vector θ, i.e., a highly complex material model with many non-zero material parameters. Such material models are not desired as their calibration and implementation are impractical and they bear a high risk of overfitting the data, possibly resulting in physically inadmissible material behavior. For these reasons, we promote sparsity in the solution vector by formulating an ℓ_p-regularized minimization problem as

$$\{{{{{\boldsymbol{\theta }}}}}^{{{{\rm{opt}}}}},{{{{\bf{H}}}}}^{{{{\rm{opt}}}}}\}=\arg \mathop{\min }\limits_{\{{{{\boldsymbol{\theta }}}},{{{\bf{H}}}}\ge {{{\bf{0}}}}\}}\left(C({{{\boldsymbol{\theta }}}},{{{\bf{H}}}})+{\lambda }_{p}\parallel {{{\boldsymbol{\theta }}}}{\parallel }_{p}^{p}\right),\qquad {{{\rm{where}}}}\qquad \parallel {{{\boldsymbol{\theta }}}}{\parallel }_{p}={\left(\mathop{\sum }\limits_{i = 1}^{{n}_{f}}| {\theta }_{i}{| }^{p}\right)}^{1/p}$$

(11)

For the hardening models, we assume a mathematical form that is parsimonious at the outset but flexible enough to describe a large class of different hardening mechanisms. To achieve parsimony in the yield surface, we apply ℓ_p-regularization to the parameter vector θ. The regularization with hyperparameters λ_p > 0 and p ∈ (0, 1] is a generalization of the LASSO (least absolute shrinkage and selection operator)⁴³ which is recovered for p = 1. Smaller values of p and higher values of λ_p promote sparsity more aggressively, but on the other side increase the degree of the non-convexity of the function to be minimized. Here, as in our previous work³¹, we use p = 1/4, while the choice of λ_p is discussed in the Supplementary Methods. Note that in order to fulfill Equation (5), the parameter θ₀ corresponding to the constant Fourier mode is purposefully not considered in the ℓ_p regularization. Further, we highlight that applying the ℓ_p regularization shrinks the absolute values ∣θ_i∣ for i ∈ {1, …, n_f} and hence reinforces the fulfillment of the physical constraint in Equation (5). A similar correlation between sparsity of the material model and fulfillment of physical requirements has been observed by^31,34, supporting the hypothesis that sparse models are more likely to fulfill physical requirements.

Equation (11) is a nonlinear and non-convex optimization problem. As there exists no closed-form expression for the stress–strain relation σ^t(ϵ^t, h^t−1, θ, H), it is not feasible to differentiate the cost function in closed form. For this reason, we use a trust-region reflective Newton solver⁴⁴ with gradients computed via a finite difference approximation; however, other optimization techniques may also be feasible to use. To tackle the issue of non-convexity, which leads to multiple local minima, we optimize for multiple randomly chosen initial guesses and operate an automatic threshold-based selection that favors a solution with low cost and high parsimony. All details regarding the optimization procedure are provided in the Supplementary Methods.

Benchmarks

We benchmark EUCLID on eight largely different elastoplastic material models with different hardening parameters (see Tables 1 and 2 and Fig. 3 for yield function expressions, hardening parameters, and yield surface plots, respectively)^45,46:

VM: Von-Mises yield function
F2: Yield function in Equation (2) with only θ₀, θ₂ ≠ 0
TR^*: Smooth approximation of Tresca yield function⁴⁷
SI^*: Smooth approximation of Schmidt-Ishlinsky yield function^48,49
F1: Yield function in Equation (2) with only θ₀, θ₁ ≠ 0
IV^*: Smooth approximation of Ivlev yield function⁵⁰
MA^*: Smooth approximation of Mariotte yield function⁵¹
NC: Yield function with non-convex yield surface

The first four material models VM, F2, TR^*, SI^* exhibit tension-compression symmetry, while the others are tension-compression asymmetric. Material models VM, F2, F1, NC are obtained by choosing different combinations of active and inactive parameters in Equation (2). While yield surfaces in classical plasticity are convex and despite the issues connected with the thermodynamic consistency of non-convex yield surfaces, we deliberately choose model NC to evaluate the capabilities of our approach also in the rare case of non-convexity⁵². The original Tresca, Schmidt-Ishlinsky, Ivlev, and Mariotte models are characterized by non-smooth yield functions which require complex stress update procedures³⁶ beyond the scope of this work. Therefore, and in order to verify the flexibility of the chosen yield function library, the models with non-smooth yield functions are approximated by the Fourier-type expansion in Equation (2) and denoted by superscript (⋅)^*. Each of the approximated models contains eleven active features with n_f up to 20 in Equation (2), see Supplementary Table 1. The validity of such approximations is proved in the Supplementary Methods.

Table 2 Hardening parameters of the (true) hidden and discovered plasticity models (σ in mm).

Full size table

EUCLID takes full-field displacement and global reaction force measurements as input. DIC data are emulated by generating artificial data via the finite element method (FEM) based on the material models illustrated above. The chosen domain is a square plate with two elliptic holes (as schematically shown in Fig. 2) in-plane stress conditions and meshed with bilinear quadrilateral elements. The plate is deformed under displacement-controlled tension, followed by displacement-controlled compression. In the tension phase, the prescribed vertical displacement δ is linearly increased from 0 to δ = 0.5 mm, and subsequently decreased to δ = −0.5 mm in the compression phase. The nodal displacements are recorded from the FEM solution at a total of n_t = 2250 load steps, 750 load steps in the tension phase, and 1500 load steps in the compression phase. The total horizontal and vertical reaction forces on the top boundary are also recorded. Note that we purposefully choose a complex specimen geometry, in contrast to the simple geometry of traditional coupon tests, with the objective to obtain a strain field that is rich enough to solve the ill-posed problem of identifying the yield surface with no stress data and just one experiment.

**Fig. 2: Geometry and boundary conditions of the chosen domain: a plate with two elliptic holes under displacement-controlled vertical tension followed by vertical compression.**

As real DIC data are unavoidably affected by noise in the measured displacement field, we add independent Gaussian noise with zero mean and standard deviation σ > 0 to the synthetic displacement data coming from the FEM simulations (see Supplementary Methods). We consider noise levels σ ∈ {0 μm, 0.1 μm, 0.3 μm, 0.5 μm}, where σ = 0.1 μm is considered a reasonable upper limit for modern DIC setups^53,54. Pushing EUCLID to its breaking point, we further test it for a noise level of σ = 0.5 μm. The effect of the noise on the yield surface discovery can be reduced by temporal and spatial smoothing. Here, we restrict ourselves to temporal denoising by applying a Savitzky-Golay filter⁵⁵ based on quadratic polynomial fitting with a moving-window length of 50-time steps.

With EUCLID, discovery can proceed from a potentially very large model library—e.g., in our previous work³¹ a library with 43 features was used for the discovery of hyperelastic strain energy functions. However, it turns out that with the chosen Fourier ansatz a relatively small number of features is already sufficient to provide a remarkably flexible and general yield surface description. Thus, we consider here only seven features (n_f = 6) in the model library, i.e., cosine terms with frequencies up to $\frac{18}{2\pi }$. The closed-form expressions of the yield functions discovered by EUCLID from the data with different noise levels are reported in Table 1, in comparison to the true expressions. Table 2 shows the corresponding hardening parameters. A comparison of the yield surface plots in the π plane of the true models and the discovered models after hardening (γ = 0.1) is presented in Fig. 3. As the π plane depends on the relative stresses and not on the absolute stresses, the yield surface plots in Fig. 3 do not capture kinematic hardening. To this end, yield surface plots in the absolute stress component space are shown in Fig. 4 for an exemplary material model at the end of three different deformation paths in the data set. Similar plots for the other material models are shown in Supplementary Figures 6 and 7.

**Fig. 3: Yield surface plots of the (true) hidden and discovered plasticity models for different noise levels σ (in mm).**

Fig. 4: Yield surface plots of the (true) hidden and discovered Schmidt-Ishlinsky material model (lower row of plots) at the end of three different deformation paths (A, B, C) corresponding to characteristic points in the specimen domain.

In the case of displacements without noise (σ = 0), material models VM, F2, F1, NC are discovered exactly, i.e., both the mathematical form of the yield function and the parameters are correctly identified. For increasing noise, the discovered parameters deviate from the true parameters as expected. In the non-convex case (NC), false-positive predictions (features that appear in the discovered formula but are not present in the true model) are observed. Material models TR^*, SI^*, IV^*, MA^* on the other hand cannot be described exactly by the chosen model library which makes the exact discovery of the yield function form impossible. However, for the tension-compression symmetric models (TR^*, SI^*) it is observed in many cases that tension-compression symmetry-breaking features (e.g., $\cos (3\alpha )$, $\cos (9\alpha )$ and $\cos (15\alpha )$) are automatically discarded by EUCLID, which increases the interpretability of the discovered models. For the tension-compression asymmetric models (IV^*, MA^*) the considered ground truth models are not sparse and consequently, EUCLID discovers dense models. This indicates that there is no concise and interpretable model in the library that can describe the provided data accurately. Hence, a trade-off between model accuracy and interpretability is observed in this situation. This could in future studies be mitigated by expanding the feature library, e.g., by including (in addition to the trigonometric features) non-smooth yield surfaces in the ansatz space.

Whenever EUCLID fails to discover the correct yield function form, parameters (θ_i) corresponding to false-positive features are observed to be considerably smaller than the other parameters and hence have a small influence on the material behavior. Further, false-negative feature predictions (features that are not discovered although active in the true model) do not seem to have a high impact on the material behavior either, which is corroborated by the high accuracy observed in the yield surface plots for all models and noise levels except higher noise TR^* and NC (see Figs. 3 and 4). Hence, for the models and noise levels for which EUCLID does not find the correct closed-form expression of the yield function, accurate surrogate models are discovered that mimic the behavior of the true yield function. The Tresca yield surface is the only benchmark for which non-optimal fitting results are observed in shear stress regions even in the case without noise (see Fig. 3). The reasons for this are discussed in detail in the Supplementary Methods, where we also show that enriching the data with shear deformation drastically improves the results.

Discussion

We show that EUCLID is able to discover interpretable plasticity models from displacement and net reaction force data only and without using any stress data. The method hence provides a physics-constrained, data-efficient alternative to supervised data-driven and machine-learning methods, which require an enormous amount of labeled data and thus are most often inapplicable. The sparse regression enables parsimonious model selection in contrast to an a priori choice of the plasticity model such as in the traditional material model calibration techniques. Hence, after having demonstrated EUCLID for hyperelasticity³¹ and for plasticity, we aim at pursuing its extension to the discovery of more general cases of plasticity with pressure sensitivity and anisotropy in future work. Further extensions of interest may include other categories of material behavior such as visco-elasticity, visco-plasticity, damage, and general combinations thereof. Another important future goal will be the employment of EUCLID on experimental data in the two- and three-dimensional settings using digital image and volume correlation data, respectively.

An important question raised by EUCLID is how to choose the specimen geometry and loading conditions to maximize the richness of information in the data. While it is qualitatively clear that more complex setups hold the promise of a richer resulting data set, a quantitative answer requires specific investigations which are an important future research direction. A related question is on the range of strains and loading histories within which the outcome of EUCLID should be trusted. According to our experience, the discovered material model performs accurately not only in interpolation within the range of the data; as EUCLID ensures physical admissibility of the material model, good fitting accuracy, as well as physically sound material behavior, can be observed even beyond the range of the training data. However, as in any other scientific application, extrapolation has its limits and the discovered material model should be used with caution too far outside of the range of the training data set.

Methods

Detailed descriptions of the numerical implementation of the material model library, the data generation, the formulation of the objective function, and the optimization procedure are provided in the Supplementary Methods.

Data availability

The data generated during the current study are available in the ETH Research Collection⁵⁶.

Code availability

The codes generated during the current study are available in the ETH Library⁵⁷ (see also https://euclid-code.github.io/).

References

Ghaboussi, J., Garrett, J. H. & Wu, X. Knowledge based modeling of material behavior with neural networks. J. Eng. Mech. 117, 132–153 (1991).
Article Google Scholar
Sussman, T. & Bathe, K.-J. A model of incompressible isotropic hyperelastic material behavior using spline interpolations of tension-compression test data. Commun. Numer. Methods Eng. 25, 53–63 (2009).
Article Google Scholar
Kirchdoerfer, T. & Ortiz, M. Data-driven computational mechanics. Computer Methods Appl. Mech. Eng. 304, 81–101 (2016).
Article Google Scholar
Ibañez, R. et al. Data-driven non-linear elasticity: constitutive manifold construction and problem discretization. Comput. Mech. 60, 813–826 (2017).
Article Google Scholar
Crespo, J., Latorre, M. & Montáns, F. J. WYPIWYG hyperelasticity for isotropic, compressible materials. Comput. Mech. 59, 73–92 (2017).
Article Google Scholar
González, D., Chinesta, F. & Cueto, E. Learning corrections for hyperelastic models from data. Front. Mater. 6, 14 (2019).
Article Google Scholar
Mozaffar, M. et al. Deep learning predicts path-dependent plasticity. Proc. Natl, Acad. Sci. 116, 26414–26420 (2019).
Article CAS Google Scholar
Zhang, A. & Mohr, D. Using neural networks to represent von Mises plasticity with isotropic hardening. Int. J. Plasticity 132, 102732 (2020).
Article Google Scholar
Kovachki, N. et al. Multiscale modeling of materials: computing, data science, uncertainty and goal-oriented optimization. Mech. Mater. 165, 104156 (2022).
Article Google Scholar
Leygue, A., Coret, M., Réthoré, J., Stainier, L. & Verron, E. Data-based derivation of material response. Comput. Methods Appl. Mech. Eng. 331, 184–196 (2018).
Article Google Scholar
Dalémat, M. Measuring stress field without constitutive equation. Mech. Mater. 136, 103087 (2019).
Article Google Scholar
Cameron, B. C. & Tasan, C. Full-field stress computation from measured deformation fields: a hyperbolic formulation. J. Mech. Phys. Solids 147, 104186 (2021).
Article Google Scholar
Tartakovsky, A. M., Marrero, C. O., Perdikaris, P., Tartakovsky, G. D. & Barajas Solano, D. Physics informed deep neural networks for learning parameters and constitutive relationships in subsurface flow problems. Water Resour. Res. 56 e2019WR026731 (2020).
Huang, D. Z., Xu, K., Farhat, C. & Darve, E. Learning constitutive relations from indirect observations using deep neural networks. J. Comput. Phys. 416, 109491 (2020).
Article Google Scholar
Haghighat, E., Raissi, M., Moure, A., Gomez, H. & Juanes, R. A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Comput. Methods Appl. Mech. Eng. 379, 113741 (2021).
Article Google Scholar
Chinesta, F. et al. Data-driven computational plasticity. Procedia Eng. 207, 209–214 (2017).
Article Google Scholar
Ibañez, R. et al. A manifold learning approach to data-driven computational elasticity and inelasticity. Arch. Comput. Methods Eng. 25, 47–57 (2018).
Article Google Scholar
Eggersmann, R., Kirchdoerfer, T., Reese, S., Stainier, L. & Ortiz, M. Model-free data-driven inelasticity. Comput. Methods Appl. Mech. Eng. 350, 81–99 (2019).
Article Google Scholar
Tang, S. et al. MAP123-EP: A mechanistic-based data-driven approach for numerical elastoplastic analysis. Comput. Methods Appl. Mech. Eng. 364, 112955 (2020).
Article Google Scholar
Carrara, P., De Lorenzis, L., Stainier, L. & Ortiz, M. Data-driven fracture mechanics. Comput. Methods Appl. Mech. Eng. 372, 113390 (2020).
Article Google Scholar
Carrara, P., Ortiz, M. & De Lorenzis, L. Data-driven rate-dependent fracture mechanics. J. Mech. Phys. Solids 155, 104559 (2021).
Article Google Scholar
Li, X., Roth, C. C. & Mohr, D. Machine-learning based temperature- and rate-dependent plasticity model: application to analysis of fracture experiments on DP steel. Int. J. Plasticity 118, 320–344 (2019).
Article CAS Google Scholar
Vlassis, N. N. & Sun, W. Sobolev training of thermodynamic-informed neural networks for interpretable elasto-plasticity models with level set hardening. Comput. Methods Appl. Mech. Eng. 377, 113695 (2021).
Article Google Scholar
Huang, D., Fuhg, J. N., Weißenfels, C. & Wriggers, P. A machine learning based plasticity model using proper orthogonal decomposition. Comput. Methods Appl. Mech. Eng. 365, 113008 (2020).
Article Google Scholar
Shen, W., Cao, Y., Shao, J. & Liu, Z. Prediction of plastic yield surface for porous materials by a machine learning approach. Mater. Today Commun. 25, 101477 (2020).
Article CAS Google Scholar
Kumar, S., Tan, S., Zheng, L. & Kochmann, D. M. Inverse-designed spinodoid metamaterials. npj Comput. Mater. 6, 73 (2020).
Article Google Scholar
Zheng, L., Kumar, S. & Kochmann, D. M. Data-driven topology optimization of spinodoid metamaterials with seamlessly tunable anisotropy. Comput. Methods Appl. Mech. Eng. 383, 113894 (2021).
Article Google Scholar
Hartmaier, A. Data-oriented constitutive modeling of plasticity in metals. Materials 13, 1600 (2020).
Article CAS Google Scholar
Bomarito, G. et al. Development of interpretable, data-driven plasticity models with symbolic regression. Comput. Struct. 252, 106557 (2021).
Article Google Scholar
Ibáñez, R. et al. Hybrid constitutive modeling: data-driven learning of corrections to plasticity models. Int. J. Mater. Form. 12, 717–725 (2019).
Article Google Scholar
Flaschel, M., Kumar, S. & De Lorenzis, L. Unsupervised discovery of interpretable hyperelastic constitutive laws. Comput. Methods Appl. Mech. Eng. 381, 113852 (2021).
Article Google Scholar
Flaschel, M., Kumar, S. & De Lorenzis, L. Supplementary software for “Unsupervised discovery of interpretable hyperelastic constitutive laws". ETH Libr. https://doi.org/10.5905/ethz-1007-508 (2021).
Flaschel, M., Kumar, S. & De Lorenzis, L. FEM Data - Unsupervised discovery of interpretable hyperelastic constitutive laws. ETH Res Collect https://doi.org/10.3929/ethz-b-000505693 (2021).
Brunton, S. L., Proctor, J. L. & Kutz, J. N. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Natl Acad. Sci. 113, 3932–3937 (2016).
Article CAS Google Scholar
Simo, J. C. & Hughes, T. J. R. Computational inelasticity. No. v. 7 in Interdisciplinary applied mathematics (Springer, New York, 1998).
Neto, E. d. S., Peric, D. & Owen, D. Computational methods for plasticity (John Wiley & Sons, 2008).
Pannier, Y., Avril, S., Rotinat, R. & Pierron, F. Identification of elasto-plastic constitutive parameters from statically undetermined tests using the virtual fields method. Exp. Mech. 46, 735–755 (2006).
Article Google Scholar
Ortiz, M. & Popov, E. P. Distortional hardening rules for metal plasticity. J. Eng. Mech. 109, 1042–1057 (1983).
Article Google Scholar
Raemy, C., Manopulo, N. & Hora, P. On the modelling of plastic anisotropy, asymmetry and directional hardening of commercially pure titanium: a planar Fourier series based approach. Int. J. Plasticity 91, 182–204 (2017).
Article CAS Google Scholar
Voce, E. The relationship between stress and strain for homogeneous deformation. J. Inst. Met. 74, 537–562 (1948).
CAS Google Scholar
Voce, E. A practical strain-hardening function. Metallurgia 51, 219–226 (1955).
Google Scholar
Armstrong, P. J. & Frederick, C. A mathematical representation of the multiaxial Bauschinger effect, vol. 731 (Central Electricity Generating Board and Berkeley Nuclear Laboratories, 1966).
Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc.: Ser. B (Methodol.) 58, 267–288 (1996).
Google Scholar
Coleman, T. F. & Li, Y. On the convergence of interior-reflective Newton methods for nonlinear minimization subject to bounds. Math. Program. 67, 189–224 (1994).
Article Google Scholar
Kolupaev, V. A. Equivalent Stress Concept for Limit State Analysis, vol. 86 of Advanced Structured Materials (Springer International Publishing, Cham, 2018). http://link.springer.com/10.1007/978-3-319-73049-3.
Rosendahl, P. L., Kolupaev, V. A. & Altenbach, H. Extreme Yield Figures for Universal Strength Criteria. In Altenbach, H. & Öchsner, A. (eds.) State of the Art and Future Trends in Material Modeling, vol. 100, 259–324 (Springer International Publishing, Cham, 2019). http://link.springer.com/10.1007/978-3-030-30355-6_12. Series Title: Advanced Structured Materials.
Tresca, H. E. Mémoire sur l’écoulement des corps solides (Imprimerie impériale, 1869).
Schmidt, R. Über den Zusammenhang von Spannungen und Formänderungen im Verfestigungsgebiet. Ing.-Arch. 3, 215–235 (1932).
Article Google Scholar
Ishlinsky, A. Y. Hypothesis of strength of shape change (in Russ.: Gipoteza prochnosti formoizmenenija). Uchebnye Zap. Moskovskogo Universiteta, Mekhanika 46, 104–114 (1940).
Google Scholar
Ivlev, D. The theory of fracture of solids. J. Appl. Math. Mech. 23, 884–895 (1959).
Article Google Scholar
Mariotte, E. & de La Hire, P. Traité du mouvement des eaux et des autres corps fluides (Chez Claude-Jombert, 1718).
Glüge, R. & Bucci, S. Does convexity of yield surfaces in plasticity have a physical significance? Math. Mech. Solids 23, 1364–1373 (2018).
Article Google Scholar
Pierron, F., Avril, S. & The Tran, V. Extension of the virtual fields method to elasto-plastic material identification with cyclic loads and kinematic hardening. Int. J. Solids Struct. 47, 2993–3010 (2010).
Article Google Scholar
Marek, A., Davis, F. M., Rossi, M. & Pierron, F. Extension of the sensitivity-based virtual fields to large deformation anisotropic plasticity. Int. J. Mater. Form. 12, 457–476 (2019).
Article Google Scholar
Savitzky, A. & Golay, M. J. E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36, 1627–1639 (1964).
Article CAS Google Scholar
Flaschel, M., Kumar, S. & De Lorenzis, L. FEM Data - Discovering plasticity models without stress data. ETH Res Collect https://doi.org/10.3929/ethz-b-000534002 (2022).
Flaschel, M., Kumar, S. & De Lorenzis, L. Supplementary software for “Discovering plasticity models without stress data". ETH Lib https://doi.org/10.5905/ethz-1007-509 (2022).

Download references

Acknowledgements

M.F. and L.D.L. gratefully acknowledge the financial support of the Swiss National Science Foundation (SNF Project 200021_204316).

Author information

Authors and Affiliations

Department of Mechanical and Process Engineering, ETH Zürich, 8092, Zürich, Switzerland
Moritz Flaschel & Laura De Lorenzis
Department of Materials Science and Engineering, Delft University of Technology, 2628 CD, Delft, The Netherlands
Siddhant Kumar

Authors

Moritz Flaschel
View author publications
You can also search for this author in PubMed Google Scholar
Siddhant Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Laura De Lorenzis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.F., S.K., and L.D.L. conceived the research. M.F. developed the theory, implemented the algorithm, and performed the numerical experiments. M.F., S.K., and L.D.L. wrote the paper.

Corresponding author

Correspondence to Laura De Lorenzis.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Flaschel, M., Kumar, S. & De Lorenzis, L. Discovering plasticity models without stress data. npj Comput Mater 8, 91 (2022). https://doi.org/10.1038/s41524-022-00752-4

Download citation

Received: 13 August 2021
Accepted: 14 March 2022
Published: 28 April 2022
DOI: https://doi.org/10.1038/s41524-022-00752-4

This article is cited by

A Review of the Constitutive Modelling of Metals and Alloys in Machining Process
- Hadi Bakhshan
- Eugenio Oñate
- Josep Maria Carbonell i Puigbó
Archives of Computational Methods in Engineering (2024)
Deep learning in computational mechanics: a review
- Leon Herrmann
- Stefan Kollmannsberger
Computational Mechanics (2024)
Application of Machine Learning and Deep Learning in Finite Element Analysis: A Comprehensive Review
- Dipjyoti Nath
- Ankit
- Sachin Singh Gautam
Archives of Computational Methods in Engineering (2024)
Advancing Nitinol Implant Design and Simulation Through Data-Driven Methodologies
- Harshad M. Paranjape
Shape Memory and Superelasticity (2023)