Abstract
When far from equilibrium, manybody systems display behavior that strongly depends on the initial conditions. A characteristic such example is the phenomenon of plasticity of crystalline and amorphous materials that strongly depends on the material history. In plasticity modeling, the history is captured by a quenched, local and disordered flow stress distribution. While it is this disorder that causes avalanches that are commonly observed during nanoscale plastic deformation, the functional form and scaling properties have remained elusive. In this paper, a generic formalism is developed for deriving local disorder distributions from fieldresponse (e.g., stress/strain) timeseries in models of crackling noise. We demonstrate the efficiency of the method in the hysteretic randomfield Ising model and also, models of elastic interface depinning that have been used to model crystalline and amorphous plasticity. We show that the capacity to resolve the quenched disorder distribution improves with the temporal resolution and number of samples.
Introduction
When Gibbs proposed the famous measure for quantifying the probabilities of individual gas microstates,^{1} the primary interest was on figuring out the microscopic principles that gave rise to well defined thermodynamic state variables, even though actual microstates kept changing through Brownian motion. Gibbs’ original inquiry has reemerged in the context of material deformation of crystals but also other mechanically deformable systems: the farfromequilibrium response to mechanical deformation depends critically, especially at nonselfaveraging small scales, on the initial configuration of defects. Here, we provide a way that can be used, in principle, to estimate the probability of quenched microstates as a function of material parameters (plastic strain, size, and hardening coefficients).^{2}
The evidence for quenched disorder in initial defect microstructures has been accumulated through observations of abrupt plastic events or materialcrackling noise in a large variety of materials, such as FCC and BCC crystals,^{3,4,5,6} amorphous solids^{7} and also earthquake geological faults.^{8,9} This crackling noise^{10} has been commonly explained by random field models^{11,12} or interface depinning ones,^{13,14,15,16,17,18} where the major component is homogeneous solid elasticity, but also a spatially inhomogeneous and random distribution of local, quenched disorder (typically entering local flow stress information)^{4,17,19,20,21} and the allowed microstates are characterized by its stress and strain and minimize the elastic energy functional. The evidence of crackling noise has led to an extensive study of the local, statistical properties of abrupt events and their properties, such as their sizes, durations, average shapes, and their critical exponents. However, the major concern has been the fact that while homogeneous elastic properties are relatively straightforward to measure and test at virtually any scale,^{22} the model distribution of local, quenched disorder is elusive, despite its commonly observed signature response of stochastic plastic bursts.^{3,4,5,7} In this paper, we propose a feasible approach to “learn” the quenched disorder distributions directly from loadresponse timeseries: We argue that the full characteristics of the timeseries may unveil the information on the form of the quenched disorder distribution which is not available through typical temporally local observables (such as abrupt event size/duration).^{23} While the major motivation of this work stems from plastic deformation, this method is generally applicable across crackling noise phenomena, defined through timeseries of an applied field (magnetic field, force, stress) and the associated response variable (magnetization, displacement, and strain). We label this method as time series—machine learning (TSML). TSML is clearly limited by the physical applicability and completeness of the utilized model. In its current form, for clarity purposes, TSML makes use of an unsupervised machine learning approach through principal component analysis (PCA) and kmeans clustering. However, other unsupervised ML approaches could be successfully applied (see Supplementary Information for some examples). For demonstration purposes, TSML is applied to a typical model of plasticity,^{14} as well as to a general example of crackling noise.^{24}
Materials at small scales display size effects in material properties such as the strength and hardening coefficients.^{21,25,26,27,28,29,30} Stochastic yielding has been known to vanish with increasing system size.^{31} However, its mechanism has been debated.^{4,26} A plausible phenomenological scenario that explains the overall behavior is that the stochastic yield distribution at a representative volume has nontrivially large tails^{32} at the nanoscale (that could either be a fattailed/largekurtosis distribution or special distributions such as Weibull or Gumbel) and it gradually selfaverages into the central limit behavior as the volume increases (cf. Fig. 1). While still a phenomenologicallydriven hypothesis, the validity of this scenario would explain the selfconsistent emergence of size effects and stochasticity in smallscale plastic deformation, in a material and sample independent manner.^{4,5,33} In contrast to this picture, it is common in multiscale modeling of material deformation^{22,34} to assume that the statistics of yield parameters in a representative material volume are dominated solely by their averages. Such a severe assumption is clearly not valid in a scenario where the yielding distribution has a large variance, or it is nonGaussian in nature. Given that all current multiscale modeling approaches are fundamentally based on estimates of atomistic mechanisms that are active at the nanoscale, it is natural to consider and pursue the validation of such a scenario. For this reason, robust and swift methods are required for the derivation of local yield distributions as function of sample size, prior loading history and time.
In this paper, we propose a systematic way to solve the inverse problem of deriving the stochastic yield distributions from timeseries that may be the outcome of mechanical deformation or other crackling noise experiments. We use typical methods of unsupervised machine learning that naturally depend on physical modeling’s completeness and the possible universality class.^{4,23} The method does not aim to identify novel physical mechanisms of crackling noise; instead, it may provide a classification of parent quenched disorder distributions despite the existence of coexisting universal behavior. In the following, we perform simulations in two characteristic models of crackling noise, the T = 0 meanfield Random Field Ising model^{24} (RFIM) and the elastic longrange dipolar interface depinning model(LRDIDM),^{14,35} designed for crystalline or amorphous solids. In all these cases, we observe a capacity of the method to cluster model data despite the similarity of avalanche distributions, and then classify them according to the quenched yield distribution at a testing volume. Finally, the method may be also directly applied to actual experimental data on uniaxially compressed Ni micropillars, following prior work^{6} (see Section 4 and associated Fig. S6 of the Supplementary Information).
Results
TSML assumes N samples that generally would include the discrete timeseries of the applied field f_{ ij } and the material response m_{ ij } with time index i ∈ [1, T] and sample index j ∈ [1, N]. The complete data matrix is built through a multiscale description of the timeseries that may be performed through sequential estimation of the sliding window timevarying moments of order n, at scale p and for sample j^{36}
where X could be either m or/and f. Through the construction of these moments, all fluctuations are captured (that could be equivalently unraveled through histogram distributions (cf. Fig. 1b)). For each sample j, a vector may be constructed that contains all moments up to a maximum resolution scale p_{max} and max order n_{max}, with total length M = n_{max}p_{max}. The parameters n_{max} and p_{max} are controlled by the resolution of the timeseries and can be estimated through: \(p_{{\mathrm{max}}} \simeq {\mathrm{log}}_2(T){\mathrm{/}}2\) and n_{max} ∈ ,^{3,8} given that identified moments remain nonzero for the given resolution. Then, the effective data matrix D_{eff} is built through these vectors and has dimensions N × M. D_{eff} is used towards unsupervised machine learning through principal component analysis (PCA) and kmeans clustering.
Machine learning (ML) has had important success in various fields of science and engineering, but also materials science, for example accurate predictions of phase diagrams, crystal structures, and materials properties.^{37} Unsupervised learning through using PCA may work independentently of the input data types, as soon as the data cluster into distinct, distinguishable spaces that can be tracked by a clustering technique such as kmeans.^{38} PCA reduces the dimensionality of a dataset^{39} by identifying subspaces that demonstrate characteristic data variation. The success of the method depends on the capacity of data that originate in different quenched disorder distributions, to be spatially separated into distinct clusters in PCA space. The identification and quantification of the clusters is done through the kmeans method, which finds k clusters that minimize the pairwise squared deviations of points in the same cluster. It is worth noting that our utilized ML is just a general way to statistically distinguish different signals, where the differences are caused by quenched disorder. However, other ML methods, may be similarly applicable (see Supplementary Information, Section 1 and Fig. S1), or may be expected to perform much better, such as deep learning methods.^{40,41} These approaches will be studied in detail elsewhere.
The models
TSML is applied on two characteristic models of crackling noise: a the elastic longrange dipolar interfacedepinning model^{13} (LRDIDM) and b) the hysteretic RandomField Ising Model (RFIM).^{24} The interactions in both models can be short or long ranged, altering the universality class. The purpose of this paper is to demonstrate examples, so some limiting behaviors are considered: mean–field interactions in the LRDIDM^{42} and a complex, anisotropic and longrange kernel for the RFIM.^{14} The method is identically applicable for quite general interaction kernels in both models. In the LRDIDM, the Talamali et al.’s approach^{14} in two dimensions and assume that plastic deformation in disordered solids is modeled by the xxcomponent of the strain tensor \(\epsilon ^{(p)} \equiv \epsilon _{xx}^{(p)} =  \epsilon _{yy}^{(p)}\). The interaction due to local slip is the stress generated by local deformations of a random medium,^{43} which takes the form \(\tilde F_{{\mathrm{int}}}(k,\omega )\) = \((  {\mathrm{cos}}(4\omega )  1)\tilde \epsilon (k,\omega )\) where k, ω are the polar coordinates in Fourier space^{44} and \(\tilde \epsilon ,\tilde F_{{\mathrm{int}}}\) are the transforms of strain and interaction respectively. We initialize the L × L system with \(\epsilon\)(x) = 0 ∀ x and stress thresholds f_{ p }(x) are taken from a uniform distribution [0, 1). The external field F is increased adiabatically, and at each timestep, for all x that the total local force f(x) = F_{int}(x) + F − f_{ p }(x) becomes positive, there is a strain increase d\(\epsilon\)(x) randomly picked from a uniform [0, 1) distribution. The external stress is decreased by k/L^{2} at each timestep to cutoff avalanches—this cutoff mechanism resembles typical machine response at slow nominal straining.^{14,45} In the RFIM, the approach follows the basic algorithm,^{46,47} for which the zero temperature energy is \({\cal H}\) = \( \mathop {\sum}\nolimits_i \left( {H + J + h_i} \right)s_i\), with H the applied field, J the homogeneneous meanfield coupling, and h_{ i } the quenched random field that follows the distribution ρ(h) (to be specified). The local spin s_{ i } gives rise to the response \(m = \mathop {\sum}\nolimits_{i = 1}^N {\kern 1pt} s_i\). Starting with s_{ i } = −1∀i, H is increased until s_{ i } = +1∀i. In both models, it is possible to define abrupt events in terms of the size of response S (strain in the LRDIDM, magnetization per site in RFIM) during an event, as well as other quantities such as duration and energyrelease.^{4} The distribution of S has been studied extensively^{13,14,24} and it is known to be described, in the pinned regime, by P(S) ~ S^{−τ}\({\cal P}\)(S/S_{0}) where τ is ~ 1.3^{44} for the LRDIDM and 1.5 for RFIM.^{24} The cutoff function \({\cal P}\)(x) typically resembles an exponential function and S_{0} is a nominal maximum event size. In the depinned regime, there are systemspanning events, and therefore P(S) may include an additive component P_{inf} that scales with the system size. In this paper, we consider various behavioral regimes.
Quenched disorder distributions
Various possible random distributions are considered in both models. We consider: (a) normal distributions with mean μ and variance σ, (b) flat in a specified range, c)Weibull with probability density function g(f) = \(\frac{\alpha }{\lambda }(f{\mathrm{/}}\lambda )^{\alpha  1}e^{  (f{\mathrm{/}}\lambda )^\alpha }\) where δf ~ N^{−1/α} defines the range of the distribution. Here, α is the Weibull exponent^{48}, d) Gumbel distributions with g(f) = \(e^{  (f  \mu )/\beta }e^{  e^{  (f  \mu )/\beta }}{\mathrm{/}}\beta\) with mean μ + 0.57721β and variance π^{2}β^{2}/6. All these distributions are very typical in materials science, depending on the character and origin of randomness.^{49}
First, in order to demonstrate the applicability of TSML method in realistic plastic yield problems, we consider the behavior of the LRDIDM. As shown in Fig. 2, the input random distributions that range from Weibull, normal, and flat (cf. Fig. 2c), give a range of nonlinear stress–strain curves (cf. Fig. 2a), consistent with the commonly observed behavior in crystalline and amorphous systems.^{4} However, abrupt event distributions in terms of the strain burst sizes, are characteristically independent of the imposed distribution (cf. Fig. 2b). This result is consistent with previous studies in various avalanche models.^{50}
The presence of the abrupt event distribution validates TSML and facilitates a selfconsistent physical picture. In order for TSML to be applicable (since highorder temporal moments need to be nontrivial), the presence of noise is required in the behavior. If such uncertainty was absent, then a simple constitutive formula would suffice to perform parameter fitting of the behavior. The very presence of abrupt events in the behavior validates the application of TSML, since it implies that the overall behavior is driven by noise—thus, a noise distribution should be identified. In crystal plasticity, the presence of such abrupt events further validate their origin into a quench disorder distribution, therefore TSML may be attempted on reasonable physical grounds. In such a noisy case (like, for example, in the experimentally testable case of uniaxially compressed Ni micropillars,^{6} studied in Section 4 of the Supplementary Information), it makes no physical or practical sense to apply parameter fitting for parameters such as work hardening or yielding—these features are caused by a quenched disorder distribution that can be identified using TSML. In a natural way, work hardening is caused by the form of the quenched disorder distribution, which can be directly associated to the distribution of e.g., dislocation sources. If no such abrupt events are observed, TSML should be regarded as unphysical/unnnatural, since there would be no motivation for a quenched disorder distribution in material modeling. In this sense, the selfconsistency of the physical picture may be tested. The application of TSML on 100 samples (20 each random distribution) is shown in Fig. 2, justifying the applicability of the method.
Then, the RFIM is discussed, which does not include any spatial resolution. In this model, as shown in Fig. 3, 20 samples of seven different distributions are produced, (see Fig. 3c), totaling 140 samples. Then, the type of imposed randomness naturally influence the form of the M − H curves (cf. Fig. 3a), however the avalanche distributions (cf. Fig. 3b) demonstrate either a predepinning powerlaw behavior, or a spanning behavior, depending on the ratio of the disorder variance and the meanfield coupling strength. Then, all these samples are clustered and classified using TSML, and a very clear clustering effect that captures completely the original quenched threshold distribution is seen (cf. Fig. 3a).
The behavior of the RFIM is analyzed in further detail, by considering the limits of the applicability of TSML (for the method validation, see also Supplementary Information, Sections 2 and 3 and associated Figs. S3–S5). As shown in Fig. 4, the performance of the method is influenced as the number of samples S, the timeseries resolution R, and the number of degrees of freedom in the crackling system N. Three relevant observables are considered:

1.
The “inertia” of kmeans clustering \(\left\langle D \right\rangle\), which amounts to the sum of distances from the center of the assigned cluster;

2.
The F_{ β } score with β = 2 which amounts to the success of the classification approach (kmeans clustering) to identify correctly which samples are in the clusters;^{51} The definition of the F_{2} score amounts to a weighted average of precision and recall: F_{2} = 5 × precision × recall/(4 × precision + recall), where Precision is defined as TP/(TP + FP), and Recall is defined as TP/(TP + FN). The F_{2}score originates from the binary classification background, where we only have two classes that we want to distinguish are positive and negative. In this scenario there are four possible outcomes: TP (True Positive) → The object belongs to class positive and we classified it as positive, FP (False Positive) → The object belongs to class negative and we classified it as positive, TN (True Negative) → The object belongs to class negative and we classified it as negative, FN (False Negative) → The object belongs to class positive but we classified it as negative.

3.
The variance percentage v(i) of the PCA component i. As it can be seen, \(\left\langle D \right\rangle\) increases with resolution R (cf. Fig. 4a) and number of samples S (cf. Fig. 4b), which amounts to naturally “larger” clusters in the PCA space. However, F_{2} decreases with both R, S (cf. Fig. 4a,b), signifying that the larger clusters are also mutually well displaced, keeping the reliability of the method robust. Moreover, \(\left\langle D \right\rangle\) and F_{2} decrease drastically with the number of system degrees of freedom N. Finally, the variance is clearly distributed among the first three components, with more than 97% of the data variance.
Discussion
In this paper, we suggested a possible scenario for size effects in crystal plasticity and also, we proposed an explicit method for correlating the shape of noisy loadresponse curves in crackling noise phenomena to the shape of the quenched disorder distribution, which is the natural cause of the noise in the response timeseries. In the case of crystalline deformation of nanocrystals, we argued that this method and approach can investigate in detail the uncertainty in the work hardening behavior during statistical sampling of the deformation, in order to provide an estimate of the originating quenched disorder distribution that may relate to various physical quantities (e.g., precipitate defects or preexisting immobile dislocation forests). This approach can be equivalently thought as an unbiased “fitting” approach for workhardening curves using stochastic distributions (instead of constitutive formulas that would only utilize average behaviors). In this paper, we argued that in various physical phenomena (especially ones that display crackling noise effects), the very nature of the problem enforces the use of stochastic distributions in order to be able to pursue multiscale modeling using physically accurate (statistically) microscopic information.
The TSML method provides a unique description of the quenched disorder distribution in the limit of a large number of test samples, in the sense that the PCA clusters separate distinctly and can be easily distinguished by a classifier (e.g. Kmeans). In the context of our theoretical models, we provide such evidence (cf. Fig. S3) in both the RFIM and the LRDIDM plasticity models (cf. Fig. 4). Also, it is natural to check the cluster centers (identified through the centers of the KMeans clusters) and how they progress with the increase in the number of samples. As it is discussed in the Supplementary Information (cf. Fig. S4) the average cluster position for all clusters in the RFIM case (cf. Fig. S2) is quite stable with the increase of the number of samples. Analogous results are found for the LRDIDM model. Moreover, uniqueness is critically dependent on the physical completeness of the utilized theoretical model. In that respect, the uniqueness of the TSML method can be established by the generality of its applicability: the LRDIDM and RFIM models can be generalized by adding short or long range interactions, valid for various applications. We tested TSML for a number of additional possibilities, such as the 3DRFIM hysteretic model,^{47} the model of ref. ^{35} for continuum dislocation plasticity, and also discrete dislocation dynamics models of crystal plasticity.^{52} TSML appears to be generally applicable towards identifying quenched disorder distributions, appropriately defined for each model of reference. Finally, in continuum plasticity models the threshold distribution edge is critical for the yield behavior during avalanches. Thus, if two distributions have large overlap near the edge, then the limit of number of samples needed to distinguish different quenched disorder behaviors is prohibitively large (see SI’s Section 3 and Fig. S5 for a characteristic example, where overlapping Gaussian distributions lead to corresponding distinguishable but highly overlapping PCA maps).
The validation of the method can be performed by separating data sets into “experiment” and “simulations”, with the experimental data sets being the ones that will be assessed, while all the rest contributing to the classification scheme. As it is shown in Fig. S3, we separated statistical samples into two equal subsets for both the RFIM and the LRDIDM plasticity model, (“experimental”/testing and “simulations”/training), and then the experimental data sets are projected on the trained PCA components. The testing data set successfully correlates in a percentage of around 90% for this size of the data set (which includes 50–100 samples per quenched noise distribution). However, we acknowledge that the method requires further validation, especially in the experimental front: In the current work, we showed how the method may be practically applied to actual experimental data on uniaxially compressed Ni micropillars, following prior work^{6} (see Supplementary Information, Section 4 and Fig. S6). Through this investigation, a quenched disorder distribution with Weibull statistics and \(\alpha \simeq 2\)) was shown to be consistent with various theorybased suggestions, originating on Weibull statistics of dislocation sources (singlearm etc.) (see for example ref. ^{53}). Overall, the method is a concrete, unsupervised classifier of noisy stressstrain curves (and possibly other timeseries) that takes into account all moments of the noise, beyond the average behavior.
Through this work, it has become clear that it is viable to distinguish random threshold distributions from stochastic fieldresponse timeseries in typical crackling noise models. TSML can be efficiently implemented in order to distinguish quenched stochastic yield distributions in plastic deformation, that may originate in nanoscale experimental data. For this reason, high throughput experiments are required in order to efficiently probe the uncertainty in well defined deformed volumes. These distributions help define the material properties, if a generalized definition of a “material” is used where processing and deformation history is considered as a defining characteristic. In the future, the target would be to classify these distributions for various experimental cases in the processing mechanics of various materials and also various prior loading histories, aiming at producing a library of stochastic yield distributions that can be implemented in multiscale mechanics models,^{22,34} in a similar way that interatomic potentials libraries^{54} exist for molecular dynamics models.^{55} The crucial importance of such a physical picture is the transition to predictive models that are based on the selfconsistent and intrinsically outofequilibrium statistical mechanics of crystal and amorphous plasticity.
Methods
For the plasticity model, a custom Python code is used, solving the model on a 256 × 256 grid. The samples were loaded on well equilibrated strain configurations, that were generated by repeated loading/unloading of the samples. k is selected as 0.01 for these simulations. Typical cellular automaton rules are used, as described in the text. For the randomfield model, the coupling J is chosen to be unity and the number of spins is chosen to be 512. For applying PCA, the sklearn library of Python is used: Nonlinear PCA is accomplished by the application of Singular Value Decomposition, where the data matrix may be decomposed to a diagonal matrix of singular values S and left/right singular vectors V/U, D_{eff} = V^{T}SU. The V vectors that correspond to the largest singular values, capture the most characteristic temporal behavior. For solving the kmeans problem, Lloyds algorithm^{56} is used; the average complexity is given by O(knT), were n is the number of samples and T is the number of iteration, while the worst case complexity is given by O(N^{k+2/M}).^{57} For more details, see also the Supplementary Information. The computational codes for the application of TSML on generic stress–strain or other timeseries are available upon email request at stefanos.papanikolaou@mail.wvu.edu.
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Additional information
Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
 1.
Georgii, H. O. Gibbs measures and phase transitions, Vol. 9, De Gruyter Studies in Mathematics Series, (De Gruyter, Berlin, 2011).
 2.
Truesdell, C. & Toupin, R. In Principles of Classical Mechanics and Field Theory/Prinzipien der Klassischen Mechanik und Feldtheorie (ed Flügge, S.) 226–858 (Springer, New York, 1960)
 3.
Uchic, M. D., Shade, P. A. & Dimiduk, D. M. Plasticity of micrometerscale single crystals in compression. Annu. Rev. Mater. Res. 39, 361–386 (2009).
 4.
Papanikolaou, S., Cui, Y. & Ghoniem, N. Avalanches and plastic ow in crystal plasticity: An overview. Model. Simul. Mater. Sci. Eng. 26, 013001 (2017).
 5.
Greer, J. R. & De Hosson, J. Th. M. Plasticity in smallsized metallic systems: Intrinsic versus extrinsic size effect. Prog. Mater. Sci. 56, 654–724 (2011).
 6.
Papanikolaou, S. et al. Quasiperiodic events in crystal plasticity and the selforganized avalanche oscillator. Nature 490, 517–521 (2012).
 7.
Sethna, J. P. et al. Deformation of crystals: connections with statistical physics. Annu. Rev. Mater. Res. 47, 217–246 (2017).
 8.
Uhl, J. T. et al. Universal quake statistics: from compressed nanocrystals to earth3 quakes. Sci. Rep. 5, 16493 (2015).
 9.
Bak, P., Christensen, K., Danon, L. & Scanlon, T. Unified scaling law for earthquakes. Phys. Rev. Lett. 88, 178501 (2002).
 10.
Sethna, J. P., Dahmen, K. A. & Myers, C. R. Crackling noise. Nature 410, 242–250 (2001).
 11.
Kuntz, M. C. & Sethna, J. P. Noise in disordered systems: the power spectrum and dynamic exponents in avalanche models. Phys. Rev. B 62, 11699–11708 (2000).
 12.
Martens, K., Bocquet, L. & Barrat, J.L. Spontaneous formation of permanent shear bands in a mesoscopic model of owing disordered matter. Soft Matter 8, 4197–4205 (2012).
 13.
Fisher, D. S. Collective transport in random media: from superconductors to earthquakes. Phys. Rep. 301, 113–150 (1998).
 14.
Talamali, M., Petäjä, V., Vandembroucq, D. & Roux, S. Avalanches, precursors, and finitesize uctuations in a mesoscopic model of amorphous plasticity. Phys. Rev. E 84, 016115 (2011).
 15.
Nattermann, T., Stepanow, S., Tang, L.H. & Leschhorn, H. Dynamics of interface depinning in a disordered medium. J. De Phys. II 2, 1483–1488 (1992).
 16.
Marchetti, M. C., Middleton, A. A. & Prellberg, T. Viscoelastic depinning of driven systems: meanfield plastic scallops. Phys. Rev. Lett. 85, 1104 (2000).
 17.
Zaiser, M. Scale invariance in plastic ow of crystalline solids. Adv. Phys. 55, 185–245 (2006).
 18.
Laurson, L. & Alava, M. J. Dynamic hysteresis in cyclic deformation of crystalline solids. Phys. Rev. Lett. 109, 155504 (2012).
 19.
Ovaska, M., Laurson, L., & Alava, M. J. Quenched pinning and collective dislocation dynamics. Sci. Rep. 5,10580, 1–8 (2015).
 20.
Ispánovity, P. D. et al. Avalanches in 2d dislocation systems: plastic yielding is not depinning. Phys. Rev. Lett. 112, 235501 (2014).
 21.
Ispánovity, P. D. et al. Average yielding and weakest link statistics in micronscale plasticity. Acta Mater. 61, 6234–6245 (2013).
 22.
Asaro, R. & Lubarda, V. Mechanics of Solids and Materials (Cambridge University Press, Cambridge, 2006).
 23.
Papanikolaou, S. et al. Universality beyond power laws and the average avalanche shape. Nat. Phys. 7, 316–320 (2011).
 24.
Sethna, J. P. et al. Hysteresis and hierarchies: dynamics of disorderdriven firstorder phase transformations. Phys. Rev. Lett. 70, 3347 (1993).
 25.
Uchic, M. D., Dimiduk, D. M., Florando, J. N. & Nix, W. D. Sample dimensions inuence strength and crystal plasticity. Science 305, 986–989 (2004).
 26.
Maass, R. & Derlet, P. M. Microplasticity and recent insights from intermittent and smallscale plasticity. Preprint at arXiv:1704.07297 (2017).
 27.
Nix, W. D. & Gao, H. Indentation size effects in crystalline materials: a law for strain gradient plasticity. J. Mech. Phys. Solids 46, 411–425 (1998).
 28.
Maaß, R., Wraith, M., Uhl, J. T., Greer, J. R. & Dahmen, K. A. Slip statistics of dislocation avalanches under different loading modes. Phys. Rev. E 91, 042403–042411 (2015).
 29.
Tsekenis, G., Uhl, J. T., Goldenfeld, N. & Dahmen, K. A. Determination of the universality class of crystal plasticity. EPL 101, 36003 (2013).
 30.
Dahmen, K. A., BenZion, Y. & Uhl, J. T. A simple analytic theory for the statistics of avalanches in sheared granular materials. Nat. Phys. 7, 554–557 (2011).
 31.
Csikor, F. F., Motz, C., Weygand, D., Zaiser, M. & Zapperi, S. Dislocation avalanches, strain bursts, and the problem of plastic forming at the micrometer scale. Science 318, 251–254 (2007).
 32.
Foss, S., Korshunov, D. & Zachary, S. An Introduction to HeavyTailed and SubexPonential Distributions, Vol. 6 (Springer, New York, 2011).
 33.
Greer, J. R., Oliver, W. C. & Nix, W. D. Size dependence of mechanical properties of gold at the micron scale in the absence of strain gradients. Acta Mater. 53, 1821–1830 (2005).
 34.
Lebensohn, R. A. & Tomé, C. N. A selfconsistent viscoplastic model: prediction of rolling textures of anisotropic polycrystals. Mater. Sci. Eng. A 175, 71–82 (1994).
 35.
Zaiser, M. & Moretti, P. Fluctuation phenomena in crystal plasticitya continuum model. J. Stat. Mech. Theory Exp. 2005, P08004 (2005).
 36.
Scoville, J. Stochastic timeseries spectroscopy. Preprint at arXiv:1504.01436 (2015).
 37.
Mueller, T., Kusne, A. G. & Ramprasad, R. Machine learning in materials science: Recent progress and emerging applications. In Reviews in Computational Chemistry, vol. 29,1st edn (ed Parrill, A. L. & Lipkowitz, K. B.) 186273 (J. Wiley & Sons, Inc., New Jersey, 2016).
 38.
Press, W. H., Flannery, B. P., Teukolsky, S. A., Vetterling, W. T. & Kramer, P. B. Numerical recipes: the art of scientific computing, New York: Cambridge University Press (1986).
 39.
Greenewald, K. & Hero, A. O. Robust kronecker product pca for spatiotemporal covariance estimation. IEEE Trans. Signal Process. 63, 6368–6378 (2015).
 40.
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).
 41.
Goodfellow, I. et al. Generative adversarial nets. In Proceedings: Advances in neural information processing systems, 2672–2680 (2014).
 42.
Dahmen, K. A., BenZion, Y. & Uhl, J. T. Micromechanical model for deformation in solids with universal predictions for stressstrain curves and slip avalanches. Phys. Rev. Lett. 102, 175501 (2009).
 43.
Eshelby, J. D. The determination of the elastic field of an ellipsoidal inclusion, and related problems. Proc. R. Soc. Lond. A 241, 376–396 (1957).
 44.
Budrikis, Z. & Zapperi, S. Avalanche localization and crossover scaling in amorphous plasticity. Preprint at arXiv:1307.2135 (2013).
 45.
Papanikolaou, S. Shearing a glass and the role of pinning delay in models of interface depinning. Phys. Rev. E 93, 032610 (2016).
 46.
Kuntz, M. C., Perkovic, O., Dahmen, K. A., Roberts, B. W. & Sethna, J. P. Hysteresis, avalanches, and noise: numerical methods. Preprint at arXiv:condmat/9809122 (1998).
 47.
Kuntz, M. C., Perkovic, O., Dahmen, K. A., Roberts, B. W. & Sethna, J. P. Hysteresis, avalanches, and noise. Comput. Sci. Eng. 1, 73–81 (1999).
 48.
Weibull, W. A statistical distribution function of wide applicability. J. Appl. Mech. 18, 293–297 (1951).
 49.
Bouchaud, J.P. & Mézard, M. Universality classes for extremevalue statistics. J. Phys. A. 30, 7997 (1997).
 50.
Le Doussal, P. & Wiese, K. J. Driven particle in a random landscape: disorder correlator, avalanche distribution, and extreme value statistics of records. Phys. Rev. E 79, 051105 (2009).
 51.
BaezaYates, R. & RibeiroNeto, B. Modern Information Retrieval. 327–328. (Addison Wesley, Boston, 2011).
 52.
Papanikolaou, S., Song, H. & Van der Giessen, E. Obstacles and sources in dislocation dynamics: Strengthening and statistics of abrupt plastic events in nanopillar compression. J. Mech. Phys. Solids 102, 17 (2015).
 53.
Parthasarathy, T. A., Rao, S. I., Dimiduk, D. M., Uchic, M. D. & Trinkle, D. R. Contribution to size effect of yield strength from the stochastics of dislocation source lengths in finite samples. Scr. Mater. 56, 313–316 (2007).
 54.
Tadmor, E. B., Elliott, R. S., Sethna, J. P., Miller, R. E. & Becker, C. A. The potential of atomistic simulations and the knowledgebase of interatomic models. JOM 63, 17–17 (2011).
 55.
Frenkel, D. & Smit, B. Understanding Molecular Simulation: From Algorithms to Applications (Academic Press, San Diego, 2002).
 56.
Lloyd, S. Least squares quantization in pcm. IEEE Trans. Inf. Theory 28, 129–137 (1982).
 57.
Arthur, D. & Vassilvitskii, S. How Slow is the kmeans Method? 144–153 (Proceedings of the twenty second annual symposium on Computational geometry, ACM, 2006).
Acknowledgements
D. M. Dimiduk, H. Song, H. Yavas, E. Van der Giessen are gratefully thanked for inspiring discussions. This work is supported by the US Department of Energy, Basic Energy Sciences, Award Number DESC0014109, and by the Department of Commerce  NIST Award 70NANB15H230N.
Author information
Affiliations
Department of Mechanical & Aerospace Engineering, The West Virginia University, Morgantown, WV, 26506, USA
 Stefanos Papanikolaou
Department of Physics, The West Virginia University, Morgantown, WV, 26506, USA
 Stefanos Papanikolaou
Department of Mechanical Engineering, The Johns Hopkins University, Baltimore, MD, 21218, USA
 Stefanos Papanikolaou
Authors
Search for Stefanos Papanikolaou in:
Contributions
The author designed and performed the research study described in this manuscript.
Competing interests
The authors declare no competing interests.
Corresponding author
Correspondence to Stefanos Papanikolaou.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.