A framework for glass-box physics rule learner and its application to nano-scale phenomena

Attempts to use machine learning to discover hidden physical rules are in their infancy, and such attempts confront more challenges when experiments involve multifaceted measurements over three-dimensional objects. Here we propose a framework that can infuse scientists’ basic knowledge into a glass-box rule learner to extract hidden physical rules behind complex physics phenomena. A “convolved information index” is proposed to handle physical measurements over three-dimensional nano-scale specimens, and the multi-layered convolutions are “externalized” over multiple depths at the information level, not in the opaque networks. A transparent, flexible link function is proposed as a mathematical expression generator, thereby pursuing “glass-box” prediction. Consistent evolution is realized by integrating a Bayesian update and evolutionary algorithms. The framework is applied to nano-scale contact electrification phenomena, and results show promising performances in unraveling transparent expressions of a hidden physical rule. The proposed approach will catalyze a synergistic machine learning-physics partnership. Using machine learning to interpret complex phenomena and reveal unknown physical rules is an active research frontier. Here, the authors address how to combine basic physics, a convolved information index, and a transparent flexible link function to identify mathematical expressions of the underlying physical processes of nanoscale contact electrification.


W
2][3] .But, the extension of deep learning to three-dimensional (3D) data sets is an active research area facing many theoretical and computational challenges 4,5 .Although advanced machine learning (ML) methods gradually master and replace the complex actions that require the humanlevel intelligence and control capabilities 6,7 , there exists a deep chasm between advanced ML methods and mature areas of science and engineering.Often, successful ML methods are limited to opaque decision-making capability about simple tasks.A direct adoption of an advanced ML method can hardly guarantee successful learning and prediction of the real-world experimental data that often involve 3D objects and multifaceted physical measurements.Scientific discovery often follows a typical process -measuring complex physical quantities, investigating the observed data, and deriving a rule that best describes the target physical phenomenon.This fundamental process poses several obstacles to the advanced ML methods.Physical measurements are often scattered over complex 3D spaces or objects.A few apparent descriptors are insufficient for learning.The size and volume of physical experimental data are relatively small for direct adoption of advanced ML methods.Scientists seek to find "expressions" of physical rules, not merely a "black-box" prediction.
Recently, a new research paradigm emerges to address these hurdles, i.e., the so-called physics-or theory-guided ML paradigm 8 .Domain science is used to help ML to predict physically sound solutions, e.g.human neurology 9 , quantum mechanics 10 , and heterogeneous composite structures 11 .Governing physical rules (often partial differential equations) are often fed into ML, e.g., geophysics by an hierarchical graph model 12 and wave and fluid flow by deep learning 13 .The present goal of searching hidden rules shares the similar notion with auto-encoder methods in pursuit of salient latent terms 14 .Still, this study differs from prior works in several aspects.First, this framework focuses on obtaining transparent expressions of hidden physical rules.Second, it seeks to leverage basic physics and scientists' experience rather than relying upon the predefined global governing equations.Third, it is built upon two ingredients, the externalized multi-layer convolved information index and flexible link functions which reinterpret and inherit the deep learning's successful philosophies.Last, the identified expressions can easily evolve by embracing other physics and more experience.In essence, this framework is purely data-driven, requiring no distributional assumptions about priors and posteriors compared to 14 .A comparable recent work to our framework would be 15 which pursues both hidden coordinates and a parsimonious form of governing equation of dynamic systems although the key methods and procedures are different.
This research proposes a framework for a constantly evolving, "glass-box" (as opposed to the black-box) rule learner which can help extract hidden physical rules behind complex real-world phenomena by integrating experienced scientists' knowledge and the central notions of deep learning.In particular, we propose a framework that can facilely deal with multifaceted measurements of 3D specimens by incorporating basic physics and scientists' experience.Also, the framework can extract hidden rules in terms of transparent expressions which will evolve with increasing data through a Bayesian update scheme.
For feasibility tests, this research applied the framework to nano-scale contact electrification (CE), a process which is difficult to elucidate due to its complex geometry, intractable small length scale, and interaction of several physical mechanisms such as friction, fracture, demolding, and friction-induced charging (called tribocharging).The feasibility tests show a promising performance of the proposed framework in reproducing complex 3D distributions of CE-induced electric potentials over the 3D point cloud along with transparent expressions.

Results
Quantified basic physics and experiences of scientists.The first question is about how to quantify and transform basic physics and scientists' experiences into ML-friendly quantities.Depending upon the underlying physics and domain-specific knowledge, the resultant quantities may differ in forms and ranges.This study presents an example of how to quantify and infuse the scientists' simple experience.The target experiments adopted for the present feasibility tests is about CE during the demolding process of the hardened polydimethylsiloxane (PDMS) specimen from the polyethylene terephthalate (PET) base mold which holds unique nano-scale patterns.The physical measurements are distributed over the 3D point cloud and each point is on a smooth surface (Fig. 1a, b).The demolding occurs in a specific direction, e.g., left-to-right (Fig. 1c, d), and the separation of PDMS from the patterned PET mold (Fig. 1e) has an important role in determining the direction of crack propagation and frictional stress development (Fig. 1f, g), which scientists know as prior knowledge.We quantify this basic experience by a directional vector, u ¼ fu x ; u y ; u z g 2 R 3 .It is true that well-known analyses such as principal component analysis or Isomap 16 with raw data may identify some information about the directivity of demolding test or reduced features, but this study regards scientists' basic experiences as a starting point and focuses on how to infuse them into the subsequent rule learning.In particular, u = {0.001,0, 1} was used to mimic left-to-right demolding process along the global X-axis whereas u = {0.001,−0.0014, 1}, for the tilted demolding process.We denote the unit normal vector to the friction surface as n ¼ fn x ; n y ; n z g 2 R 3 , and each point has unique normal vector (Fig. 1b).Assuming scientist's decent control of demolding, u is constant for all points of an individual specimen while n is varying at every point due to nano-scale patterns (Fig. 1e).Another important physical principle stemming from the scientists' prior knowledge is that the frictioninduced electric potential is proportional to the surface friction.Physically, the more surface friction, the more friction-related fracture energy can develop.Thus, to systematically quantify and infuse the degree of how much the demolding direction is aligned with the tangential direction of the surface friction, we leverage simple terms of the projection of demolding direction onto the friction surface, which can be mathematically described by where u is the global demolding direction vector and n is the surface normal vector.Since the current experimental data set consists of the observed electric potential over the 3D point cloud, the creation of the point-wise n requires a specific algorithm, which is described in Supplementary Note 1. Aiming at a simple range term that accommodates all these experiences and physical knowledge, we define a point-wise information index (II).We denote the local II (i.e., the degree of projection of the demolding on the friction surface) as II local 2 R À1; 1 ½ : where sgn(u, n)= +1 if cos À1 ð u XY Án XY ju XY jjn XY j Þ < π=2; −1 otherwise; u XY = {u x , u y }, n XY = {n x , n y }.One simple physical reason behind this sgn(.)definition is that the specimens are assumed to lie on the XY plane and demolding occurs mainly toward the Z direction.Thus, sgn(u, n) gives +1 when the directions of demolding and surface normal are positively aligned, and −1 otherwise.It should be noted that the range of II local ∈ [−1, 1] has several physical meanings.It will help confine the convolved II in the simple range [0, 1], which is compatible with the support range of the adopted cubic spline-based link functions.The simple monotonic range from −1 to 1 retains the similarity to activation values of typical ML methods.Clear physical interpretations exist: +1 indicating the positively aligned tangential friction whereas −1 meaning the negatively aligned condition.As long as these characteristics are satisfied, there exists ample room for different expressions of information index depending upon domainspecific knowledge.In computational mechanics, for instance, the spatial proximity between disparate materials can be represented by another form of information index by using virtual stress excitation 11 .It should be noted, however, that the local index by itself may be insufficient to capture the interactions among nearby physical quantities and may suffer from irregularity and local spikes.These drawbacks are to be resolved by the convolved information index, similar to multi-layered convolution in deep learning.Example plots of the local information index calculated with practical nano-scale measurements are presented in Supplementary Note 2.
Generation of convolved information index.One of the key enablers of deep learning is the convolution process that allows information integration.If convolution is done over a spatial domain, ML can better understand the interaction of spatially distributed information and hidden patterns while applied to the temporal domain, the interactions between past and present information can be elucidated 5,17 .Inheriting the philosophy of the deep learning's convolution, the proposed framework seeks to spatially integrate the local II over the 3D point cloud.The key difference of this study's convolution is that we "externalize" the deep learning's multi-layered convolutions by conducting multiple convolutions at the input information level, not in the opaque deep network layers.Rather than a uniform integration, we adopt a weighted integration using Gaussian weight function (denoted ω) to realize the proximity-proportionate importance of information.This process creates "convolved" information index denoted as II, which is calculated by where the factor 1/2 and addition of 1 in the parentheses are for normalization to [0, 1].For the integration over a discrete 3D point cloud, with the uniformity assumption over a small patch we have an approximation of where A (i) is a patch area of the friction surface centered at the point i and calculated over the 3D point cloud (see the algorithm in Supplementary Note 1); ξ (j) is the position vector of the point j.
As a primary spatial weighting function, we used the Gaussian function which has been widely used in other fields with different  18,19 ).
names (other weighting functions may be used): where position vectors x (i) , ξ ∈ V; L is the influence range parameter; N is the dimension parameter (herein N = 3 for the 3D point space).Of particular importance is the radius of influence range L. With a larger value of L, the information over a large space can be incorporated, but over-smoothing effects may occur; with a smaller L, adjacent information to the current point becomes more important, but the near information may be peaked which may lead to an over-fitting issue (visual comparisons of varying L are presented in Supplementary Note 2).When compared to the convolution of image data, the larger L may correspond to a large image patch used for the convolution.Figure 2 illustrates how the spatial information integration is conducted over the 3D point cloud, and also compares its similarity to the spatial or temporal convolutions used in typical deep learning methods.In the multi-layered deep learning, convolution can take place multiple times over several layers.Analogous to this philosophy, we generate multiple convolved IIs with various values of L and consider their interactions via multiple link functions.
Feasibility tests with nano-scale experimental data.To confirm the feasibility of the proposed framework, we applied it to real experimental data sets of nano-scale electric potentials.The charge values were measured by Kelvin probe force microscopy over patterned nano-cups represented by the 3D point cloud (Figs.3-5; raw test data are adapted from 18,19 ).Raw data in Fig. 3a, b are from 4000 nm × 4000 nm square area with interval of 7.8125 nm, and Fig. 3c, d show the reproduced potentials using our framework.Raw data in Fig. 4a, b are measured over 3000 nm × 3000 nm square area with interval of 5.86 nm, and Fig. 4c, d show the corresponding predictions.To confirm general learning capability, the Bayesian-evolution training is conducted on two seemingly disparate experimental data sets.In particular, the proposed glass-box learning begins from raw data of Fig. 3a, and then the identified rules are inherited to the next learning of Fig. 4a.After each training finishes, the best-so-far rules are used to reproduce the distribution of electric potentials of training data sets as shown in Fig. 3c, d and Fig. 4c, d.Out of many possible combinations of multiple link functions (LFs) with different influence ranges, the best-so-far rule is found to be the combination using 3 LFs of L = 8, 50, 100 nm (here, L is the spatial convolution influence range).The mathematical expression is attained by the proposed framework as where θ (l) = {a (l) , x *(l) } are the free parameters of the lth and their values are summarized in Supplementary Table 1.Then, the identified physical rules are used to predict substantially different experimental data sets, Fig. 5a, b.Visual illustrations of the identified rules are presented in Supplementary Note 3. The identified rule of Eq. ( 6) appears to be able to predict the spatial distributions of potentials which are substantially irregular and complex.It should be noted that the electric potential is a relative quantity, and thus we focus on overall shapes of the potential rather than specific values of certain locations.The best-so-far rule appears to reasonably reproduce the peaks and patterns over the 3D space.As anticipated, the identified rule is defined at the material point level, and thus the prediction can be done regardless of substantial differences in demolding directions, nano-patterns, and different geometries.Since the inheritance takes place for underlying rule expressions, the learned physical rules can be used for general cases (Fig. 5) regardless of substantial differences of nano-scale experiments.

Discussion
One of the strengths of the proposed glass-box learner is its transparency and clear interpretability.In view of Eq. ( 3), each convolved information index may be regarded as a marginal likelihood with the Gaussian conditional probability (dimension (1) N = 3; influence range L): Here, the factor 1/2 and addition of 1 used for normalization (Eq. 3) do not change the intended meaning of the likelihood.The best-so-far LFs are regarded as a weighted summation of constant, linear, quadratic, and cubic polynomials, respectively).
If we decompose the best-so-far LF's cubic spline basis, it will help elucidate the probable relationships; each polynomial basis informs the dominant relationship (e.g., linear, parabolic or high nonlinearity) between the target physics and II (compare Supplementary Figures 5 and 6 in Supplementary Note 4).This clear interpretability may help explicitly reveal a dominant relationship and approximation in subsequent investigations of domain scientists.
From the statistical angle, all the bases take the marginal likelihood In particular, the observed physical rule behind the electric potential and our information index is written in an explicit form as ð Þ is basis of the adopted cubic spine splines for LFs (details about this basis are presented in the section of Flexible and transparent linf functions in Methods); target ΔV and local index II local are defined at each material point, i.e. each data point of the 3D point cloud; θ (l) = {a (l) , x *(l) } are summarized in Supplementary Table 1.By all means, this identified physical rule about ΔV is not a fixed, unique form.Rather, it suggests a best-sofar probable and physically explainable expression regarding the target phenomena.There are myriad of ways to use this identified physical rule based on scientists' knowledge.For instance 20 , suggested a plausible causal pathway between the frictional charge and contact-surface temperature difference, ΔV ∝ ΔT, at nanoscale surfaces.Since the local information index always preserves physical meaning, e.g., direction alignment of demolding and tangential friction in Eq. ( 2 19 ).c-d Reproduced results using the identified physical rule with the best-so-far 3 link functions of convolved IIs in Eq. ( 6).
the combination of a Bayesian update and evolutionary algorithm, which may translate into a maximization of log-likelihood, requiring no distributional assumptions about the priors and posteriors.The framework seeks to identify explicit mathematical expressions about the target physical phenomenon and IIs via LFs.Practical feasibility tests with complex nano-scale CE phenomena showed a promising capability of the framework in identifying reasonable expressions about intractable electric potential distribution across the 3D point cloud measured from nano-patterned specimens with varying geometry and demolding directions.By virtue of the transparency of LFs, the revealed physical rule will serve as a gateway for numerous possible rules, and with such a fertile partnership with ML, the new discovery will return to the hands of scientists in diverse disciplines.In general, this framework can be applied to other disciplines where multi-dimensional, multi-faceted physical data sets and the limited access to internal states pose challenges: e.g., new metamaterial design, geophysics, and complex heterogeneous bodies 11 .
Overall, this study demonstrates how advanced ML methods inspire domain scientists and how they can combine to tackle hitherto intractable scientific questions, promoting more crossdisciplinary collaborations.This framework will spark imagination of scientists to develop their domain-specific IIs and also invigorate ML community to embrace IIs into their successful platforms.
Still, there are ample rooms for further sophistication of this initial framework: to name a few, more flexible and versatile basis for link functions 24 or use of an extensive library of possible mathematical expressions as done in 15 ; more advanced ML methods and evolutionary algorithms for efficient searching of latent feature and high-dimensional parameters 14,25 ; automated approaches to finding optimal layers of LFs; integration of diverse multiphysics rules such as nano-scale heat transfer, nano electrification and mechanical friction.Toward any extensions and applications, ideas in this study will serve as a fertile ground for departure.

Methods
Flexible and transparent link functions.In deep learning methods, the hidden layers' weights embed important relations and interactions of variables and neurons in terms of numerical values.The meaning of weights gradually becomes opaque as the number of hidden layers of convolution processes increases.To emphasize higher transparency and interpretability, this study suggests using a LF that describes the impact of convolved information index II on the hidden physical rules in terms of clear mathematical expressions.LF is denoted as L II; θ À Á where θ is a set of free parameters prescribing the LF.Since the true form of a hidden physical rule remains unknown, this study suggests to borrow the power of an evolutionary algorithm to enable LF to continue to learn, train, and evolve.The framework focuses on evolving θ of LF, rather than finding a single set of parameters.Although there is no restriction on the selection of specific LF, this study chose the cubic spline basis, which is highly smooth and flexible.The cubic spline curves consist of a few cubic polynomials connected (at the so-called knots) such that the curves are continuous up to the second derivatives 24 .For example, when a practical cubic spline basis 26 (denoted as b i ) is adopted, LFs are given as Here, x * i is ith knot location.Therefore, to completely describe one LF, we need to identify p + (p − 2) unknowns, i.e. a = {a 1 , …¸a p } and For brevity, we denote the total unknown parameters as θ = {a,  19 ).c-d Reproduced results using the identified physical rule with the best-so-far 3 link functions of convolved IIs in Eq. ( 6).A Bayesian update is used to inherit trained physical rule from Fig. 3.
x * }.Regarding the LF's flexibility, when a physical relationship is monotonic, a smooth shape LF may suffice, but when the hidden relations are in complex relations, the more flexible shape may be better.The adopted cubic spline basis can accommodate all of these characteristics.It should be emphasized that the adopted cubic spline basis is not used for direct regression.Rather than aiming at direct regression, this framework seeks to leverage the flexibility and transparency of cubic spline basis for finding "expressions" of LFs.Comparable to the multiple convolution over layers of deep learning, we allow an interaction of multiple LFs with different influence ranges.Thus, a target physical response ΔV is, in general, obtained with θ (l) = {a (l) , x *(l) } by where ΔV (i) is the predicted electric potential at point (i), is the convolved II associated with lth influence range L (l) at the point (i), and the number of total LFs is denoted by n l which is also to be determined through learning.After we considered various numbers of LFs and different combinations such as additive or multiplicative, we found that n l =3 with influence ranges (e.g., L (l) = 8, 50, and 100 nm) and multiplicative combinations as in Eq. ( 11) lead to reasonable learning.As in the multiple convolutions over several layers of deep learning, this framework compares possible combinations of multiple LFs allowing interaction among different ranges' information and finds the best performance case as illustrated in Fig. 2. Unlike hyperparameters of other ML methods, the proposed LFs seek to offer "expressions" which will be inserted or interwoven with other physical phenomena.
Bayesian update ingrained into an evolutionary algorithm.Aiming at no distributional assumptions about the priors/posteriors as well as pursuing smooth evolution, this study adopts the combination of a Bayesian update and a modified genetic algorithm 11 .The key evolutionary algorithm involves the preparation of initial generation, organism-wise evaluation of fitness score, and fitness-based spawning of the next generation.The prior best physical rules can be smoothly inherited by the Bayesian update-based fitness proportionate probability (FPP) rule.To accelerate the evolution speed of the modified genetic algorithm, an individual variable-wise gene cross-over scheme 27 has been used, and the changing search range scheme 28 is used in an iterative manner for better performance as successfully done in 11 .Since an individual s realizes a candidate of θ = (a, x * ) in current generation S, the raw cost of an individual s, termed as J s ð Þ, is calculated by where ΔV (i) is the true (measured) physical response at point (i) and ΔV (i) is the predicted response by Eq. ( 11) with θ.This fitness score simply means the mean squared errors of the observed and the predicted surface potentials.Then, following typical genetic algorithm procedures 11,29,30 , the normalized fitness score F of an individual is calculated by where s denotes an individual in the entire generation S. Learning a hidden physical rule is not a one-time task, rather a continuous activity.As diverse new experimental data become available, the physical rule learner must embrace all the previous knowledge and learn new information.To seamlessly realize this continuous learning, this study infused a Bayesian update scheme into the evolutionary algorithm's FPP rule.Suppose we have the best-so-far generation, denoted as S * and its associated fitness scores, F * s ð Þ, s ∈ S * .According to the FPP rule, the probability of selecting an θ for next parent is given by Prob θ ð Þ / F s ð Þ; s 2 S * .Thus, F * s ð Þ is regarded as a prior probability density function of parameters θ = {a, x * }, i.e. π prior (θ) in the typical Bayesian formalism.For initialization of π prior (θ), there are several choices: fully random initialization, expert knowledge-based 31 or the principle of maximum entropy-based initialization 32 .This study intentionally departs from fully random initialization to investigate positive evolution trends without special initialization assumption.Thus, this framework is purely datadriven, requiring no distributional assumptions about priors and posteriors compared to some successful auto-encoder methods 14 .For the posterior distribution, we proposed the following two-stage procedure.a Experimental data (raw data values are adapted from Fig. S2a, c in ref. 19 ) and b predicted electric potentials using the best-so-far physical rule with three link functions of convolved IIs in Eq. ( 8).c-d Downward demolding direction parallel to the Y-axis as shown in inset.c Experimental data (raw data values are adapted from Fig. 2c, f in ref. 19 ) and d predicted electric potentials using the best-so-far physical rule with three link functions of convolved IIs in Eq. ( 6).The experimental data for these prediction tests were not used for the Bayesian training, and the identified physical rules are solely based on training on the raw data of Fig. 3 and Fig. 4.
Suppose that we have the prior best LFs and their S * and that new experimental data become available.At the first learning generation with the new data, we can calculate the first fitness scores F s; S * À Á by applying the prior S * and LFs to the new experiment.After the first generation, we can estimate the Bayesian fitness score (denoted as F B ) as where κ is needed for normalizing the Bayesian fitness to unity, which is simply given by Then, from the second learning generation of the new experiment, the probability of selecting two parents is proportional to the Bayesian fitness score as Once again, an individual s realizes a candidate of θ = (a, x * ) in the new generation S, and thus the desired posterior distribution is obtained.In this way, the prior knowledge is smoothly inherited to the new experiment on the framework of evolutionary algorithm, thereby enabling constantly evolving physical rule learning.To allow for evolving with new data, the previous scores are inherited by the Bayesian score Eq. ( 14).Since the adopted evolutionary algorithm remembers prior generation's fitness scores, which offer the probability distribution of free parameters of LFs.As the Bayesian inheritance continues with new experimental data, the probability distribution of LFs will naturally evolve.Thus, the proposed framework can achieve evolving capability with increasing data.In the future, more dedicated investigations should focus on validation of the constantly evolving capability of LFs with sufficient, diverse test data.To some extent, the aforementioned combination of a Bayesian update and evolutionary algorithm can be viewed as a log-likelihood maximization as explained in Supplementary Note 4.
Overall flow of the evolving physical rule learner.Based on the aforementioned building blocks, the proposed framework has the overall architecture as shown in Fig. 6.For a target physical phenomenon, learning begins with quantifying the scientist's knowledge and experience into a simple, local information index (step a in Fig. 6).Spatial information integration is conducted with multiple influence ranges (i.e.L (l) , l = 1,…, n l ) and Gaussian weights N as marked between steps a and b in Fig. 6.Multiple convolved IIs and their interaction may be regarded as the counterparts to the deep learning's multi-layered convolutions.As mentioned earlier, the central novelty is that we "externalize" the deep learning's multi-layered convolutions by conducting multiple convolutions at the input information level (b in Fig. 6), being independent of learning engine.Then, a ML or optimization method (here, evolutionary algorithm) is used for learning and evolving internal parameters θ of the LFs, not for direct prediction of target responses (step c in Fig. 6).Thus, the ML methods' powerful strength of training and solution searching can focus on identifying mathematical expressions between the IIs and the target physical responses.The best-so-far "expressions" of the hidden rule will become a prior best generation for the next generation's LFs via the Bayesian update scheme (step d in Fig. 6) for future data sets.
Raw experimental data sets of electric potentials.The raw experimental data sets adapted for the feasibility tests are in a text-based matrix form of {x, y, z, ΔV} (i) , i = 1,…, n: (1) the horizontal demolding case (Fig. 3) has a 65536 × 4 matrix, (2) the diagonal demolding case (Fig. 4) has a 115600 × 4 matrix, and (3) the horizontal demolding case of parallel strip patterns (Fig. 5) has a 65534 × 4 matrix.Here, the unit of coordinates {x, y, z} is [nm] while the electric potential charge ΔV is in [Voltage] or [V].All of these raw training data sets are made publicly available (see Data Availability).
General settings for the evolutionary algorithm.The initial search ranges for all link functions' parameters a i ∈ a are set as a i ∈ [−3000, 3000] while x * 1 2 0; 1=3 ½ , x * 2 2 1=3; 2=3 ½ , and x * 3 2 2=3; 1 ½ for the three knots of cubic spline regression basis.For typical settings for the genetic algorithm, 4 alleles are used per gene; the mutation rate is 0.01-0.02; the total organisms per generation is 10,000-20,000.The maximum generation number is 1000 which gave reasonably converged results.
Computational implementation of proposed algorithms.The spatial convolution of the 3D point cloud to generate the convolved information index may be computationally expensive depending upon the size of data points.This framework developed a scalable parallel program with C++ and OpenMPI.All other learning, evolutionary algorithm and a Bayesian update scheme are implemented on the parallel program.The developed program will be publicly shared for academic purposes upon request to the corresponding author.Iowa State University's highperformance computing facility, Condo cluster is used for this study.
Fabrication of tribocharged nano-cup array.The tribocharged PDMS nanocup array is fabricated by the sequence of (1) prepare a PET mold with a 750 nm-pitch triangular array of nanocones (Micro-continuum Inc.), (2) pour the liquid phase PDMS, Sylgard 184, Dow Corning, mixed with the curing agent, (3) solidify the specimen, and (4) peel off the PDMS from the PET mold (for more details see ref. 19 ).
Surface characterization with Kelvin probe force microscopy.Atomic force microscopy (AFM) (Multimode, Veeco) in the tapping mode is used to obtain the topography.AFM in the Kelvin probe force microscopy mode is used to measure the surface topography and potential.The settings for Pt/Ir coated tips (purchased from Bruker) are SCM-PIT-v2, the spring constant of 2.8 N.m −1 , and the resonance frequency of 75 kHz.The lift height and the typical scanning rate are 45 nm and 0.5 Hz, respectively (for more details see ref. 19 ).

Fig. 1
Fig. 1 Basic physics and scientist's experience to be quantified.a Example plot of the 3D point cloud of a nanocup array.b Point-wise normal vector n and area A of a friction surface (see algorithms in Supplementary Note 1).c-d Illustration of the alignment of the demolding direction u and normal vector n; e Demolding of nano-patterns in the lab.f-g Definition of tangential fracture energy G f related to the tribocharging, and its calculated values over the circumference (AA′) of a nanocup.Inset shows the normalized potential measured by Kelvin probe force microscopy (adapted from refs.18,19 ).
Figure 5a, b has the same setup as Fig. 3, and Fig. 5c, d are the prediction plots.The demolding direction is different in all experiments including rightward, diagonal, and downward as marked in figure.Another complexity arises from the diverse heights of nano-cups: 154 nm for Fig. 3a, 93.5 nm for Fig. 4a, 117 nm for Fig. 5a, and 50.2 nm for Fig. 5c, respectively.Nano-scale patterns are also different, i.e. nano-cup arrays and parallel ridges.Since the goal of this study is to learn underlying physical rules, the proposed framework should overcome all the aforementioned apparent diversities and complexities.

Fig. 2
Fig. 2 Comparison of the proposed framework's convolved information index over the 3D point cloud and deep learning's typical convolutions.a Each point has its own II local , and the weighted integration of Eq. (3), a spatial convolution, generates the "convolved information index" (denoted as II) at all points.b Multiplicative interaction among several link functions (LFs) of multiple influence ranges similar to the deep learning's multiple convolutions over several layers.c Deep learning's typical 2D convolution of image and d a temporal convolution for continuous aggregation of past information (adapted from ref.17).

Fig. 3
Fig. 3 Initial training with raw data and the first prediction of electric potentials.a-b Bird's eye and top view of the real experimental data of the 3D point cloud showing a specimen with the rightward demolding direction parallel to the X-axis as shown in inset (experimental data values are adapted from Fig. 2a, d of ref.19 ).c-d Reproduced results using the identified physical rule with the best-so-far 3 link functions of convolved IIs in Eq. (6).

Fig. 4
Fig. 4 Bayesian training with raw data and reproduction of electric potentials.a-b Bird-eye and top view of the real experimental data showing a specimen with the diagonal demolding direction as shown in inset (experimental data values are adapted from Fig.2b, e of ref.19 ).c-d Reproduced results using the identified physical rule with the best-so-far 3 link functions of convolved IIs in Eq.(6).A Bayesian update is used to inherit trained physical rule from Fig.3.

Fig. 5
Fig. 5 Prediction tests with substantially different experimental data sets.a-b Rightward demolding direction parallel to the X-axis as shown in inset.a Experimental data (raw data values are adapted from Fig.S2a, c in ref.19 ) and b predicted electric potentials using the best-so-far physical rule with three link functions of convolved IIs in Eq. (8).c-d Downward demolding direction parallel to the Y-axis as shown in inset.c Experimental data (raw data values are adapted from Fig.2c, f in ref.19 ) and d predicted electric potentials using the best-so-far physical rule with three link functions of convolved IIs in Eq. (6).The experimental data for these prediction tests were not used for the Bayesian training, and the identified physical rules are solely based on training on the raw data of Fig.3and Fig.4.

Fig. 6
Fig. 6 Flowchart of the proposed framework.a Basic physics and experience is translated into a local II (II local ) at all spatial point x i .b With various ranges L (j) and Gaussian weights N , multiple convolved IIs (II) are generated; c Using II and prediction errors of target electric potentials (ΔV), evolutionary algorithm reveals LFs.d Bayesian update is incorporated for cross-specimen evolution.