Identifying a machine-learning structural descriptor linked to the creep behavior of Kob-Andersen glasses

Wu, Mingyue; Ruiz Pestana, Luis

doi:10.3389/fmats.2023.1272355

ORIGINAL RESEARCH article

Front. Mater., 12 September 2023

Sec. Mechanics of Materials

Volume 10 - 2023 | https://doi.org/10.3389/fmats.2023.1272355

This article is part of the Research Topic Modeling and Experimentation of Imperfections in Materials.

Mingyue Wu

Luis Ruiz Pestana*

Computational Nanomaterials Laboratory, Department of Civil and Architectural Engineering, University of Miami, Coral Gables, FL, United States

A wide variety of materials, ranging from metals to concrete, undergo permanent deformations when subjected to sustained loads below their yield stress, typically at high temperatures or over long time scales, a phenomenon known as creep. While theories grounded in defects such as vacancies, dislocations, or grain boundaries can explain creep in crystalline materials, our understanding of creep in disordered solids remains incomplete due to the lack of analogous structural descriptors. In this study, we use molecular dynamics to simulate the creep response of a Kob-Andersen glass model system under constant, uniaxial, compressive stress at finite temperature. We leverage that data to derive, using a machine-learning classification model, a structural descriptor termed looseness, $L$, which is based on simple and interpretable local structural features and can predict imminent plastic rearrangements within the glass. We show that the average looseness of the system evolves logarithmically with time, mirroring the time dependence of the creep strain and demonstrating the ability of our model to bridge local, short-term particle dynamics with the long-term macroscopic creep response. A detailed feature importance analysis reveals the particular significance of short-range structural heterogeneity in the prediction. We also scrutinize the spatial and temporal correlations of looseness, which mirror the lack of long-range order in glasses and their dynamic heterogeneity. Our research underscores the substantial predictive potential of machine-learning-derived structural indicators in systems experiencing concurrent stress and thermal excitations, paving the way for future work to elucidate the interplay between thermal and mechanical activation of structural defects in disordered solids.

1 Introduction

Certain materials, when exposed to sustained loads below their yielding point, and typically over long time scales and/or high temperatures, exhibit permanent deformations, a phenomenon known as creep. Creep occurs in metals under high-temperature conditions, such as those found in turbine blades (McLean, 1966), in ice causing glaciers to flow (Weertman, 1983), or in amorphous materials such as polymers (Brinson and Gates, 1995), metallic glasses (Castellero et al., 2008), or even concrete (Bazant and Wittmann, 1982), the most widely used man-made material worldwide. While several mechanisms responsible for creep of crystalline solids have been proposed, which include the diffusion of vacancies (Nabarro, 1948; Herring, 2004), dislocation dynamics (Harper and Dorn, 1957), or grain boundary sliding (Bell and Langdon, 1967; Langdon, 2006), all these mechanisms are based on structural defects that break the long-range order of the crystal lattice and therefore can be trivially identified. Analogous knowledge for disordered solids is understandably lacking, as what constitutes a structural defect in these systems remains an open question. For example, intuitive structural descriptors, such as free volume or bond orientational order, have been shown to be poor predictors of the plasticity of glasses (Richard et al., 2020). Other more successful indicators have also been proposed, such as soft modes (Widmer-Cooper et al., 2008; Tanguy et al., 2010) or rattling amplitude (Larini et al., 2008), but those rely on the dynamics of the system and thus are not strictly structural.

Motivated by this challenge and by the tremendous power of machine learning (ML) techniques to find patterns within complex datasets when human intuition falls short (Bishop and Nasrabadi, 2006), Cubuk et al. (2015) pioneered the use of ML techniques to identify potentially complex structural signatures that are predictive of the particle dynamics in glassy systems. In this context and given the challenge of collecting experimental data at the needed time and length scales, molecular dynamics (MD) simulations have become indispensable in generating high-quality, comprehensive data sets essential for the successful implementation of ML models. Despite the remarkable advancements made in this field over the past few years (Schoenholz et al., 2016; Wang and Jain, 2019; Bapst et al., 2020; Boattini et al., 2020; Fan et al., 2020; Wang et al., 2020; Liu et al., 2021; Peng et al., 2021; Wang and Zhang, 2021; Xiao et al., 2021; Wu et al., 2023), prior studies have tackled thermally-driven and stress-driven relaxation events independently. Studies focused on understanding structural signatures underlying the glass transition are based on simulations of stress-free glasses near the glass transition temperature. In contrast, those focused on predicting plastic rearrangements in disordered solids under stress rely, almost exclusively, on simulations of the glass under athermal, quasistatic shear conditions. Moreover, to the best of our knowledge, the recent work by Liu et al. (2021) stands alone in its focus on creep. Interestingly, they demonstrated, for shear strains up to approximately 1%, a strong correlation between the macroscopic creep rate and a structural descriptor derived through ML based on the initial undeformed structure of the disordered colloidal gel (Liu et al., 2021). Their simulations, however, were conducted in the quasistatic athermal regime and under oscillatory shear, which is a condition more closely related to fatigue behavior than to creep. Deriving ML structural descriptors that can predict the creep response of disordered solids under sustained loads at finite temperatures remains a largely unexplored area, and it is extremely relevant in the context, for example, of bulk metallic glasses operating at high temperatures (Li et al., 2019).

Here, we employ MD simulations to investigate the creep response of a Kob-Andersen (KA) glass (Kob and Andersen, 1995) under sustained uniaxial compressive stress at finite temperature. We provide a detailed analysis of how the macroscopic creep response of the glass is affected by the level of applied stress and temperature, and we characterize the statistical evolution during creep of the microscopic deformations, which we quantify by the non-affine squared displacements of individual particles in the glass, $D_{\min}^{2}$. Using ML classification methods based on interpretable structural features describing the particles' interstices, we are able to identify a local structural descriptor, dubbed looseness, $L$, that can predict whether a particle in the glass will undergo an imminent plastic rearrangement based on its local interstitial environment alone. We quantify the prediction accuracy of the ML models, and explain it based on the interstitial structural features. We also study the time evolution of looseness averaged over all the particles in the glass, $〈L〉$, as well as its spatial and temporal correlations.

2 Materials and methods

2.1 Molecular dynamics simulations

We performed MD simulations using the program LAMMPS (Thompson et al., 2022) to study the creep response of a KA glass under a sustained compressive stress at finite temperature. The KA model is a two-component Lennard-Jones (LJ) system, which has been extensively used to study the dynamics of supercooled liquids and the glass transition, due to being relatively simple and computationally efficient while still being able to capture many of the key behaviors of real glasses (Kob and Andersen, 1995). All the simulated systems here contained 10,000 particles, where 80% and 20% of them were type A and type B, respectively. The parameters of the LJ interactions are: $σ_{A A} = 1.0$ , $σ_{A B} = 0.8$ , $σ_{B B} = 0.88$ , $ϵ_{A A} = 1.0$ , $ϵ_{A B} = 1.5$ , $ϵ_{B B} = 0.5$ , $m_{A} = m_{B} = 1$ , and the cutoff for the interactions was set to $2.5 σ_{A A}$ . All the quantities reported in this study are given in reduced Lennard-Jones units, unless specified otherwise. All the simulations were performed with periodic boundary conditions (PBCs) in all dimensions (effectively simulating bulk glasses), and a time step $τ = 0.01$ .
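As a minimal illustration of the interaction parameters listed above, the pair energy can be sketched in Python as follows. The function name `lj_pair_energy` is hypothetical, and the plain truncation at the cutoff (no energy shift) is an assumption, since the text does not state how the potential was treated at $2.5 σ_{A A}$.

```python
# Kob-Andersen parameters quoted in the text (reduced LJ units).
SIGMA = {("A", "A"): 1.0, ("A", "B"): 0.8, ("B", "B"): 0.88}
EPSILON = {("A", "A"): 1.0, ("A", "B"): 1.5, ("B", "B"): 0.5}
R_CUT = 2.5 * SIGMA[("A", "A")]  # interaction cutoff stated in the text


def lj_pair_energy(r, type_i, type_j):
    """Truncated 12-6 Lennard-Jones energy of one pair at distance r.

    Assumption of this sketch: the potential is simply cut at R_CUT,
    without any shift; the original simulations may differ.
    """
    if r >= R_CUT:
        return 0.0
    key = tuple(sorted((type_i, type_j)))  # ("B", "A") -> ("A", "B")
    sigma, eps = SIGMA[key], EPSILON[key]
    sr6 = (sigma / r) ** 6
    return 4.0 * eps * (sr6 * sr6 - sr6)
```

At the minimum of the potential, $r = 2^{1/6} σ$, the function returns $-ϵ$ for the corresponding pair type, which is a quick sanity check on the parameter table.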

We generated initial glass configurations as follows. First, we generated a random configuration of particles in a simulation box at density $ρ = N / V = 1.2$, which is the equilibrium density of the KA glass. We simulated this system in the NVT ensemble for $5 \times 10^{4}$ steps, using a Langevin thermostat at $T = 3$, which is well above the mode-coupling temperature of the KA model, $T_{M C T} = 0.435$ (Ashwin and Sastry, 2003). After randomizing the positions of the particles, we induced glass formation by cooling down the system to $T = 0.1$ over $10^{4}$ steps in the NVT ensemble using a Nosé-Hoover thermostat. As a final step, we minimized the energy of the system. We generated a total of ten unique, minimized, initial glass configurations following this process.

Starting from each of those ten glass configurations, we performed simulations in the NPT ensemble where the KA glasses were instantaneously placed under a constant uniaxial compressive stress at $T$ below $T_{M C T}$ and sustained for $10^{7}$ steps. During the simulations, we outputted optimized configurations, where the energy of the system was minimized under the constraint of the applied stress, every $10^{4}$ steps for analysis. From each MD simulation, we therefore collect $10^{3}$ optimized configurations for analysis. We carried out simulations at $T = 0.01$ , $0.1$ , $0.2$ , $0.3$ , and $0.4$ at a stress of $σ_{0} = 0.5$ , and simulations at stress levels ranging from $σ_{0} = 0.1$ to $0.9$ in increments of 0.1, at a temperature of $T = 0.1$ . We also performed ten simulations at $σ_{0} = 0.5$ and $T = 0.1$ . The data from these simulations ( $σ_{0} = 0.5$ and $T = 0.1$ ), which demonstrably reproduce the primary creep response of the KA glass, were utilized for the ML tasks.

2.2 Analysis of non-affine displacements

We calculated the non-affine squared displacement of particle $i$ over a time interval $∆ t$ measured from $t_{o}$, $D_{\min}^{2} (i, t_{o}, ∆ t)$, using the equations originally proposed by Falk and Langer (1998), which can be written as:

$$D_{\min}^{2} (i, t_{o}, ∆ t) = \min_{ε_{i}} \left\{ \frac{1}{n_{i}} \sum_{j} {[R_{i j} (t_{o} + ∆ t) - ε_{i} R_{i j} (t_{o})]}^{2} \right\} \tag{1}$$

where $ε_{i}$ is the local strain tensor around particle $i$ which minimizes the quantity between the curly brackets, $n_{i}$ is the number of particles within a cutoff distance ($R_{c u t}$) of particle $i$, and $R_{i j}$ is the separation vector between particle $i$ and particle $j$ in its neighborhood. We select $R_{c u t} = 2.5$, beyond which the results for $D_{\min}^{2}$ showed no sensitivity to variations in this parameter. The quantity $ε_{i} R_{i j} (t_{o})$ corresponds to the inter-particle separations predicted at $t_{o} + ∆ t$ by the affine deformation. We used our own MATLAB script to compute $D_{\min}^{2}$.
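The minimization in Eq. 1 is a linear least-squares problem, which the following NumPy sketch makes explicit. It is not the MATLAB script used in the study: the function name `d2min` is hypothetical, and the sketch assumes unwrapped coordinates, so periodic images are not handled.

```python
import numpy as np


def d2min(pos0, pos1, i, r_cut=2.5):
    """Non-affine squared displacement of particle i between two frames.

    pos0, pos1: (N, 3) arrays of positions at t_o and t_o + dt.
    Assumption of this sketch: coordinates are unwrapped (no periodic
    boundary handling).
    """
    # Neighbors are selected in the reference frame, excluding i itself.
    d0 = pos0 - pos0[i]
    dist = np.linalg.norm(d0, axis=1)
    mask = (dist < r_cut) & (dist > 0.0)
    R0 = d0[mask]                    # separation vectors at t_o
    R1 = (pos1 - pos1[i])[mask]      # same pairs at t_o + dt
    # Best-fit linear map R1 ~ R0 @ X, where X is the transpose of eps_i.
    X, *_ = np.linalg.lstsq(R0, R1, rcond=None)
    residual = R1 - R0 @ X
    return float(np.sum(residual ** 2) / len(R0))
```

For a purely affine deformation of the whole configuration the function returns a value at the level of floating-point noise, while displacing a single neighbor independently of the rest yields a strictly positive value.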

2.3 Machine learning

2.3.1 Problem statement

Our ultimate goal is to train an ML model that can predict whether or not a particle will undergo a plastic rearrangement ($D_{\min}^{2} > D_{\min, 0}^{2}$) over some time interval $∆ t$, using as features only simple structural descriptors of the local neighborhood of that particle. Accordingly, we cast this problem as a supervised binary classification task. We call particles that undergo plastic rearrangements class 1 or loose, and those that do not class 0 or tight.

2.3.2 Dataset

Each particle in each of the configurations outputted during the MD simulations at $t_{o} = 1 \times 10^{4}, 2 \times 10^{4}, \dots, 10^{3} \times 10^{4}$ steps corresponds to an example in the dataset. The features for each particle are calculated at each $t_{o}$ and the particles are labeled as loose (class 1) or tight (class 0) depending on $∆ t$ (the time interval over which the particle displacements are quantified) and $D_{\min, 0}^{2}$ (the threshold defining whether a particle undergoes substantial rearrangement). In this study, we focus only on the 80% of particles identified as type A, but our approach could be readily expanded to type B particles by choosing a different, suitable value for $D_{\min, 0}^{2}$. For $∆ t = 10^{4}$ the dataset contains: $10$ independent MD simulations $\times$ $10^{3}$ configurations $\times$ $8,000$ particles of type A $= 8 \times 10^{7}$ examples. As shown in Supplementary Table S1, our datasets are extremely imbalanced, with class 1, the loose particles, being the minority class as most particles in the glass do not undergo plastic rearrangements during creep.
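The dataset size and the labeling rule described above can be written out as a short sketch. The names are hypothetical, and the default threshold of 0.25 is the value adopted later in the study for $∆ t = 10^{4}$ steps.

```python
import numpy as np

N_RUNS = 10          # independent MD simulations
N_CONFIGS = 10**3    # optimized configurations per simulation
N_TYPE_A = 8_000     # 80% of the 10,000 particles

n_examples = N_RUNS * N_CONFIGS * N_TYPE_A   # 8 x 10^7 examples


def label_particles(d2min_values, threshold=0.25):
    """Return 1 (loose) where D2min exceeds the threshold, else 0 (tight)."""
    return (np.asarray(d2min_values) > threshold).astype(int)
```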

2.3.3 Feature engineering

Inspired by the work by Wang and Jain (2019), we created features that encompass easily interpretable and straightforward structural quantities that capture the interstitial environments of each particle at short- and medium-range length scales. The short-range features (SRFs) are derived from the free distances, areas, and volumes between a given particle and its neighbors. The distances, areas, and volumes, determined by pairs, triplets, and groups of four particles, respectively, are corrected for the spherical particle sizes (proportional to $σ_{A A}$ and $σ_{A B}$), therefore representing the interstitial non-occupied space. The nearest neighbors to any particle are found by identifying the particles associated with Voronoi cells that share a boundary with the cell of the particle in question. The tetrahedral volume is calculated using the Quickhull algorithm (Barber et al., 1996). For each metric (distance, area, volume), we compute four features that correspond to the mean, maximum, minimum, and standard deviation of the calculated values for the given particle. Hence, there is a total of 12 SRFs (e.g., $\max (A)$, …). The summary statistics aim to capture the average as well as potential anisotropy of the local interstitial environment. The medium-range features (MRFs) are computed by calculating the same summary statistics, but now of the SRFs corresponding to the neighbors of the given particle. Consequently, the MRFs consist of 48 features (e.g., $s t d [m e a n (V)]$, …). As illustrated in Figure 1, each particle is described by a total of 60 features.

FIGURE 1. Short-Range Features (SRFs) and Medium-Range Features (MRFs). (A) Particle O, surrounded by particles a, b, c, … , f which are neighbors according to a Voronoi construction depicted by black, dashed lines. A distance (green), an area (red), and a volume (yellow) element are illustrated in the sketch. (B) SRFs are calculated based on the summary statistics (mean, maximum, minimum, and standard deviation) of the distances, areas, and volumes defined by the particle O and its neighbors. The numbers shown in the arrays correspond to the indexes of the corresponding features (1–12 are SRF, and 13 to 60 MRF). (C) The MRFs assigned to particle O comprise the summary statistics of the SRFs corresponding to the neighboring particles.
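The feature counts in Section 2.3.3 (12 SRFs plus 48 MRFs) follow directly from applying four summary statistics twice, as the sketch below illustrates. It takes the free distances, areas, and volumes as given inputs and does not perform the Voronoi or Quickhull steps. The function names, the ordering of the features, and the use of the population standard deviation are assumptions of this sketch.

```python
import numpy as np

# Order of the statistics is an illustrative choice, not taken from the paper.
STATS = (np.mean, np.max, np.min, np.std)


def short_range_features(distances, areas, volumes):
    """12 SRFs: four summary statistics for each of the three metrics."""
    metrics = (distances, areas, volumes)
    return np.array([f(np.asarray(x, dtype=float)) for x in metrics for f in STATS])


def medium_range_features(neighbor_srfs):
    """48 MRFs: the same four statistics of each SRF across the neighbors.

    neighbor_srfs: (n_neighbors, 12) array holding the neighbors' SRFs.
    """
    neighbor_srfs = np.asarray(neighbor_srfs, dtype=float)
    return np.concatenate([f(neighbor_srfs, axis=0) for f in STATS])
```

Concatenating the two outputs gives the 60-component feature vector of a particle.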

2.3.4 Workflow design

All the ML tasks in this paper were executed using Python scripts with the Scikit-Learn (Pedregosa et al., 2011) and Imbalanced-learn (Lemaître et al., 2017) packages. To evaluate and assess the ML models, we utilized balanced accuracy as our evaluation metric, which is the arithmetic mean of sensitivity, $T P / (T P + F N)$, and specificity, $T N / (T N + F P)$, where $T$ and $F$ stand for true and false, respectively, and $P$ and $N$ for positive and negative, respectively. This metric gives an equal weight to both classes, ensuring that neither the majority nor the minority class dominates the accuracy score. In this paper, we performed 3 ML tasks: 1) an investigation to select the optimal values for $∆ t$ and $D_{\min, 0}^{2}$, 2) implementation of recursive feature elimination (RFE) to remove highly correlated features, reduce the model complexity, and gain insight into the most important features, and 3) training an ML classification model using the optimal values of $∆ t$ and $D_{\min, 0}^{2}$, as well as the top ranked features.
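To make the definition of balanced accuracy concrete, a plain-Python sketch is given below. The function name is hypothetical, and the sketch assumes that both classes are present in `y_true`.

```python
def balanced_accuracy(y_true, y_pred):
    """Arithmetic mean of sensitivity and specificity for labels in {0, 1}.

    Assumes both classes occur in y_true (otherwise a division by zero).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return 0.5 * (sensitivity + specificity)
```

For example, with 95 tight and 5 loose particles, a model that labels every particle as tight has a plain accuracy of 0.95 but a balanced accuracy of only 0.5, which is why the metric suits an imbalanced dataset.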

To investigate the influence of $∆ t$ and $D_{\min, 0}^{2}$ on the accuracy of the models, we first created, for each combination of $∆ t = 10^{4}$, $10^{5}$, or $10^{6}$ steps and $D_{\min, 0}^{2} = 0.05$, $0.1$, $0.15$, $0.20$, $0.25$, or $0.30$, five balanced bootstrap samples from the overall dataset using random undersampling. In random undersampling, instances of the majority class are randomly eliminated to equalize the number of instances in both classes. Each of the five balanced samples contained 1,285 examples from each class. We maintained a consistent dataset size across all combinations of $∆ t$ and $D_{\min, 0}^{2}$ to isolate the effect of these two parameters. For each of the balanced samples, we performed feature standardization to prevent domination by larger-scale features, thereby enabling all features to contribute evenly to model predictions. Next, we used cross-validation (CV) to identify the optimal regularization hyperparameter, $C$, for a logistic regression model. After determining the optimal $C$, we used 5-fold CV to compute the validation balanced accuracy of the models.
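Random undersampling to a fixed number of examples per class can be sketched with NumPy as follows. This is a hand-written stand-in for illustration, not the Imbalanced-learn routine used in the study, and the function name and signature are hypothetical.

```python
import numpy as np


def random_undersample(X, y, n_per_class, seed=None):
    """Draw n_per_class examples from each class without replacement."""
    rng = np.random.default_rng(seed)
    X, y = np.asarray(X), np.asarray(y)
    idx = np.concatenate([
        rng.choice(np.flatnonzero(y == c), size=n_per_class, replace=False)
        for c in (0, 1)
    ])
    rng.shuffle(idx)  # avoid having the two classes in separate blocks
    return X[idx], y[idx]
```

Calling the function with `n_per_class=1285` and five different seeds would produce five balanced samples of the size quoted above.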

Once $∆ t = 10^{4}$ and $D_{\min, 0}^{2} = 0.25$ were established, we used RFE to remove highly correlated features, reduce the model complexity, and improve interpretability (McLean, 1966). Data from 8 of the 10 MD simulations were used for this task, with the remainder reserved for testing the final model. Using random undersampling, we generated five independent balanced bootstrap samples containing 2,115 instances from each class. After standardizing the features, RFE was then performed using a gradient boosting classifier (GBC) as the estimator.

Using $∆ t = 10^{4}$ and $D_{\min, 0}^{2} = 0.25$ along with the top 10 ranked features identified through RFE, we trained an ML classification model to predict the particle labels. We applied the same training and testing split as in the RFE study and standardized the training and testing dataset features independently to avoid leakage. We used EasyEnsemble as our ML algorithm (Liu et al., 2009), where an ensemble of learners is trained on different balanced bootstrap samples. Random under-sampling was utilized to balance the samples. Our ensemble comprised 10 learners using logistic regression as the base estimator with an optimal regularization parameter $C = 1$ (see Supplementary Table S2 for details). The model output, termed as looseness, $L \in [0,1]$ , represents the probability of a particle being classified as loose or class 1. We decided to use logistic regression due to its simplicity and because it provides a probability for the predictions.
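The idea of an ensemble of logistic regressions trained on balanced subsamples, with looseness taken as the averaged class-1 probability, can be sketched as below. This is a simplified, hand-rolled stand-in rather than the Imbalanced-learn EasyEnsemble implementation used in the study, and it assumes that class 1 is the minority class. The function names are hypothetical.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


def fit_balanced_ensemble(X, y, n_learners=10, C=1.0, seed=0):
    """Train n_learners logistic regressions, each on a balanced subsample.

    Every learner sees all minority (class 1) examples plus an equally
    sized random draw from the majority (class 0) examples.
    """
    rng = np.random.default_rng(seed)
    minority = np.flatnonzero(y == 1)
    majority = np.flatnonzero(y == 0)
    learners = []
    for _ in range(n_learners):
        sub = rng.choice(majority, size=len(minority), replace=False)
        idx = np.concatenate([minority, sub])
        model = LogisticRegression(C=C, max_iter=1000)
        learners.append(model.fit(X[idx], y[idx]))
    return learners


def looseness(learners, X):
    """Average predicted probability of class 1 over the ensemble."""
    return np.mean([m.predict_proba(X)[:, 1] for m in learners], axis=0)
```

Because each learner outputs a probability, the averaged value lies in $[0,1]$, consistent with the definition of $L$ above.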

2.4 Fluctuations, space, and time autocorrelation functions

We calculated the space autocorrelation function of looseness, ${S A C F}_{L}$ , using:

$${S A C F}_{L} (Δ r) = {〈(L_{i} (t_{0}) - {〈L (t_{0})〉}_{N}) \cdot (L_{j} (Δ r, t_{0}) - {〈L (t_{0})〉}_{N})〉}_{i j, t_{0}} \tag{2}$$

where $L_{i} (t_{0})$ is the looseness of particle $i$ at time $t_{0}$ , $L_{j} (Δ r, t_{0})$ is the looseness of particle $j$ at a distance $Δ r$ of particle $i$ at time $t_{0}$ , and ${〈L (t_{0})〉}_{N}$ is the average looseness of the system at time $t_{0}$ . The outer angle brackets indicate the average over times $t_{0}$ and pairs of particles $i j$ .
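For a single configuration, Eq. 2 amounts to averaging products of looseness deviations over pairs binned by their separation. The sketch below uses a brute-force pair loop suitable only for small systems, ignores periodic boundary conditions, and omits the additional average over $t_{0}$. The function name and the binning scheme are assumptions.

```python
import numpy as np


def space_autocorrelation(pos, L, bin_edges):
    """Pair-averaged product of looseness deviations, binned by distance.

    pos: (N, 3) positions; L: (N,) looseness values at one time t_0.
    Bins without any pair are returned as NaN. No periodic images are used.
    """
    L = np.asarray(L, dtype=float)
    dL = L - L.mean()                       # deviation from <L(t_0)>_N
    diff = pos[:, None, :] - pos[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    iu = np.triu_indices(len(L), k=1)       # each pair counted once
    pair_dist = dist[iu]
    pair_prod = (dL[:, None] * dL[None, :])[iu]
    which = np.digitize(pair_dist, bin_edges) - 1
    n_bins = len(bin_edges) - 1
    sacf = np.full(n_bins, np.nan)
    for b in range(n_bins):
        sel = which == b
        if sel.any():
            sacf[b] = pair_prod[sel].mean()
    return sacf
```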

To characterize the temporal autocorrelation, we require a fixed reference space frame. To that end, we map each configuration of the glass to a cube of side 1, discretize that space into 15 × 15 × 15 voxels, and map the looseness of individual particles to each voxel in the normalized cube. That transformation allows us to track the time correlations of a looseness field, $L^{*} (r, t)$ , in a reference frame that does not depend on the ever changing position of individual particles. The expression of the time autocorrelation function of the looseness field, ${T A C F}_{L^{*}} (Δ t)$ , is:

$${T A C F}_{L^{*}} (Δ t) = {〈(L^{*} (r_{0}, t_{0}) - {〈L^{*} (r, t_{0})〉}_{r}) \cdot (L^{*} (r_{0}, t_{0} + Δ t) - {〈L^{*} (r, t_{0})〉}_{r})〉}_{r_{0}, t_{0}} \tag{3}$$

where $L^{*} (r_{0}, t_{0})$ is the value of the looseness field at time $t_{0}$ and position $r_{0}$, $L^{*} (r_{0}, t_{0} + Δ t)$ is the value of the looseness field at the same position $r_{0}$ at time $t_{0} + Δ t$, and ${〈L^{*} (r, t_{0})〉}_{r}$ is the average looseness of the system at time $t_{0}$ (${〈L^{*} (r, t_{0})〉}_{r} \equiv {〈L (t_{0})〉}_{N}$). The outer angle brackets indicate the average over time origins, $t_{0}$, and spatial locations, $r_{0}$.
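The voxel mapping and Eq. 3 can be sketched as follows. Several details are assumptions of this sketch rather than statements from the text: the box origin is at zero, the field value of a voxel is the mean looseness of the particles it contains, empty voxels are marked as NaN and skipped in the averages, and the function names are hypothetical.

```python
import numpy as np


def looseness_field(pos, L, box, n_vox=15):
    """Mean looseness per voxel after scaling positions to a unit cube."""
    scaled = (np.asarray(pos, dtype=float) / box) % 1.0
    ijk = np.minimum((scaled * n_vox).astype(int), n_vox - 1)
    flat = np.ravel_multi_index(tuple(ijk.T), (n_vox,) * 3)
    total = np.bincount(flat, weights=L, minlength=n_vox**3)
    count = np.bincount(flat, minlength=n_vox**3)
    field = np.full(n_vox**3, np.nan)       # NaN marks empty voxels
    filled = count > 0
    field[filled] = total[filled] / count[filled]
    return field


def time_autocorrelation(fields, lag):
    """Eq. 3 for one lag (lag >= 1), averaged over voxels and time origins.

    fields: (n_times, n_voxels) array of looseness fields.
    """
    fields = np.asarray(fields, dtype=float)
    a, b = fields[:-lag], fields[lag:]
    mean0 = np.nanmean(a, axis=1, keepdims=True)   # spatial mean at t_0
    return float(np.nanmean((a - mean0) * (b - mean0)))
```

Note that, as in Eq. 3, the spatial mean at the time origin $t_{0}$ is subtracted from both factors.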

We quantify the fluctuations of the looseness field as a function of system size, $Δ L^{*} (N)$ , as follows. At each time $t_{0}$ , we divide the system into voxels of the same size, each at position $r_{0}$ , as described above. Then, we calculate the fluctuations in each voxel, $Δ L^{*} (r_{0}, t_{0})$ , as the standard deviation of the looseness of the particles $j$ pertaining to that voxel, $L_{j} (r_{0}, t_{0})$ , with respect to the average looseness of the system, $〈L^{*} (t_{0})〉$ . Finally, we average the fluctuations across all voxels and all times:

$$Δ L^{*} (N) = {〈 \sqrt{\frac{\sum_{j} {(L_{j} (r_{0}, t_{0}) - 〈L^{*} (t_{0})〉)}^{2}}{N_{j}}} 〉}_{r_{0}, t_{0}} \tag{4}$$

3 Results and discussion

3.1 Macroscopic and microscopic creep response of the KA glass from MD simulations

We use the term macroscopic response to denote the response at the system level, as our simulations are conducted on bulk glasses. Conversely, microscopic response pertains to the dynamics of individual particles. Figure 2A shows the uniaxial strain evolution of the KA glass under a uniaxial compressive stress of $σ_{0} = 0.5$, and at different temperatures below $T_{M C T}$. The responses shown correspond each to the average over ten independent runs starting from different initial configurations of the glass. At the lowest temperature, the creep response is suppressed, at least over the duration of our simulations. In contrast, as the temperature nears the glass transition point, the system deforms significantly under the instantaneously applied uniaxial stress, and the strain increases very rapidly (only a reduced range of strain is shown in Figure 2A). For the intermediate temperatures, the strain clearly shows a logarithmic dependence on time, $ε (t) \propto (σ_{0} / C) \log t + ε_{e}$, where $C$ is the creep modulus, and $ε_{e}$ is the initial elastic deformation of the glass under $σ_{0}$. This response is characteristic of primary creep, where the rate of deformation decays in inverse proportion to time, $\dot{ε} \propto t^{- 1}$. Figure 2B shows the effect of the stress $σ_{0}$ on the creep modulus of the KA glass at $T = 0.1$. The creep modulus remains approximately constant for stress levels below approximately $0.5$, suggesting that inertial effects on the macroscopic mechanical response of the glasses resulting from the instantaneous application of the compressive stress are unimportant for $σ_{0} \leq 0.5$. In Figure 2C, we show the evolution of uniaxial strain for $σ_{0} = 0.5$ and $T = 0.1$, for each of the ten independent runs starting from different initial glass configurations (shades of blue), as well as the average response (black). The primary creep response of the glass is not only obvious for the average, but also evident in the individual responses, despite the presence of significant fluctuations, which can be attributed to the relatively modest size of the systems simulated.
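Given a strain time series, the creep modulus in the logarithmic law above can be estimated with a linear fit of the strain against $\log t$. The sketch below assumes a natural logarithm and an ordinary least-squares fit; neither choice is stated in the text, and the function name is hypothetical.

```python
import numpy as np


def fit_log_creep(t, strain, sigma0):
    """Fit strain = (sigma0 / C) * ln(t) + eps_e and return (C, eps_e).

    Assumption of this sketch: natural logarithm and unweighted
    least squares.
    """
    slope, eps_e = np.polyfit(np.log(t), strain, 1)
    return sigma0 / slope, eps_e
```

Differentiating the fitted law gives $\dot{ε} = σ_{0} / (C t)$, which is the $t^{- 1}$ decay of the strain rate that characterizes primary creep.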

FIGURE 2. Macroscopic creep response of the KA glass from MD simulations. (A) Time evolution of the average strain over ten independent runs, $〈ε〉$ , for a compressive stress of $σ_{0} = 0.5$ , and at different temperatures $T = 0.01$ , $0.1$ , $0.2$ , $0.3$ , and $0.4$ . (B) Creep modulus of the KA glass at $T = 0.1$ as a function of $σ_{0}$ . (C) Strain evolution of each of the ten simulated systems at $σ_{0} = 0.5$ and $T = 0.1$ (shades of blue) as well as the average response (black).

To characterize the microscopic response of the glasses during creep at $σ_{0} = 0.5$ and $T = 0.1$, we calculated the non-affine squared displacements for each particle $i$, over a time interval from $t_{o}$ to $t_{o} + ∆ t$: $D_{\min}^{2} (i, t_{o}, ∆ t)$. Non-affine displacements are particularly useful as they are associated with local plastic rearrangements. As discussed in Section 2, this analysis is done using only optimized configurations of the glasses, which are outputted every $10^{4}$ steps during the simulations. In Figure 3A, we show, on a log-log scale, the distributions of non-affine displacements for $∆ t = 10^{4}$ steps, taken at different times throughout the simulation, $t_{o} = 10^{4}$, $10^{5}$, $10^{6}$, and $9.99 \times 10^{6}$ steps. The distributions incorporate data from all ten simulated systems. First, it is evident that, regardless of $t_{o}$, all the distributions display long, power-law tails to the right. These long tails are strong evidence of the existence of a small number of particles that undergo plastic rearrangements during $∆ t$. The power-law structure of the tails likely emerges from the convolution of the myriad of distinct particle environments in the glass, which lead to as many characteristic relaxation timescales. With the progression of time (blue to yellow in Figure 3A), the average non-affine squared displacement $〈D_{\min}^{2}〉$ trends towards lower values (Figure 3B), and the decay of the power-law tail becomes steeper, as shown by the evolution of the scaling exponent, $P D F \propto {(D_{\min}^{2})}^{- α}$, shown in Figure 3C. Therefore, both the average and extreme non-affine displacements appear to shift towards lower values as creep progresses. Interestingly, the scaling exponent of the power-law tail decreases logarithmically in time, analogous to the creep strain. This suggests that the tail of the distributions of non-affine displacements contains information about the creep response of the glass.
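One common way to estimate a tail exponent of the form $P D F \propto x^{- α}$ for $x \geq x_{\min}$ is the maximum-likelihood estimator for a continuous power law, sketched below. The text does not say how $α$ was obtained, so this estimator, the function name, and the need to choose `x_min` by hand are all assumptions of the sketch.

```python
import numpy as np


def tail_exponent(values, x_min):
    """Maximum-likelihood exponent alpha for PDF ~ x**(-alpha), x >= x_min.

    Assumes at least one value lies strictly above x_min.
    """
    tail = np.asarray(values, dtype=float)
    tail = tail[tail >= x_min]
    return 1.0 + len(tail) / np.sum(np.log(tail / x_min))
```

Applied to the $D_{\min}^{2}$ values collected at successive $t_{o}$, such an estimate would give the kind of time series of $α$ discussed above.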

FIGURE 3. Microscopic creep response of the KA glass from MD simulations. (A) The probability distribution of non-affine squared displacements, $D_{\min}^{2}$ , calculated over a time interval $∆ t = 10^{4}$ steps at different times, $t_{o}$ , during the simulations. (B) Average non-affine squared displacement $〈D_{\min}^{2}〉$ over time. (C) Scaling exponent of the power-law tail, $α$ , as a function of time.

3.2 Understanding the effect of $D_{\min, 0}^{2}$ and $∆ t$ on accuracy of the ML predictions

Two key parameters, $D_{\min, 0}^{2}$ and $∆ t$, determine whether a particle is classified as loose or tight. The choice of these parameters will, therefore, directly impact the quality of the dataset, and subsequently the accuracy of the ML predictions. While the long tails in the probability distributions of $D_{\min}^{2}$ are strong evidence of the existence of some particles that undergo plastic rearrangements, it is worth noting that there is no well-defined threshold, $D_{\min, 0}^{2}$, that can be used to unequivocally identify them, given the continuous nature of the distribution. The time interval over which the non-affine displacement is measured, $∆ t$, is also critical, but its effect has not been investigated or discussed in previous studies. In this section, we investigate the effect of $D_{\min, 0}^{2}$ and $∆ t$ on the accuracy of a classification model aimed at predicting whether or not a particle will undergo a plastic rearrangement ($D_{\min}^{2} > D_{\min, 0}^{2}$) over some time interval $∆ t$.

Figure 4 shows the validation accuracy for each class of classification models trained as described in Section 2. Both Figures 4A, B show that the model performs slightly better on the majority class, which is expected given a series of factors including information richness and sampling quality. It is worth noting that the datasets used for training and validation, but not testing, were balanced using random undersampling. In Figure 4A, we see that the accuracy of the models increases monotonically with the threshold $D_{\min, 0}^{2}$, but appears to plateau beyond $D_{\min, 0}^{2} = 0.25$. This can be explained by considering that particles with larger $D_{\min}^{2}$ possess structural environments that are highly correlated to plastic rearrangements, while those with lower $D_{\min}^{2}$ exhibit environments that may lead to such rearrangements, but with lower probability. Therefore, more selective thresholds lead to datasets with more reliable labels for the minority class, which helps to enhance the accuracy of the model. The plateauing of the accuracy is likely due to a concurrent significant decrease in the instances of the minority class within the dataset (as shown in Supplementary Table S1), which constrains the ability of the model to effectively learn from this class.

FIGURE 4. Validation accuracy for each class as a function of: (A) $D_{\min, 0}^{2}$ for $∆ t = 10^{4}$ , and (B) $∆ t$ for $D_{\min, 0}^{2} = 0.25$ . Both the average and standard deviation are shown, over the predictions of five models each trained on an independent balanced bootstrap sample.

Figure 4B shows a logarithmic decay of the model accuracy with increasing $∆ t$. We attribute this drop in accuracy to the changes in the local environment of the particles over time, which, over extended periods, can lead to memory loss of the initial structural conditions. In other words, the structural evolution of the system weakens the correlation between the environment used for feature computation, and the future behavior that the ML model is striving to predict. The logarithmic dependence of the accuracy on $∆ t$ can be explained by the fact that the structural evolution becomes increasingly slow during creep (i.e., increasingly longer periods of time are required to achieve a similar magnitude of structural reorganization). We hypothesize that for short enough $∆ t$, the accuracy of the model would also decrease based on the following argument. Given a local structural environment, the time it takes for a particle to undergo a plastic rearrangement will follow a statistical distribution. For example, if the process is thermally activated, plastic rearrangements would occur as a Poisson process, so that the waiting time for a particle to rearrange would be exponentially distributed with a rate given by $r = ω_{0} e^{- E_{a} / k_{b} T}$, where $ω_{0}$ is an attempt frequency, $E_{a}$ is an activation barrier (presumably conditioned by the particle’s structural environment), and $k_{b} T$ is the thermal energy. If $∆ t$ is so short that it does not cover a reasonable range of that distribution, then particles will be labeled as tight or class 0, even if the structural environment is correlated to plastic rearrangements over longer, more appropriate time scales. Further investigations will be required to validate this hypothesis. Based on our results, we use $∆ t = 10^{4}$ steps and $D_{\min, 0}^{2} = 0.25$ to label the particles in the dataset moving forward. 
As shown in Supplementary Figure S1, other evaluation metrics, including the AUC (Area Under the Curve), which measures the ability of the model to discriminate between classes at various thresholds, and the F1 score, which captures the balance between precision and recall, are consistent with the class-specific accuracy. The consistency across these metrics provides a more robust confirmation of the predictive capability of the model and suggests its generalizability and robustness.
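The hypothesis about short $∆ t$ discussed above can be illustrated numerically. For an activated process occurring at a constant rate $r = ω_{0} e^{- E_{a} / k_{b} T}$, the probability that at least one rearrangement occurs within $∆ t$ is $1 - e^{- r ∆ t}$. The sketch below evaluates this expression. The function and parameter names are hypothetical, and a constant rate is an idealization, since the local environment evolves during creep.

```python
import math


def rearrangement_probability(dt, attempt_freq, barrier, kT):
    """Probability of at least one activated event within dt.

    Idealization: a constant Arrhenius rate, i.e., an exponentially
    distributed waiting time.
    """
    rate = attempt_freq * math.exp(-barrier / kT)
    return -math.expm1(-rate * dt)   # equals 1 - exp(-rate * dt)
```

When $r ∆ t \ll 1$ the probability is approximately $r ∆ t$, so most particles sitting in rearrangement-prone environments would still be labeled tight, which is the mechanism proposed for a loss of accuracy at very short $∆ t$.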

3.3 Feature selection and physical interpretation of the most important features

Before training the final classification ML model used to predict plastic rearrangements, we carry out RFE to identify and select the most important of the 60 features in the dataset. To this end, we use the data from the ten simulations at $σ_{0} = 0.5$ and $T = 0.1$ , where the examples were labeled using $∆ t = 10^{4}$ and $D_{\min, 0}^{2} = 0.25$ . We perform RFE following the steps outlined in Section 2. The testing balanced accuracy as a function of the number of top-ranked features is shown in Figure 5A: the accuracy peaks at around 10 features and then decreases slightly and gradually as more features are added. This decrease can be attributed to various factors, including linear correlations between the features and the risk of overfitting that arises from the increased complexity of the model.
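The elimination loop behind RFE can be sketched generically as follows. This is a minimal illustration, not the workflow of Section 2: the callback `fit_importances` stands in for refitting the estimator on the surviving features, and the toy weights (as well as the feature name `Mean(V)`) are hypothetical.

```python
from typing import Callable, Dict, List, Sequence


def recursive_feature_elimination(
    feature_names: Sequence[str],
    fit_importances: Callable[[List[str]], Dict[str, float]],
) -> List[str]:
    """Rank features from most to least important.

    At each round the model is refit on the surviving features and the one
    with the smallest absolute importance is dropped.
    """
    remaining = list(feature_names)
    eliminated: List[str] = []
    while len(remaining) > 1:
        importances = fit_importances(remaining)
        weakest = min(remaining, key=lambda name: abs(importances[name]))
        remaining.remove(weakest)
        eliminated.append(weakest)
    # The last survivor ranks first; features dropped earlier rank lower.
    return remaining + eliminated[::-1]


# Toy stand-in for a fitted model: fixed, hypothetical coefficients.
toy_weights = {"Std(V)": 1.2, "Mean(V)": 0.1, "Min(D)": -0.8, "Std(A)": 0.5}


def toy_fit(names: List[str]) -> Dict[str, float]:
    return {name: toy_weights[name] for name in names}


ranking = recursive_feature_elimination(list(toy_weights), toy_fit)
print(ranking)
```

In a real setting the importances change after every refit, which is why the ranking is built by repeated elimination rather than by sorting the coefficients of a single fit.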

FIGURE 5. (A) Average testing balanced accuracy versus the number, n, of top-ranked features selected via RFE. Each point corresponds to the average over five models, each trained on an independent balanced sample generated through random undersampling. (B) Top 10 features ranked by RFE. (C) Confusion matrix, evaluated on the test set, for the model trained using the top 10 features ranked by RFE.
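Random undersampling, as used to build the balanced samples mentioned in the caption, can be sketched in a few lines. The function name, the 90/10 toy class split, and the seeds are assumptions made for illustration only.

```python
import random
from typing import Dict, List, Sequence


def random_undersample(labels: Sequence[int], rng: random.Random) -> List[int]:
    """Return indices of a class-balanced subsample.

    Every class is randomly reduced to the size of the smallest class.
    """
    by_class: Dict[int, List[int]] = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    n_min = min(len(ids) for ids in by_class.values())
    chosen: List[int] = []
    for ids in by_class.values():
        chosen.extend(rng.sample(ids, n_min))
    return sorted(chosen)


# Toy imbalanced dataset: 90 tight (0) and 10 loose (1) particles.
labels = [0] * 90 + [1] * 10

# Five independent balanced samples, one per model to be trained.
samples = [random_undersample(labels, random.Random(seed)) for seed in range(5)]
for sample in samples:
    n_loose = sum(labels[i] for i in sample)
    print(len(sample), "examples,", n_loose, "loose")
```

Each balanced sample keeps all minority examples and a different random subset of the majority class, so averaging over several samples reduces the dependence on any one draw.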

In Figure 5B, we show the top 10 features ranked by RFE, which will be used later to train the final model. Of these, 4 are SRFs and 6 are MRFs, suggesting that the medium-range order, which in the context of our work captures the interstitial environments of a particle’s neighbors (as determined by a Voronoi construction), plays a substantial role in determining plastic rearrangements. Interestingly, none of the selected SRFs ( $S t d (A)$ , $S t d (V)$ , $S t d (D)$ , and $Min (D)$ ) correspond to average quantities. The features related to the standard deviation encapsulate the variability in the particle’s local structural environment, whereas the minimum-related feature denotes an extreme of this environment. For example, a high standard deviation could suggest a high degree of heterogeneity in the structural environment, while a minimum could signify a limiting factor that precludes a plastic rearrangement. Overall, the selection of these SRFs suggests that the short-range structural heterogeneity and the distance to the closest neighbor play a significant role in the prediction of plastic events in KA glasses. Regarding the MRFs, 4 out of the 6 selected MRFs correspond to averages of SRFs related to non-mean summary statistics. This suggests that across mid-range length scales, beyond the nearest neighbors, the average heterogeneity significantly influences the occurrence of plastic rearrangements. Notably, half of the selected MRFs are associated with the selected SRFs, specifically $S t d (A)$ , $S t d (V)$ , and $Min (D)$ , further underlining the importance of these structural variables in the predictive process. Overall, the majority of the selected features relate to distances and volumes, with only 2 out of 10 related to areas. The prominence of distances may reflect the influence of local spatial configurations or the connectivity of the glass, when conceptualized as a graph. The significance of volume-based features could be indicative of the importance of local density fluctuations.

3.4 Predicting creep using the ML derived structural indicator looseness

As detailed in Section 2, we apply the EasyEnsemble algorithm with logistic regression as the estimator, using random undersampling to balance the dataset, to train an ML model that predicts the probability that a particle within the system is classified as loose (class 1). We refer to this prediction metric as looseness, $L$ , which, unlike previous machine learning-derived descriptors, such as softness (Cubuk et al., 2015; Cubuk et al., 2017; Liu et al., 2021), is bounded: $L \in [0,1]$ . The balanced accuracy of the model stands at 71.6% for the (balanced) training set and 70.7% for the (unbalanced) testing set, which is comparable to the accuracy reported in previous studies (Liu et al., 2021). The close agreement between training and testing accuracies indicates that our model generalizes well, correctly classifying over 70% of previously unseen particles (Figure 5C). Specifically, the model achieved an accuracy of 67.9% for the minority class of loose particles and 73.5% for the majority class of tight particles. Additionally, the AUC for the test set balanced using random undersampling is 0.772. The F1 score, calculated for each label and averaged with weighting based on the number of true instances for each label in the test set, is 0.848.
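As a rough sketch of how a bounded score such as $L$ can arise, the snippet below averages the class-1 probabilities of several logistic models, each imagined as trained on a different balanced subsample. The aggregation rule, the `(weights, bias)` representation, and all coefficient values are assumptions for illustration; the actual EasyEnsemble implementation may differ. The second function checks that the reported balanced accuracy equals the mean of the two class-specific accuracies.

```python
import math
from typing import List, Sequence, Tuple

LogisticModel = Tuple[Sequence[float], float]  # (weights, bias), hypothetical


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def looseness(features: Sequence[float], models: List[LogisticModel]) -> float:
    """Ensemble-averaged probability of class 1; always lies in [0, 1]."""
    probs = []
    for weights, bias in models:
        z = bias + sum(w * x for w, x in zip(weights, features))
        probs.append(sigmoid(z))
    return sum(probs) / len(probs)


def balanced_accuracy(class_accuracies: Sequence[float]) -> float:
    """Balanced accuracy is the unweighted mean of per-class accuracies."""
    return sum(class_accuracies) / len(class_accuracies)


# Two toy ensemble members with made-up coefficients.
models: List[LogisticModel] = [((1.0, -2.0), 0.0), ((0.5, -1.0), 0.0)]
print(looseness([2.0, 1.0], models))  # both members sit on their decision boundary

# Class-specific test accuracies quoted in the text: loose 67.9%, tight 73.5%.
print(round(balanced_accuracy([0.679, 0.735]), 3))
```

The threshold $L = 0.5$ then corresponds to the point where the averaged ensemble vote is evenly split between the two classes.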

Figure 6A shows the probability density of particles as a function of the squared non-affine displacements, $D_{\min}^{2}$ , and the predicted looseness, $L$ . This diagram was created as follows: for each interval in $D_{\min}^{2}$ , we calculated the probability density of the looseness of all the particles in the dataset with squared non-affine displacements within that range. The loose and tight populations of particles are well discriminated in the plane defined by $D_{\min}^{2}$ and $L$ , with the loose and tight particles being characterized by high and low $L$ values, respectively. Our results demonstrate that the thresholds used to label and classify particles, namely, $D_{\min, 0}^{2} = 0.25$ and $L = 0.5$ , effectively separate the particles into loose and tight (Figure 6A). Figure 6B shows the probability density of looseness for all particles, $p (L),$ (blue line), as well as the conditional probability of looseness for just the loose or class 1 particles, $p (L |1)$ (bars). The overall distribution, which captures the underlying unbalanced character of the dataset, shows that most of the particles are classified as tight, with the bulk of the looseness predictions lying between about $L = 0.1$ and $0.4$ . For $L > 0.5$ , $p (L)$ decays close to linearly. The conditional probability $p (L |1)$ reveals that about 71% of particles labeled as loose are assigned $L > 0.5$ by the model, which is, as it should be, consistent with the accuracy of the model. The conditional probability that a particle is labeled as loose given its predicted looseness, $p (1 |L)$ , follows an approximately exponential relation with $L$ (Figure 6C), which means that it is increasingly unlikely for a particle assigned a low $L$ by the model to be labeled as loose. For example, there is a one in a million chance for a particle with a looseness assignment of 0.2 to be labeled as loose. It is worth noting that because $L \in [0,1]$ is bounded, we expect the exponential relationship to break down for values of $L$ near the boundaries.
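The two conditional distributions discussed above can be estimated from paired (looseness, label) data with simple histograms. The sketch below is a minimal illustration on made-up values; the bin count and the toy data are assumptions, and no smoothing is applied.

```python
from typing import List, Sequence, Tuple


def conditional_histograms(
    looseness: Sequence[float], labels: Sequence[int], n_bins: int = 10
) -> Tuple[List[float], List[float]]:
    """Estimate p(L|1) as a density and p(1|L) as a fraction, per bin of L."""
    width = 1.0 / n_bins
    total = [0] * n_bins
    loose = [0] * n_bins
    for value, label in zip(looseness, labels):
        b = min(int(value * n_bins), n_bins - 1)  # L = 1 falls in the last bin
        total[b] += 1
        loose[b] += label
    n_loose = sum(loose)
    # Density of L among loose particles (integrates to 1 over [0, 1]).
    p_l_given_1 = [c / (n_loose * width) if n_loose else 0.0 for c in loose]
    # Fraction of particles in each bin that carry the loose label.
    p_1_given_l = [l / t if t else float("nan") for l, t in zip(loose, total)]
    return p_l_given_1, p_1_given_l


# Toy data (hypothetical): six particles, two of them labeled loose.
toy_l = [0.1, 0.15, 0.3, 0.6, 0.7, 0.9]
toy_y = [0, 0, 0, 1, 0, 1]
p_l_given_1, p_1_given_l = conditional_histograms(toy_l, toy_y, n_bins=2)
print(p_l_given_1, p_1_given_l)
```

Integrating the estimated $p (L |1)$ over $L > 0.5$ gives the fraction of loose-labeled particles that the model classifies as loose, which is the quantity compared with the model accuracy in the text.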

FIGURE 6. Probability distributions of looseness, $L$ . The statistics shown correspond to predictions and labels of the entire dataset (training plus testing). (A) Probability density of particles as a function of the squared non-affine displacements, $D_{\min}^{2}$ , and their predicted looseness, $L$ . The threshold $D_{\min, 0}^{2} = 0.25$ used to label particles as loose or tight is shown in the plot as a horizontal red line. The results shown in this panel were smoothed out using a very light Gaussian filter. (B) Probability density of looseness for all particles, $p (L),$ (blue line), as well as the conditional probability of looseness for just the loose particles (class 1), $p (L |1)$ (bars). (C) Conditional probability of a particle being labeled as loose given its looseness, $p (1 |L)$ .

The time evolution of the average looseness in the glass, $〈L〉$ , where the angular brackets indicate an ensemble average over all the particles in the glass, is shown in Figure 7A. The overall looseness of the system decreases as the glass creeps, which is consistent with the fact that the glass structure becomes less conducive to plastic rearrangements over time. Interestingly, this decrease in looseness approximately follows a logarithmic time dependence, $〈L (t)〉 \propto - \log t$ , which is reminiscent of the evolution of the average macroscopic strain (Figure 7B): $〈ε (t)〉 \propto - \log t$ . Our results suggest that $L$ , a machine-learned local descriptor based on simple, interpretable structural quantities, not only serves as an effective tool to predict microscopic plastic rearrangements in the KA glass during creep, but also, through its ensemble average, $〈L〉$ , correlates with the macroscopic creep response of the glass.
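A logarithmic time dependence of this kind can be checked with an ordinary least-squares fit of the form $〈L〉 \approx a + b \ln t$. The sketch below uses synthetic data with made-up coefficients (the natural logarithm and the values of `a` and `b` are assumptions); it is not a fit to the simulation results.

```python
import math
from typing import Sequence, Tuple


def fit_log_time(times: Sequence[float], values: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of values ~ a + b * ln(t); returns (a, b)."""
    xs = [math.log(t) for t in times]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, values))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


# Synthetic, noise-free example with hypothetical coefficients.
times = [1e4, 1e5, 1e6, 1e7]
avg_looseness = [0.4 - 0.01 * math.log(t) for t in times]

a, b = fit_log_time(times, avg_looseness)
print(f"a = {a:.4f}, b = {b:.4f}")  # a negative b means looseness decays with log t
```

Applying the same fit to the strain series would allow the two logarithmic slopes to be compared directly.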

FIGURE 7. Time evolution of (A) the average looseness, and (B) the macroscopic strain response of the KA glass. Each point and its corresponding error bars represent the average and standard deviation, respectively, at each time, over the ten independent MD runs.

3.5 Fluctuations, and spatial and temporal autocorrelations of looseness

We characterize the scaling of the fluctuations of the looseness field as a function of system size, $Δ L^{*} (N)$ , as described in Section 2. Figure 8A shows that, overall, as the system size increases, the fluctuations become smaller. It is worth noting that for a size equal to the entire simulated system (i.e., $N = 8,000$ ), the fluctuations become (artifactually) zero, which implies that our analysis is only valid for $N < 8,000$ . It is well established from equilibrium statistical mechanics that the fluctuations of thermodynamic properties scale with system size as $\propto N^{- 1 / 2}$ . We observe that for $N \geq 10^{2}$ , $Δ L^{*}$ scales with system size in a manner similar to equilibrium fluctuations, which is somewhat unexpected considering the non-equilibrium nature and heterogeneity of the glasses. The space autocorrelation of $L$ , ${S A C F}_{L} (Δ r)$ , shown in Figure 8B, reveals short-range spatial correlations only, with a decay length scale of $\sim 0.63 σ$ . Beyond that, spatial correlations are lost, which is consistent with the lack of long-range order in the KA glass.
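The $N^{- 1 / 2}$ reference scaling, and the artifactual vanishing of fluctuations when a subsystem spans the whole system, can both be reproduced with uncorrelated synthetic data. The sketch below assumes fluctuations are measured as the standard deviation of the mean looseness over randomly drawn subsets of $N$ particles; this is an assumption for illustration and ignores the spatial structure used in the actual definition of $Δ L^{*}$ in Section 2.

```python
import math
import random
from typing import List


def block_fluctuation(
    values: List[float], block_size: int, n_blocks: int, rng: random.Random
) -> float:
    """Standard deviation of the mean over random subsets of block_size values."""
    means = []
    for _ in range(n_blocks):
        block = rng.sample(values, block_size)  # sampling without replacement
        means.append(sum(block) / block_size)
    mu = sum(means) / n_blocks
    return math.sqrt(sum((m - mu) ** 2 for m in means) / (n_blocks - 1))


rng = random.Random(0)
# Synthetic, uncorrelated "looseness" values for 8,000 particles.
values = [rng.random() for _ in range(8000)]

fluct = {n: block_fluctuation(values, n, 200, rng) for n in (10, 100, 1000)}
for n, f in fluct.items():
    print(f"N = {n:5d}  fluctuation = {f:.4f}")

# Log-log slope between N = 10 and N = 100; close to -1/2 for uncorrelated data.
slope = math.log(fluct[100] / fluct[10]) / math.log(10.0)
print(f"estimated exponent = {slope:.2f}")
```

Because the subsets are drawn without replacement from a finite system, the measured fluctuation falls below the $N^{- 1 / 2}$ line as $N$ approaches the system size and vanishes (up to rounding) at $N = 8,000$.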

FIGURE 8. (A) Fluctuations of looseness field, $Δ L^{*}$ , as a function of system size given by the number of particles, $N$ . (B) Space autocorrelation function of $L$ , ${S A C F}_{L} (Δ r)$ .

As described in Section 2, in order to characterize the temporal autocorrelations, ${T A C F}_{L^{*}} (Δ t)$ , we discretize space into voxels and map the looseness of individual particles to each voxel. That transformation allows us to track the time correlations of the looseness field, $L^{*} (r, t)$ , in a reference frame that does not depend on the ever-evolving position of individual particles. The ${T A C F}_{L^{*}} (Δ t)$ , shown in Figure 9A, is characterized by a very sharp drop over the first time interval due to the relatively large random fluctuations of looseness between consecutive configurations output for analysis. If one subtracts this effect, the ${T A C F}_{L^{*}}$ decays over $Δ t \approx 2 \times 10^{6}$ steps, after which it becomes slightly negatively correlated, and finally it slowly decorrelates over the timescale of the simulation (i.e., $10^{7}$ steps). The negative correlation observed in Figure 9A can be explained by the evolution of looseness at the single voxel level (an example is shown in Figure 9B), which does not change gradually over time, but rather undergoes sudden changes in value. A careful look at the time autocorrelation of individual voxels in space, ${T A C F}_{L^{*}}^{i}$ , reveals that the response is highly heterogeneous (Figure 9C), and therefore the average correlation shown in Figure 9A does not reveal the full picture. We observe that for some voxels the decay is almost instantaneous, while for others it lasts over $2$ to $4 \times 10^{6}$ steps (Figure 9B). We quantify the heterogeneity in the relaxation time scales in Figure 9D, where we plot the probability distribution of times at which the ${T A C F}_{L^{*}}$ for individual voxels crosses zero, $t_{C} = Δ t ({T A C F}_{L^{*}} = 0)$ . We observe a clear power-law distribution of correlation time scales, with an exponent close to $- 1$ . We also observe a limit to the power-law behavior at $t_{C}^{*} \approx 3 \times 10^{6}$ steps, beyond which the probability of observing the looseness of a given voxel decorrelating more slowly quickly becomes null. It is likely that $t_{C}^{*}$ depends on the particular glass model and loading conditions, $σ_{0}$ and $T$ .
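The per-voxel analysis can be sketched as follows: compute a normalized autocorrelation of a single time series and record the first lag at which it reaches zero or below. The normalization and the zero-crossing criterion used here are assumptions (the definitions of Section 2 may differ), and the step-like series is a toy example chosen to mimic a sudden change in looseness.

```python
from typing import List, Optional, Sequence


def autocorrelation(series: Sequence[float], lag: int) -> float:
    """Normalized autocorrelation of the mean-subtracted series at a given lag."""
    n = len(series)
    mean = sum(series) / n
    dev = [s - mean for s in series]
    var = sum(d * d for d in dev) / n
    if var == 0.0:
        return float("nan")  # undefined for a constant series
    cov = sum(dev[i] * dev[i + lag] for i in range(n - lag)) / (n - lag)
    return cov / var


def first_zero_crossing(series: Sequence[float]) -> Optional[int]:
    """Smallest lag at which the autocorrelation is <= 0, or None if never."""
    for lag in range(1, len(series)):
        if autocorrelation(series, lag) <= 0.0:
            return lag
    return None


# Toy voxel whose looseness jumps suddenly halfway through the record.
step_series: List[float] = [1.0] * 4 + [0.0] * 4

for lag in range(5):
    print(lag, round(autocorrelation(step_series, lag), 3))
print("zero crossing at lag", first_zero_crossing(step_series))
```

For this step-like series the autocorrelation turns negative at long lags, illustrating how abrupt changes at the single-voxel level can produce the negative correlations seen in the averaged function.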

FIGURE 9. (A) Time autocorrelation function of the looseness field, ${T A C F}_{L^{*}} (Δ t)$ . (B) Example of the evolution of looseness for a select voxel. (C) Time autocorrelation function of the looseness field, ${T A C F}_{L^{*}}^{i} (Δ t)$ , for some individual voxels. (D) Probability distribution of the relaxation timescales of looseness for individual voxels, $t_{C} = Δ t ({T A C F}_{L^{*}} = 0)$ .

4 Conclusion

In this study, we used a machine-learning (ML) classification model based on logistic regression trained with data from molecular dynamics (MD) simulations of Kob-Andersen (KA) glasses to derive a local structural descriptor, termed looseness, $L$ , which correlates strongly with the propensity of particles to undergo plastic rearrangements during creep. Unlike other ML-derived structural descriptors such as softness (Cubuk et al., 2015; Liu et al., 2021), looseness is based on straightforward, interpretable features and yields a true probability bounded between 0 and 1. Our model can predict with an accuracy exceeding 70% whether an unseen particle within a KA glass will undergo a plastic rearrangement within a certain time interval. We showed that the evolution of the average looseness of the glass system, $〈L〉$ , mirrors the logarithmic time dependence observed in creep strain. This correlation highlights the link our model is able to establish between the microscopic dynamics at the single particle level over short time scales and the long-term macroscopic creep response of the KA glass. Our feature importance analysis revealed that none of the selected Short Range Features (SRFs) correspond to average quantities. Rather, features related to the extremal summary statistics of the interstitial structural environment dominate, emphasizing the critical role of short-range structural heterogeneity in predicting plastic rearrangements in KA glasses. Moreover, over half of the most important features were associated with the medium-range structural order of the glass, which highlights the importance of this length scale in predicting plastic rearrangements. Furthermore, our analysis of the spatial correlations of looseness revealed correlations only up to the medium-range length scale, beyond which the correlations die off, a finding that aligns with the lack of long-range order typical of the KA glass. Our examination of the temporal correlations of looseness unveiled a power-law distribution of relaxation timescales, which is reminiscent of the dynamic heterogeneity often postulated for glassy systems (Flenner and Szamel, 2010).

In conclusion, our research underscores the substantial predictive power of ML-derived structural indicators in systems experiencing concurrent stress and thermal excitations. Nonetheless, future research will be required to untangle the intricate interplay between thermal fluctuations and mechanical activation of structural defects in disordered solids, and how each contributes to the overall mechanical behavior of the system.

Data availability statement

The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.

Author contributions

MW: Methodology, Software, Visualization, Writing–Original draft. LR: Conceptualization, Methodology, Software, Supervision, Writing–Original draft.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmats.2023.1272355/full#supplementary-material

References

Ashwin, S. S., and Sastry, S. (2003). Low-temperature behaviour of the Kob–Andersen binary mixture. J. Phys. 15 (11), S1253–S1258. doi:10.1088/0953-8984/15/11/343

3.2 Understanding the effect of $D_{\min, 0}^{2}$ and $∆ t$ on accuracy of the ML predictions