Abnormal low-magnitude seismicity preceding large-magnitude earthquakes

Girona, Társilo; Drymoni, Kyriaki

doi:10.1038/s41467-024-51596-z

Download PDF

Article
Open access
Published: 28 August 2024

Abnormal low-magnitude seismicity preceding large-magnitude earthquakes

Nature Communications volume 15, Article number: 7429 (2024) Cite this article

8497 Accesses
3 Citations
215 Altmetric
Metrics details

Subjects

Abstract

Unraveling the precursory signals of potentially destructive earthquakes is crucial to understand the Earth’s crust dynamics and to provide reliable seismic warnings. Earthquake precursors are ambiguous, but recent experimental studies suggest that robust warning signs may precede large seismic events in the short (day-to-months) term. Here, we show that the M6.4-M7.1 2019 Ridgecrest sequence (California) and the M7.1 2018 Anchorage earthquake (Alaska) were preceded by up to ~3 months of tectonic unrest on regional scales, as evidenced by abnormal low-magnitude seismicity spreading over the ~15-25% of Southern California and Southcentral Alaska. This precursory unrest has been discovered with an algorithm that integrates an innovative random forest machine learning approach and statistical features built from earthquake catalogs. Supported by a novel suite of finite element solid mechanics models, we propose that precursory, abnormal, low-magnitude seismicity arises if the pore fluid pressure within large fault segments escalates significantly as they approach failure, which leads to major uneven changes in the regional stress field. Our findings and method may open up new perspectives for surveillance agencies to anticipate when a region approaches an earthquake of great magnitude weeks to months before it occurs.

A comprehensive suite of earthquake catalogues for the 2016-2017 Central Italy seismic sequence

Article Open access 18 November 2022

Improving earthquake prediction accuracy in Los Angeles with machine learning

Article Open access 18 October 2024

Insights on earthquake nucleation revealed by numerical simulation and unsupervised machine learning of laboratory-scale earthquake

Article Open access 21 November 2024

Introduction

Enhancing our ability to forecast the spatial and temporal distribution of large magnitude, and thus potentially destructive earthquakes, is a formidable challenge in modern seismology^1,2,3. Earthquake forecasting has been explored since the last century via a plethora of different methods aiming to detect geophysical, geochemical, and even biological precursors^4,5,6,7,8. Some of these possible precursors include thermal anomalies on the Earth’s surface^9,10,11; emissions of acoustic¹² and acoustic-gravity¹³ waves; changes in groundwater level and composition^14,15; release of radon and other gases^16,17; electric, magnetic, and ionospheric perturbations^18,19; crustal deformation²⁰; the emergence of slow slip events^21,22; and even anomalous behavior of farm animals²³. Earthquake precursors are still unreliable despite recent advances in data acquisition and analysis, and some of those precursors have been categorically questioned^24,25. Hence, the development of alert level systems for seismic activity, similar to those successfully implemented by surveillance agencies to track volcanic unrest²⁶, remain a chimera.

Precursory activity to large-magnitude earthquakes has also been investigated by tracking variations in the slope of the Gutenberg–Richter relationship (or b-value^27,28,29) and by exploring other anomalous statistical changes in the low-magnitude seismicity^30,31,32. In particular, low-magnitude seismicity has been proposed to experience changes over wide areas prior to large earthquakes³³, although previous analyses are inconclusive because the spatiotemporal distribution of seismic events is generally complex and specific precursory patterns are challenging to recognize and generalize^1,24. This might be alleviated thanks to the advent of openly available, high-quality, earthquake catalogs; and to new supervised and/or unsupervised machine learning-based frameworks³⁴, which may be able to identify subtle and nonlinear hidden precursory patterns from previous experience^35,36. Supervised machine learning-based frameworks have already been successfully applied to accurately anticipate laboratory earthquakes from pre-failure acoustic signals³⁷ and from the spatiotemporal deformation of analog tectonic plates³⁸, thus suggesting that similar approaches might eventually be able to anticipate real earthquakes from seismic, acoustic, and space geodesy data.

In this manuscript, we demonstrate through a new multivariate, supervised, machine learning-based algorithm that low-magnitude seismicity can alert about impending large-magnitude earthquakes in Southern California and Southcentral Alaska. Unlike previous studies focused on providing specific forecasts of the timing, magnitude, and/or epicentral location of large earthquakes and aftershocks^35,39,40, we exploit machine learning to explore the emergence of robust, nonlinear, precursory patterns that may be hidden in low-magnitude seismicity. This concept is applied to two target events: The 2019 Ridgecrest earthquake sequence and the 2018 Anchorage earthquake (Fig. 1). The former was the strongest seismic sequence occurring in Southern California in two decades⁴¹ and included an M6.4 earthquake occurring on July 4 and an M7.1 earthquakes occurring on July 5. The latter, the most societally significant seismic event occurring in Southcentral Alaska in half a century⁴², was an M7.1 earthquake occurring on November 30. In particular, our machine learning-based algorithm is trained to determine whether the aforementioned events were preceded by anomalies in the low-magnitude seismicity (defined here as earthquakes with magnitude between \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}}=1\) and \({{{{\rm{M}}}}}_{{{{\rm{Max}}}}}=6\)), akin to the activity preceding other large-magnitude earthquakes (defined here as earthquakes with magnitude \(\ge 6.4\), i.e., the magnitude of the first event of the Ridgecrest sequence) of Southern California and Southcentral Alaska. A detailed description of our algorithm is provided in “Methods”.

**Fig. 1: Areas explored in this project.**

Results

Below we show the results obtained when running our machine learning-based algorithm at the epicentral location of the first event of the 2019 Ridgecrest sequence and at the epicentral location of the 2018 Anchorage earthquake (Fig. 2). These results are expressed in terms of the degree or probability of unrest (P_un), which is defined as the ratio of decision trees classifying as anomalous a set of statistical features obtained from the earthquake catalog. Note that P_un can also be interpreted as the probability of a large-magnitude earthquake (\({{{\rm{M}}}}\ge 6.4\)) occurring within the following 30 days (see “Methods”). Our model reveals that the epicenter of the target earthquakes showed signs of unrest for several weeks before the occurrence of the events. In particular, the degree of unrest is very low, with small spikes never exceeding ~5%, until just ~40 days before the Ridgecrest earthquakes (Fig. 2a). Then, mean and maximum values of P_un increased quickly to ~20–25% and ~75%, respectively, and remained that high up to the first sequence shock. After the M6.4 event, the maximum probability of unrest increased to ~90%, probably controlled by aftershocks, before it decreased again to background levels (≲5%) around three months after the sequence. On the epicenter of the 2018 Anchorage earthquake, the maximum probability of unrest increased abruptly to ~80% around three months before the event, with mean and maximum P_un reaching up to ~35% and ~85%, respectively, just a few days before the earthquake (Fig. 2b). Our model also reveals an anomalous phase from ~700 days to ~300 days before the Anchorage earthquake, although mean and maximum values of the probability of unrest are much lower (<15% and <55%, respectively) than those observed before the Anchorage shock. Similar to the Ridgecrest sequence, our model reveals an increase of P_un after large-magnitude events in Alaska, reaching up to ~99% over the three months following the Anchorage earthquake. Around 170 and 250 days after the 2018 Anchorage event, the probability of unrest decreased below ~20% and ~5%, respectively.

**Fig. 2: Temporal evolution of the probability of unrest (P_un; or probability that a large-magnitude earthquake happens in 30 days or less), from 1000 days before to 400 days after the events.**

Epicentral locations are not known before an earthquake, so a more realistic and useful analysis consists of running our machine learning-based algorithm on the geographical coordinates of grids covering all of Southern California and Southcentral Alaska (we used grids with 0.1° latitude × 0.1° longitude spacing). This approach allows us to build probability maps and thus explore the spatiotemporal evolution of abnormal low-magnitude seismicity in the run-up to large-magnitude earthquakes. Our analysis reveals that anomalous low-magnitude seismicity is not constrained to the epicentral area of large-magnitude events, but spreads over multiple fault zones of the regions explored (Figs. 3 and 4 and Supplementary Movies 1 and 2). In particular, around 90 days before the 2019 Ridgecrest sequence, unrest first emerged in the southern tip of San Joaquin Valley (e.g., Mt. Poso fault, and Wheeler Ridge and Pleito fault zones). Then, ~60 days before the sequence, abnormal low-magnitude seismicity spread over the California–Nevada-Arizona border region, including the Indian Springs Valley faults and Las Vegas area (e.g., Eglington and Frenchman Mountains faults) (Fig. 3a, b). Later, ~30 days before the sequence, unrest kept spreading in the California–Nevada-Arizona border region and emerged in the epicentral area of the Ridgecrest events (e.g., Little Lake and Panamint Valley—southern section—fault zones); southwest flank of Sierra Nevada mountains (e.g., Ken Canyon fault); Antelope Valley (e.g., Lenwood–Lockhart and Helendale–South Lockhart fault zones; San Andreas fault –Mojave section-); San Gabriel Mountains (e.g., San Gabriel, San Jacinto, Sierra Madre fault zones); Los Angeles county (e.g., Raymond, Santa Monica, and Elsinore fault zones); and the Southern Channel Islands (up to San Clemente Island; e.g., Palos Verdes, San Pedro Basin, Catalina fault zones). All these areas continued showing anomalous seismic behavior until the occurrence of the Ridgecrest events (Fig. 3c). In the run-up to the 2018 Anchorage earthquake, unrest emerged in two main areas of Southcentral Alaska: Center-northeast (e.g., Cook Inlet Folds, Castle Mountain faults, northern Kenai Peninsula, Copper River Basin, Denali fault –west Muldrow–Alsek section–, Billy Creek faults; Fig. 4a, b) and south-southeast (e.g., Alaska–Aleutians subduction zone –Aleutian Megathrust–; Kodiak shelf fault zone; Fig. 4c) around 80 and 10 days before the earthquake, respectively. More information on the main fault zones present in the areas where unrest is detected can be found in the US Geological Survey’s interactive Quaternary faults database (https://doi.org/10.5066/F7S75FJM).

**Fig. 3: Spatiotemporal evolution of the maximum probability of unrest (P_un; or probability that a large-magnitude earthquake happens in 30 days or less) in Southern California.**

**Fig. 4: Spatiotemporal evolution of the maximum probability of unrest (P_un; or probability that a large-magnitude earthquake happens in 30 days or less) in Southcentral Alaska.**

The analysis of the total area experiencing tectonic unrest (i.e., area with the maximum probability of unrest \(\ge\)50%) reveals three major outcomes (Fig. 5). First, small-scale, isolated patches of tectonic unrest emerged sporadically in the target regions (Figs. 3a and 4a). These patches constitute a background noise level that do not warn of large-magnitude earthquakes, and that might be associated with failed large-magnitude events or with local anomalous patterns produced by the swarms of low-to-mid size earthquakes. For example, we find that ~1.5–7% of Southern California’s total area experimented unrest from ~1000 days to ~100 days before the 2019 Ridgecrest sequence (Fig. 5). This analysis also reveals that the anomalous phase detected on the epicenter of the Anchorage earthquake from ~700 to ~300 days before its occurrence (Fig. 2b) is just a spurious signal associated with spatial background noise. Second and most important, the area experiencing unrest spread significantly in the run-up to large earthquakes (Figs. 3b, c and 4b, c). In particular, we find that the area with abnormal low-magnitude seismicity in Southern California and Southcentral Alaska increased by up to one order of magnitude (from ~2% to ~17%, and from ~2% to ~22%, respectively) during the three months preceding the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake (Fig. 5). Third, the area and probability of unrest around the epicenters suddenly increased after large earthquakes, but the regions returned back to background levels shortly after (Fig. 5; Supplementary Movie 1 and 2). Our model also shows a gradual long-term (years) decreasing trend of the unrest area from ~1000 days to ~500 days before the 2018 Anchorage earthquake (Fig. 5), which may reflect the return to background levels after the occurrence of two other large earthquakes in the region (M6.4 earthquake occurring 70 km SSW of Redoubt volcano on July 20, 2015; and M7.1 earthquake occurring 86 km east of Old Iliamna on January 24, 2016; see Supplementary Movie 2). It is worth noting that we used the same algorithm both for Southern California and Southcentral Alaska.

**Fig. 5: Temporal evolution of the area (in percentage) of Southern California (red) and Southcentral Alaska (blue) that is subject to unrest.**

The sensitivity of our results to the key hyperparameters and statistical features of our machine learning-based algorithm is also explored through a parametric analysis. In particular, we run our algorithm for a total of 20 different scenarios varying the definition of low-magnitude seismicity (i.e., using different values for \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}}\) and \({{{{\rm{M}}}}}_{{{{\rm{Max}}}}}\)), the unrest period assumed for training (from T_unrest = 10 days to T_unrest = 100 days), and the radius of the region of interest where the statistical features are calculated (from \({{{\rm{R}}}}=20\) km to \({{{\rm{R}}}}=150\) km) (Supplementary Table 1). In addition, we run five other scenarios in which our algorithm is trained with each of the statistical features separately (Supplementary Table 1). For simplicity, the rest of parameters of our model are kept the same for all the runs (time series length \({{{\rm{T}}}}=2\) years, 1-year backward sliding windows, 1-day time steps, and N = 50 random nodes used for training); more details on the different hyperparameters and statistical features used in the model can be found in Methods. The parametric analysis reveals that: (i) Our results are little sensitive to the hyperparameters, thus demonstrating the robustness of the method; (ii) precursory patterns emerge more significantly when low-magnitude seismicity is broadly defined (i.e., when using \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}}=1\) and \({{{{\rm{M}}}}}_{{{{\rm{Max}}}}}=6\)), when a short unrest period is assumed for training (\({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}} \, \lesssim \, 30\) days), and for large regions of interest (\({{{\rm{R}}}}\gtrsim 120\) km); (iii) precursory patterns vanish if using \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}} \,\gtrsim \, 1.5\), thus suggesting that regional unrest preceding large-magnitude earthquakes is mostly reflected in the very low-magnitude seismicity; and (iv) precursory patterns mostly reflect anomalous variations in the depth of the low-magnitude earthquakes, while the statistical feature reflecting anomalous variations in magnitude is the least relevant. A detailed description of the parametric exploration and results can be found in Supplementary Tables 1 and 2.

Discussion

Interpretation of our results: dissimilar accumulation of stresses

The fact that the observed anomalies mostly reflect abnormal, very low-magnitude (\(1 \, \lesssim \, {{{\rm{M}}}} \, \lesssim \, 1.5\)) seismicity suggests that the probability of unrest may elevate due to temporal variations of the magnitude of completeness in the catalog, which might be produced by mid-magnitude foreshocks preceding the main shocks⁴³. To test this hypothesis, we adopted the approach outlined by Zhuang et al.⁴³ and plotted event magnitudes against the seismic event order across multiple sectors in Southern California and Southcentral Alaska, which coincide with areas with and without precursory anomalies (Supplementary Figs. 1–3). None of these plots reveal any obvious variation in catalog completeness in the weeks to months before the Ridgecrest and Anchorage earthquakes, whereas incompleteness clearly manifests after the occurrence of the main shocks (Supplementary Figs. 2a and 3a).

To further test the catalog completeness hypothesis, we plotted the Gutenberg–Richter (GR) relationship using the low-magnitude seismicity occurring around the epicenters of the target events (within a 120 km radius, as used in our method) and for each moving window used to calculate the statistical features feeding our machine learning-based algorithm (Supplementary Figs. 4 and 5). In addition, we calculated the temporal evolution of the absolute value of the slope of the GR relationship (m_GR) for magnitudes between M = 1 and M = 3 (Supplementary Fig. 6); note that m_GR is the so-called b-value²⁷ only if the magnitude of completeness is below M = 1, and that m_GR is expected to decrease if the magnitude of completeness is above M = 1 and if it increases in the run-up to the target events. Our analysis reveals that, in the run-up to the Ridgecrest sequence and the Anchorage earthquake, the GR law holds for low magnitudes as well (i.e, in the range from M = 1 to M = 3–4, and thus the magnitude of completeness is below M = 1); and that m_GR does not show any precursory decreasing trend or anomaly. All this together confirms that the precursory abnormal seismicity reported in this work is not an artifact produced by a decline in catalog quality and, consequently, suggests that our method does indirectly identify regional stress alterations taking place in the shallow crust during the preparatory phase of major earthquakes⁴⁴.

Interestingly, there was a noteworthy swarm of low-magnitude earthquakes approximately 30 days prior to the Ridgecrest events around the epicentral area of the sequence (Sector 1; see Supplementary Fig. 1a) and in the area spanning from Antelope Valley to Southern Chanel Islands (Sector 2; Supplementary Fig. 1a) (Fig. 6). This swarm coincides with a big spread of anomalous seismicity, as detected by our algorithm; however, the first anomalous behavior in the aforementioned areas arises ~90 days before the sequence, i.e., around two months before the occurrence of the swarm (Fig. 6a, b). In addition, the probability of unrest increases in the California–Nevada-Arizona border region (Sector 3; see Supplementary Fig. 1a) around 60 days before the Ridgecrest sequence, yet this surge does not align with any swarm of small-scale seismic events in that area (Fig. 6c), same as in other areas where no anomaly is detected (Fig. 6d, e). Moreover, the area subject to unrest in Southern California starts to increase ~70 days before the occurrence of the swarm (Fig. 5), and precursory unrest without noticeable swarms of small-scale seismic events is also observed for Alaska (Fig. 7). Collectively, these observations suggest that while precursory unrest may be influenced to some extent by local swarms of low-magnitude earthquakes, it is not unequivocally linked to such swarms. Instead, precursory unrest seems to be associated with nonlinear, multivariate, subtle statistical fluctuations in the spatiotemporal distribution of low-magnitude seismicity; this phenomenon demands further investigation and analysis, and a potential explanation is provided in the following.

**Fig. 6: Exploration of the possible link between abnormal low-magnitude seismicity and the occurrence of seismic swarms in Southern California.**

**Fig. 7: Exploration of the possible link between abnormal low-magnitude seismicity and the occurrence of seismic swarms in Southcentral Alaska.**

Several studies have hypothesized that low-magnitude seismicity should undergo changes over vast areas prior to large-magnitude earthquakes if the regional stress field varies when large, locked fault segments get closer to failure. This hypothesis has been explored by analyzing the energy release and accelerating seismicity patterns before large earthquakes, and by coupling Coulomb stress analyses with fault modeling^33,45. Faults are usually modeled in seismology as dislocations and cracks without internal structures⁴⁶; whereas in other disciplines, such as volcano tectonics, faults are commonly modeled as elastic inclusions with an internal mechanical structure that allows accounting for the shear stress accumulation inside and around them, their slip potential, or permeability⁴⁷. The latter approach allows considering faults as shear fractures composed of two hydromechanical units^47,48: The damage zone, primarily composed of fractures and veins (fractured media); and the fault core, which consists of breccia and gouge (porous media).

Studies on fault architecture in earthquake mechanics have shown that active faults concentrate stresses around them before failure^49,50,51. Stress concentration, in turn, generates interconnected fractures in the damage zone, which increase fault permeability and porosity⁴⁷. The latter induces the circulation of crustal fluids, which in turn increases the fluid pressure and enhances the aperture of other fractures in the damage zone⁴⁷. This positive feedback also enhances a reduced cohesion and a friction decrease, which lead to a decreasing fault stiffness before failure^52,53. For that reason, active faults can be characterized by higher fluid circulation and lower stiffness in the run-up to failure.

Below, we use the elastic inclusion fault modeling approach to explore the regional stress changes that may occur in the run-up to large-magnitude earthquakes, depending on fault stiffness and regional loading conditions. This exploration is performed by running a suite of finite element method (FEM) solid mechanics 2D models built with the COMSOL Multiphysics (v6.2) software (a detailed description of our FEM models’ buildup is provided in Methods). In particular, we investigated how the existence of a large (850 km), locked fault (mimicking a fault segment responsible for large-magnitude seismicity; LF from now on) can affect the von Mises stress accumulation in an adjacent (~250 km away) network of small (40–97 km), randomly distributed, locked fault lines (mimicking fault segments responsible for low-magnitude seismicity) upon two geometrical settings and two end-member conditions that are expected to precede large-magnitude earthquakes (Fig. 8): (1) Progressive loading of the region with constant LF stiffness (0.01 GPa), for which we applied horizontal compression in the numerical domain from 10 MPa to 20 MPa, consistent with borehole data and field and geodetic analysis in California^54,55,56; and (2) constant loading (20 MPa) in the region with a progressive decrease of the LF stiffness of up to four orders of magnitude, ranging from 10 GPa to 0.01 GPa. For this analysis, we assume constant stiffness for the small faults of the network for simplicity.

**Fig. 8: 2D FEM models of a faulted domain that gradually reaches the conditions for failure.**

Under Scenario 1, the von Mises stress (further details in “Methods”) linearly increases by a factor of two throughout the domain (Fig. 8a–d). For instance, around Faults 1 to 4 of the network, the maximum von Mises stress rises from ~2.7 to ~5.5 MPa, from ~3.3 to ~6.7 MPa, from ~4.7 to ~9.4 MPa, and from ~7.8 to ~15.6 MPa, respectively (Fig. 9a). This indicates that as a large-magnitude earthquake approaches, compression propels all the small faults within the region toward failure. While this may elucidate a heightened rate of low-magnitude seismicity in the region, as reported prior to some large-magnitude earthquakes³³, it remains unclear how this mechanism could induce the precursory, nonlinear, statistical anomaly found in this study.

**Fig. 9: Results of our 2D FEM numerical models.**

Conversely, when we impose a constant compression (20 MPa) but progressively decrease the LF stiffness (Scenario 2), a dissimilar and nonlinear concentration of stresses emerges in the region (Fig. 8e–h). Specifically, the reduction in LF stiffness leads to varying increases and decreases in the accumulated von Mises stress at different small faults of the network (Fig. 9b). For instance, the maximum von Mises stress around Fault 3 declines non-linearly from ~12.7 MPa to ~9.4 MPa as the LF stiffness drops by four orders of magnitude, from 10 GPa to 0.01 GPa. In contrast, the von Mises stress around Faults 1, 2, and 4 increases non-linearly by ~2.8 Mpa, ~2.1 MPa, and ~2.4 Mpa, respectively. The magnitude and sign of this variation are subject to: (i) The stiffness of the LF, with more significant stress changes occurring when the LF becomes very soft (\(\lesssim\)0.1 GPa); (ii) the arrangement of small faults within the network, as interactions between closely situated soft faults lead to more pronounced stress distribution changes; and (iii) the distance from the network to the LF, since greater stress accumulation occurs closer to the LF. In short, the run-up to large-magnitude earthquakes can lead certain small faults in the region closer to failure, while simultaneously pushing others further away from failure. This uneven behavior is a consequence of the softening of large regional faults, characterized by a reduction in stiffness due to heightened fluid circulation.

As part of our parametric analysis, our numerical study was extended to additional 2D and 3D geometrical settings to provide further insights on how the strike and dip of faults may affect the distribution of von Mises stresses (Supplementary Fig. 7). Our findings indicate that varying strike and dip angles influence fault interactions and the accumulation of von Mises stresses in the region; nevertheless, the models confirm the main conclusions drawn from the simpler 2D scenario outlined above (Supplementary Fig. 8): A decrease in LF stiffness leads to a regional, dissimilar concentration of stresses. It is worth noting that the different 2D and 3D scenarios analyzed, which incorporate various strike and dip configurations, simulate conditions expected for earthquakes with varied focal mechanisms (e.g., normal or thrust faults with dip <90° and strike-slip faults with dip = 90°).

In essence, our FEM-based numerical models show from a mechanical standpoint that irregular accumulation of stresses can appear in small fault networks when the stiffness of large faults declines—a circumstance expected as failure approaches^52,53. Based on these findings, we conjecture that the emergence of anomalous low-magnitude seismicity before major earthquakes, as observed prior to the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake, could be ultimately driven by the reduction in stiffness of major faults rather than by gradual regional loading. The parametric analysis of our numerical simulations also reveals that the differential buildup of stresses in nearby minor faults becomes more significant when very low stiffness is applied to the major fault (\(\lesssim\)0.1 GPa), implying that abnormal, precursory low-magnitude seismicity can only arise when fault segments prone to major earthquakes are highly fractured and/or contain permeable cores (such as breccias and cataclastic rocks) susceptible to elevated pore fluid pressure^57,58. It is worth noting that other mechanisms might also cause uneven stress accumulations in a fault network, such as dissimilar fluid transport in faults of the network; transitions from distributed deformation to progressive shear localization; stress transfer between faults; and fault friction, cohesion, strength, and weaknesses³². However, these reflect local processes and properties, and do not allow establishing an obvious cause-effect relationship between abnormal low-magnitude seismicity and the conditions leading up to the failure of a large fault segment. In other words, it remains uncertain that the aforementioned local processes and properties would translate into a regional phenomenon driven by the precursory conditions of a large fault segment that is close to failure and produce a major earthquake. In the future, geodetic data might provide further insights into the regional, dissimilar accumulation of stresses driven by decreasing stiffness in large faults, potentially enhancing our ability to anticipate large-magnitude earthquakes weeks to months in advance³⁸.

Forecasting potential of our machine learning-based approach

Earthquake forecasting involves anticipating the timing, magnitude, and/or epicenter of major earthquakes and/or aftershocks^35,39,40,59. Here, we have delved into this topic by devising a method capable of uncovering potential, hidden precursory patterns concealed within earthquake catalogs, with our primary focus being on the timing aspect. However, it is worth highlighting that our methodology also provides valuable insights into refining the size and location of major earthquakes; this contrasts with prior studies, which often concentrated on providing forecasts of one of the aforementioned observables only^35,39,40.

Specifically, our machine learning-based analysis reveals that the preparatory phase of the Ridgecrest and Anchorage earthquakes shared temporal similarities with previous seismic events in Southern California and Southcentral Alaska, allowing for the identification of precursory activity manifesting over timescales spanning from weeks to months. This timescale aligns with the preparatory phase reported for five large strike-slip earthquakes in Southern California since 1960, inferred from the combination of renormalized patterns related to increasing low-magnitude seismic activity and earthquake correlation range³⁰. Our results, however, contrast to longer and shorter lead times reported in other studies on Southern California and elsewhere. For instance, Keilis-Borok et al.⁶⁰ documented low-magnitude seismicity changes occurring 2–3 years before strong earthquakes across various tectonic settings, suggesting a more protracted preparatory phase. Interestingly, Ben-Zion and Zaliapin³² examined localization processes of low-magnitude seismicity and found evidence of rock damage accumulation around rupture zones for the 2019 Ridgecrest sequence and other large earthquakes in Southern and Baja California during the preceding decade. This was followed by seismic localization and event clustering around 2–3 years and 1–2 years, respectively, before the main shock. Huang et al.⁶¹ found a foreshock sequence around 30 min before the initiation of the 2019 Ridgecrest sequence. These previous studies, combined with our findings, highlight a multifaceted process involving the accumulation and release of tectonic stresses over different timescales.

In addition, while our methodology considers magnitude as an input parameter, the inherent structure of our algorithm yields the probability of earthquakes surpassing a predefined magnitude threshold, which can be adjusted to suit specific requirements. This serves as the foundation for calculating probabilities of earthquakes of specific magnitude or greater. Moreover, although the precursory anomalies identified by our method hinder precise pinpointing of exact earthquake locations, we have noted that the epicenters of the two case studies fall within well-defined unrest areas spanning spatial scales ranging from tens to hundreds of kilometers. Indeed, this adheres to the well-established approach of examining seismic precursors within large regions, encompassing several hundred kilometers, to account for potential long-range correlations⁶²; and aligns with prior research suggesting that the preparatory phase of major earthquakes involves broad-scale processes affecting fault networks, resulting in non-local seismic indicators^30,62,63. In any case, while unrest areas are broad, our results suggest that they notably restrict the potential location of major events in the target regions. For instance, before the Ridgecrest sequence, there were no signs of potential large-magnitude earthquakes in other highly seismic zones, such as the San Francisco Bay Area, but the precursory anomaly was confined to locations surrounding the sequence’s epicenters. Similarly, our method did not reveal any hints of potential large-magnitude earthquakes in the western fault zones of Southcentral Alaska before the occurrence of the Anchorage event. All these findings together indicate that our approach shows potential for improving seismic monitoring and surveillance by constraining the timing (over temporal spans from days to months), magnitude (surpassing predefined thresholds), and location (across spatial scales from tens to hundreds of kilometers) of forthcoming major earthquakes.

A major significance of our method lies in its potential to be integrated into operational earthquake monitoring systems and forecasting schemes⁶⁴, enabling the swift detection of tectonic unrest. This, in turn, could facilitate the rapid deployment of additional instruments to advance our understanding of the short-term preparatory phase of major earthquakes⁶⁵. The early identification of tectonic unrest also holds the promise of bolstering societal preparedness for the potential occurrence of a major earthquake in a specific region, aiding in the assessment of alert levels. This aligns with decision-making strategies that focus on acquiring reliable signals indicating an increasing probability of an impending disaster, rather than assigning specific probability values⁶⁶. Similar alert level systems are employed in volcano observatories to communicate volcanic activity^26,67.

While our algorithm has been tested on two specific regions, it has the potential for broader applicability. It can be readily applied to other regions globally, provided robust, highly complete, and long-lasting (~decades) earthquake catalogs become available. However, it is essential to emphasize that our algorithm should not be employed in new regions without training the random forest models with the historical seismicity of that specific area. Moreover, to maintain consistency and robustness, our algorithm should be periodically updated, involving re-training each time a major earthquake occurs in the region where it is being implemented. For instance, we recommend incorporating the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake in the training process moving forward.

As any other method, the algorithm presented in this paper also has limitations. In particular, it cannot assess the level of unrest in regions with minimal low-magnitude earthquake activity because the model statistical features cannot be reliably computed. In addition, our choice of Southern California and Southcentral Alaska for analysis was based on the availability of a sufficient number of earthquakes with magnitude M ≥6.4 over the past ~30 years (Supplementary Table 3). Finally, the success of our approach relies on the assumption that future seismicity will produce anomalies similar to those produced by the historical seismicity used for training. These limitations underscore the importance of ongoing research and earthquake forecasting adaptation as machine learning and monitoring techniques continue to evolve.

In summary, gaining insights into the multivariate temporal patterns of seismic activity leading up to major earthquakes is a vital stride in enhancing our capacity to anticipate devastating events. Our findings concerning the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake highlights the significance of identifying subtle, nonlinear anomaly trends in near-real time, and might serve as a foundation for crafting efficient operational forecasting algorithms⁶⁴. This, in turn, plays a pivotal role in strengthening earthquake preparedness and thus in reducing the risks in communities susceptible to major seismic threats.

Closing remarks

Although machine learning has been successfully applied to anticipate laboratory earthquakes^37,38, the application of these methods to real earthquakes is more challenging, and their predictive power has been proposed not to be necessarily better than simpler and more traditional approaches³⁴. However, this work shows that machine learning offers a lot of potential to classify low-magnitude seismicity, better identify precursory activity, and detect hidden patterns in the run-up to large-magnitude events. In this regard, we found that abnormal low-magnitude seismicity, or regional tectonic unrest, emerged in Southern California and Southcentral Alaska for weeks to months prior to the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake. This is identified with a new multivariate, random forest-based classification algorithm that detects when the statistical properties of low-magnitude earthquakes deviate from regular patterns. These deviations are challenging to identify by visual inspection of single variables, but they can be recognized with a nonlinear approach (random forest) that considers the complex interplay between the different model variables. Nonlinear classification algorithms trained from previous experience are more efficient and versatile in detecting anomalous patterns of low-magnitude seismicity than classical methods consisting of tracking single variables (e.g., acceleration of seismicity rates⁴⁵), whose temporal evolution may differ a lot from one case to another and thus results are difficult to generalize.

Our analysis provides observational evidence of the regional interaction between different fault zones during stress-loading to failure^68,69; supports that different large-magnitude earthquakes have similar preparatory phases, at least for the two regions explored; and is consistent with previous laboratory experiments showing abnormal, large-scale signals that emerge over vast areas before failure^37,38. Furthermore, our analysis and mechanistic finite element solid mechanics models support the long-held hypothesis that abnormal low-magnitude seismicity should appear over vast areas before large-magnitude earthquakes because largely locked fault sections cause alterations in the regional stress field^33,45. In particular, our finite element solid mechanics models suggest that precursory variations in the regional stress field may occur because fault segments behave as elastic inclusions with dissimilar mechanical properties from those of the surrounding host crust, thus triggering an uneven response to the processes leading to major earthquakes and the accumulation of stresses both at local and regional scales⁵⁸.

Finally, it is worth highlighting that the machine learning-based approach presented in this paper only requires information that is currently being archived routinely in earthquake catalogs; could help to better understand the dynamics of fault networks and identify variations in the regional stress field; and can be easily implemented by surveillance agencies to monitor low-magnitude seismicity in near-real time. Eventually, our approach could help to design earthquake alert level strategies based on the detection of regional tectonic unrest, and to improve the forecast of large-magnitude earthquakes from weeks to months in advance in Southern California, Southcentral Alaska, and potentially elsewhere.

Methods

Machine learning-based model

Our machine learning-based algorithm is trained to determine whether the 2019 Ridgecrest seismic sequence and the 2018 Anchorage earthquake were preceded by anomalous low-magnitude seismicity (defined here as earthquakes with magnitude between \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}}=1\) and \({{{{\rm{M}}}}}_{{{{\rm{Max}}}}}=6\)). Training is performed with the low-magnitude seismicity that preceded other large seismic events (defined here as earthquakes with magnitude \({{{\rm{M}}}}\ge 6.4\), i.e., the magnitude of the first event of the Ridgecrest sequence) that occurred in Southern California and Southcentral Alaska between January 1, 1989, and December 31, 2012. Our algorithm approach is based on two hypotheses: (i) The large-magnitude earthquakes used for training were preceded by tectonic unrest occurring for some weeks at least in the epicentral area. Hence, if the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake were preceded by analog tectonic unrest, this should emerge spontaneously when running our model on unseen data (from January 1, 2013, onwards). (ii) Anomalous tectonic unrest and quiescence can be recognized with a random forest classification model when monitoring the scattering or variability of different features of the low-magnitude seismicity, including the inter-event time, depth, magnitude, and latitude and longitude of the epicenters. Note that we utilize the standard deviation as a metric to represent the variability of the features, but it has no statistical meaning in this context. The analysis we undertake in this study consists of five main steps (data retrieval, algorithm setup, calculation of statistical features, labeling and training of the algorithm, and testing of the algorithm on unseen data) that are detailed in the next subsections.

Data retrieval

We retrieved data from the publicly available earthquake catalog of the United States Geological Survey (USGS) (https://earthquake.usgs.gov/earthquakes/search/). In particular, we downloaded all the information relative to the \({{{\rm{M}}}}\ge 1.0\) earthquakes (on any magnitude scale) that occurred between January 1, 1989, and December 31, 2020, in Southern California (in the area delimited by [32.5°, 38°] latitude and [−124°, −112°] longitude) and in Southcentral Alaska (in the area delimited by [56°, 65°] latitude and [−159°, −143°] longitude). It is important to clarify that data between January 1, 1989, and December 31, 2012, were only used to train the algorithms. Also, we only retrieved \({{{\rm{M}}}}\ge 1\) earthquakes to maximize the homogeneity and completeness of the catalog explored (e.g., by minimizing any possible bias produced by the increasing number of earthquakes detected with time due to the deployment of new seismic networks and better detection limits of modern seismic instruments). Finally, we extracted from the downloaded catalogs the date of occurrence, depth, and assigned magnitude of the earthquakes, as well as the latitude and longitude of their epicenters.

Algorithm setup

We set up our training algorithm by selecting a random distribution of N nodes with coordinates \(({{{\rm{i}}}},{{{\rm{j}}}},{{{\rm{t}}}})\) within the spatiotemporal windows of analysis, where \({{{\rm{i}}}},\, {{{\rm{j}}}},\) and \({{{\rm{t}}}}\) indicate latitude, longitude, and time, respectively. We also selected the nodes \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\), where \({{{{\rm{i}}}}}_{{{{\rm{LME}}}}}\) and \({{{{\rm{j}}}}}_{{{{\rm{LME}}}}}\) are the geographical coordinates of the epicenter of the large-magnitude earthquakes (i.e., earthquakes with magnitude above a given threshold; \({{{\rm{M}}}}\ge {{{{\rm{M}}}}}_{{{{\rm{LME}}}}}\)), and \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}\) is their date of occurrence. For each node (\(({{{\rm{i}}}},\, {{{\rm{j}}}},\, {{{\rm{t}}}})\) and \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\)), we designed circular regions of radius \({{{\rm{R}}}}\) centered on the spatial coordinates (\(({{{\rm{i}}}},\, {{{\rm{j}}}})\) and \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}})\)) and intervals of interest consisting of 1-year backward temporal windows sliding from \({{{\rm{t}}}}-{{{\rm{T}}}}\) and \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}-{{{\rm{T}}}}\) years to \({{{\rm{t}}}}\) and \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}\), respectively, at time steps of \(1\) day (Supplementary Fig. 9). Note that a node \(({{{\rm{i}}}},\, {{{\rm{j}}}},\, {{{\rm{t}}}})\) or \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\) is discarded if the circular region of interest does not contain low-magnitude seismicity. For this study, we chose \({{{{\rm{M}}}}}_{{{{\rm{LME}}}}}=6.4\) (the magnitude of the first main shock of the Ridgecrest sequence), \({{{\rm{T}}}}=2\) years, \({{{\rm{R}}}}=120\) km, and only \({{{\rm{N}}}}=50\) nodes to maintain a reduced computational cost, although we also performed a parametric exploration to analyze the influence of several of the parameters (Supplementary Tables 1 and 2). For training purposes, we used four earthquakes of the Southern California region (M7.3 1992 Landers, M6.7 1994 Northridge, M7.1 1999 Hector Mine, and M6.6 2003 San Simeon earthquakes), seven earthquakes from Southcentral Alaska (M7.0 1999, M6.4 1999, M6.5 2000, and M6.9 2001 Kodiak Island region; M6.7 2001 Southern Alaska; and M6.6 2002 and M7.9 2002 Central Alaska earthquakes), and up to 50 random nodes distributed over the areas of analysis in Southern California and Southcentral Alaska (Supplementary Table 3). This algorithm step was repeated 200 times to generate different ensembles of up to N = 50 nodes, aiming to assess how the selection of the random nodes affects machine learning predictions.

Calculation of statistical features

Using the earthquake catalog downloaded, we calculated five statistical features from the low-magnitude seismicity (defined here as earthquakes with magnitude in the range \({{{\rm{M}}}}={{{{\rm{M}}}}}_{{{{\rm{Min}}}}}-{{{{\rm{M}}}}}_{{{{\rm{Max}}}}}\)) that occurred within the circular regions and temporal sliding windows of the nodes (Supplementary Fig. 9). These statistical features are the standard deviation of the inter-event time (\({{{{\rm{\sigma }}}}}_{{{{\rm{IET}}}}}\)), or time between low-magnitude earthquakes; and the standard deviation of their depth (\({{{{\rm{\sigma }}}}}_{{{{\rm{D}}}}}\)), latitude (\({{{{\rm{\sigma }}}}}_{{{{\rm{LAT}}}}}\)), longitude (\({{{{\rm{\sigma }}}}}_{{{{\rm{LON}}}}}\)), and magnitude (\({{{{\rm{\sigma }}}}}_{{{{\rm{MAG}}}}}\)). This approach yields \({{{\rm{T}}}}\)-year time series of \({{{{\rm{\sigma }}}}}_{{{{\rm{IET}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{D}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LAT}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LON}}}}}\), and \({{{{\rm{\sigma }}}}}_{{{{\rm{MAG}}}}}\) that are assigned to each node (\(({{{\rm{i}}}},\, {{{\rm{j}}}},\, {{{\rm{t}}}})\) and \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\)) of the spatiotemporal grid (Supplementary Figs. 10 and 11). Finally, the T-year time series were standardized (i.e., mean-removed and divided by the standard deviation) to enhance the performance of machine learning-based classifications.

Labeling and training of the algorithm

We labeled the \({{{\rm{T}}}}\)-year time series in two classes: unrest (class 1) and quiescence (class 0). Unrest (class 1) was assigned to every time step of the \({{{\rm{T}}}}\)-year time series of the spatiotemporal nodes \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\) between \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}-{{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}\) days and \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}\), i.e., over the \({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}\) days preceding the large-magnitude earthquakes. In contrast, quiescence (class 0) was assigned to every time step of the time series of the nodes \(({{{\rm{i}}}},\, {{{\rm{j}}}},\, {{{\rm{t}}}})\) between \({{{\rm{t}}}}-{{{\rm{T}}}}\) years and \({{{\rm{t}}}}\), and to every time step of the time series of the nodes \(({{{{\rm{i}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{LME}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{LME}}}}})\) between \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}-{{{\rm{T}}}}\) years and \({{{{\rm{t}}}}}_{{{{\rm{LME}}}}}-{{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}\) days (Supplementary Figs. 10 and 11). This means that unrest was assigned to the epicenter of the large-magnitude earthquakes during the \({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}\) preceding days, and quiescence was assigned to the rest of spatiotemporal nodes. Analogue labeling strategies have been proposed for forecasting volcanic eruptions⁷⁰. Then, we trained 200 random forest (RF) models, one for each of the ensembles of the aforementioned random nodes, to classify the time steps of the \({{{\rm{T}}}}\)-year time series as unrest (class 1) or quiescence (class 0) from the statistical features (\({{{{\rm{\sigma }}}}}_{{{{\rm{IET}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{D}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LAT}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LON}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{MAG}}}}}\)). Each of our RF models contained 500 decision trees and was generated by randomly picking two features for each model iteration, whereas training and evaluation were performed through a tenfold cross-validation technique with three repeats. For example, using an unrest period of \({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}=30\) days and a maximum low-magnitude seismicity of \({{{{\rm{M}}}}}_{{{{\rm{MAX}}}}}=4\), we generated RF models for both Southern California and Southcentral Alaska that properly classified over 99.5% of the unrest and quiescence time steps of the training dataset. In particular, our models yield an increase from \({{{{\rm{P}}}}}_{{{{\rm{un}}}}} \,\approx \, 0\) to \({{{{\rm{P}}}}}_{{{{\rm{un}}}}} \, \approx \, 100\%\) within one month of the training events (Supplementary Fig. 12), where \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\) is the degree or probability of tectonic unrest (defined as the ratio of decision trees classifying the statistical features as unrest). Note that \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\) can be also interpreted as the probability of occurrence of a large-magnitude earthquake within the next \({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}\) days.

It is worth noting that, since we employ 1-year backward windows to generate 2-year time series of statistical features, the first data point of the time series in Supplementary Figs. 10 and 11 reflects seismic events occurring up to three years before the occurrence of each of the large-magnitude earthquakes (M ≥ 6.4) used for training. Our specific setup ensures that no events with magnitude M ≥ 6.4 occurred within the defined regions of interest (\({{{\rm{R}}}}=120\) km) during the three years preceding the large-magnitude earthquakes used for training. Expanding the training period would elevate the chances of encountering additional large-magnitude earthquakes during the timeframe labeled as quiescence, which might result in incorrect labeling (quiescence vs unrest) and subsequent inadequate training of our machine learning model. Labeling also requires special attention if enlarging the regions of interest and lowering the magnitude threshold for defining large-magnitude earthquakes.

Testing of the algorithm on unseen data

We run our machine learning models (trained using \({{{{\rm{T}}}}}_{{{{\rm{unrest}}}}}=30\) days) on unseen data, i.e., the low-magnitude seismicity (\({{{\rm{M}}}}={{{{\rm{M}}}}}_{{{{\rm{Min}}}}}-{{{{\rm{M}}}}}_{{{{\rm{Max}}}}}\), with \({{{{\rm{M}}}}}_{{{{\rm{Min}}}}}=1\) and \({{{{\rm{M}}}}}_{{{{\rm{Max}}}}}=6\)) occurred from January 1, 2013, onwards. To do this, we first calculated the T-year time series of statistical features (\({{{{\rm{\sigma }}}}}_{{{{\rm{IET}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{D}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LAT}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{LON}}}}}\), \({{{{\rm{\sigma }}}}}_{{{{\rm{MAG}}}}}\)) for the spatiotemporal coordinates \(({{{{\rm{i}}}}}_{{{{\rm{tar}}}}},\, {{{{\rm{j}}}}}_{{{{\rm{tar}}}}},\, {{{{\rm{t}}}}}_{{{{\rm{tar}}}}})\), where \({{{{\rm{i}}}}}_{{{{\rm{tar}}}}}\) and \({{{{\rm{j}}}}}_{{{{\rm{tar}}}}}\) are the geographical coordinates of the target location (e.g., epicenters of the first main shock of the 2019 Ridgecrest sequence and the 2018 Anchorage earthquake, or any other location) and \({{{{\rm{t}}}}}_{{{{\rm{tar}}}}}\) is the target prediction date (e.g., 365 days, 90 days, or 1 day before the occurrence of the earthquakes). The statistical features for each of the T-year time series were calculated following the same approach described above, and we also used the same hyperparameters utilized to train the RF models (\({{{\rm{T}}}}=2\) years, \({{{\rm{R}}}}=120\) km, 1-year backward sliding windows, and 1-day time steps). Finally, we run the 200 RF models ensemble over the standardized T-year time series of statistical features and retrieved the minimum, maximum, and mean values of \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\). To successfully mimic the appropriate use of our method during near-real-time monitoring, we saved only the prediction of \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\) for the last time step of the T-year time series (i.e., one day before the target prediction date, \({{{{\rm{t}}}}}_{{{{\rm{tar}}}}}\)), and repeated the process from \({{{{\rm{t}}}}}_{{{{\rm{tar}}}}}=1000\) days before to \({{{{\rm{t}}}}}_{{{{\rm{tar}}}}}=\,\)400 days after the Ridgecrest sequence and the Anchorage earthquake. This approach also ensures that the temporal evolution of \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\) (Figs. 2–5) does not contain artifacts produced by the standardization of the T-year time series, and corresponds to the day-to-day prediction of \({{{{\rm{P}}}}}_{{{{\rm{un}}}}}\) at the target locations.

Finite element solid mechanics model

Our 2D finite element solid mechanics models aim to explore regional stress changes occurring in the run-up to large-magnitude earthquakes. We built these models with COMSOL Multiphysics (v6.2), solving the stationary momentum equation and applying the elastic inclusion fault modeling approach⁵⁸.

In particular, we designed a 2000 × 2000 km domain and imported fifteen small fault segments with 40–97 km length and one large fault (LF) segment with 850 km length as rectangular features. Furthermore, we designed two settings (different geometrical configurations) in 2D to investigate whether the orientation (strike) and geometry of the small fault segments (crosscuts or misaligned/nonplanar faults) affect the stress distribution and accumulation produced by the large fault segment. That said, we modeled mainly two scenarios: (1) progressive loading (horizontal compression), and (2) constant loading with a progressive decrease of the large fault stiffness. In all runs the top and the bottom of the domain are free surfaces, and we calculated the Von Mises stress at the fault segments. Finally, these models were further explored in 3D to investigate how the strike and dip of the small faults influence the accumulation of stresses in the domain (Supplementary Figs. 7 and 8).

We assigned material properties to the host rock and the faults based on the common crustal values used in the literature, which reflect a linear elastic behavior. The host rock was modeled as a stiff homogenized domain with Young’s modulus of 30 GPa, whereas the LF was given values from 10 GPa (stiff) to 0.01 GPa (soft) to mimic the progression from a less active fault segment to a more active one. Soft values represent aseismic creep or locked conditions of the fault segment, whereas stiff values reflect the conditions of a not recently active fault in the region⁶⁷. The stiffness of the small faults is initally defined as random values between 0.03 to 0.92 GPa, and this value is kept constant for all the simulations. The host rock and all the faults have Poisson’s ratios equal to 0.25 and density values of 2300 kg/m³. Regarding boundary conditions, we impose a compressional stress regime between 10 MPa and 20 MPa, which mimics the realistic regional stress conditions for Southern California and Southcentral Alaska^54,55.

Data availability

The data used in this study are available in the earthquake catalog database of the United States Geological Survey (USGS) [https://earthquake.usgs.gov/earthquakes/search/]. The specific dataset used for training and testing can be downloaded from ref. ⁷¹ https://doi.org/10.5281/zenodo.13212238.

Code availability

The R script used for data processing and analysis; the Matlab script used to create Figs. 3, 4, and Supplementary Movies 1 and 2; the machine learning-based models generated; the specific dataset used for training and testing; and a description of the files generated can be downloaded from ref. ⁷¹ https://doi.org/10.5281/zenodo.13212238.

References

Turcotte, D. L. Earthquake prediction. Annu. Rev. Earth Planet. Sci. https://doi.org/10.1146/annurev.ea.19.050191.001403 (1991).
Rundle, J. et al. The complex dynamics of earthquake fault systems: new approaches to forecasting and nowcasting of earthquakes. Rep. Prog. Phys. https://doi.org/10.1088/1361-6633/abf893 (2021).
Pei, W., Zhou, S., Zhuang, J., Xiong, Z. & Piao, J. Application and discussion of statistical seismology in probabilistic seismic hazard assessment studies. Sci. China Earth Sci. 65, 1–12 (2021).
Kanamori, H. Earthquake prediction: an overview. In International Handbook of Earthquake & Engineering Seismology (eds Lee, W. H. K. et al.) 1205–1216 (Academic Press, Amsterdam, 2003).
Mogi, K. Earthquake Prediction (Academic Press, 1985).
Lomnitz, C. Fundamentals of Earthquake Prediction (John Wiley, 1994).
Ouzounov, D., Pulinets, S., Hattori, K. & Taylor, P. Pre-earthquake Processes: A Multidisciplinary Approach to Earthquake Prediction Studies Vol. 234 (John Wiley & Sons, 2018).
Conti, L., Picozza, P. & Sotgiu, A. A critical review of ground based observations of earthquake precursors. Front. Earth Sci. 9, 676766 (2021).
Article Google Scholar
Tramutoli, V. et al. Assessing the potential of thermal infrared satellite surveys for monitoring seismically active areas: The case of Kocaeli (Izmit) earthquake, August 17, 1999. Remote Sens. Environ. 96, 409–426 (2005).
Article ADS Google Scholar
Pulinets, S. A., Ouzounov, D., Karelin, A. V., Boyarchuk, K. A. & Pokhmelnykh, L. A. The physical nature of thermal anomalies observed before strong earthquakes. Phys. Chem. Earth https://doi.org/10.1016/j.pce.2006.02.042 (2006).
Lu, X. et al. Thermal infrared anomalies associated with multi-year earthquakes in the Tibet region based on China’s FY-2E satellite data. Adv. Space Res. 58, 989–1001 (2016).
Article ADS Google Scholar
Dolgikh, G. I. et al. Deformation and acoustic precursors of earthquakes. Dokl. Earth Sci. 413, 281–285 (2007).
Hayakawa, M. et al. On the precursory signature of Kobe earthquake on VLF subionospheric signals. In IEEE International Symposium on Electromagnetic Compatibility. (Institute of Electrical Engineers INC, 1997).
Koizumi, N. et al. Preseismic changes in groundwater level and volumetric strain associated with earthquake swarms off the east coast of the Izu Peninsula, Japan. Geophys. Res. Lett. 26, 3509–3512 (1999).
Article ADS Google Scholar
Khilyuk, L. F., Robertson Jr, J. O., Endres, B. & Chilingarian, G. V. Gas Migration: Events Preceding Earthquakes (Elsevier, 2000).
Hwa Oh, Y. & Kim, G. A radon-thoron isotope pair as a reliable earthquake precursor. Sci. Rep. https://doi.org/10.1038/srep13084 (2015).
Pulinets, S. A. & Boyarchuk, K. A. Ionospheric Precursors of Earthquakes (Springer, Berlin, 2004).
Uyeda, S., Nagao, T. & Kamogawa, M. Short-term earthquake prediction: current status of seismo-electromagnetics. Tectonophysics 470, 205–213 (2009).
Article ADS Google Scholar
Sorokin, V. M., Chmyrev, V. M. & Hayakawa, M. A review on electrodynamic influence of atmospheric processes to the ionosphere. Open J. Earthq. Res. 9, 113–141 (2020).
Article Google Scholar
Sun, Y., Niu, F., Liu, H., Chen, Y. & Liu, J. Crustal structure and deformation of the SE Tibetan plateau revealed by receiver function data. Earth Planet. Sci. Lett. 349, 186–197 (2012).
Article ADS Google Scholar
Ito, Y. et al. Episodic slow slip events in the Japan subduction zone before the 2011 Tohoku-Oki earthquake. Tectonophysics 600, 14–2 (2013).
Article ADS Google Scholar
Ruiz, S. et al. Intense foreshocks and a slow slip event preceded the 2014 Iquique Mw 8.1 earthquake. Science 345, 1165–1169 (2014).
Article ADS CAS PubMed Google Scholar
Wikelski, M. et al. Potential short‐term earthquake forecasting by farm animal monitoring. Ethology 9, 931–941 (2020).
Article Google Scholar
Geller, R. J., Jackson, D. D., Kagan, Y. Y. & Mulargia, F. Earthquakes cannot be predicted. Science 5306, 1616–1616 (1997).
Article Google Scholar
Zöller, G., Hainzl, S., Tilmann, F., Woith, H. & Dahm, T. Comment on “Potential short‐term earthquake forecasting by farm animal monitoring” by Wikelski, Mueller, Scocco, Catorci, Desinov, Belyaev, Keim, Pohlmeier, Fechteler, and Mai. Ethology 127, 302–306 (2021).
Article Google Scholar
Winson, A. E., Costa, F., Newhall, C. G. & Woo, G. An analysis of the issuance of volcanic alert levels during volcanic crises. J. Appl. Volcanol. 3, 1–12 (2014).
Article Google Scholar
Smith, W. D. The b-value as an earthquake precursor. Nature 289.5794, 136–139 (1981).
El-Isa, Z. H. & Eaton, D. W. Spatiotemporal variations in the b-value of earthquake magnitude–frequency distributions: classification and causes. Tectonophysics 615, 1–11 (2014).
Article ADS Google Scholar
Peng, C. et al. Performance evaluation of an earthquake early warning system in the 2019–2020 M 6.0 Changning, Sichuan, China, Seismic Sequence. Front. Earth Sci. 9, 699941 (2021).
Article Google Scholar
Keilis-Borok, V. I., Shebalin, P. N. & Zaliapin, I. V. Premonitory patterns of seismicity months before a large earthquake: five case histories in Southern California. Proc. Natl. Acad. Sci. USA https://doi.org/10.1073/pnas.202617199 (2002).
Marzochhi, W., Taroni, M. & Falcone, G. Earthquake forecasting during the complex Amatrice-Norcia seismic sequence. Sci. Adv. https://doi.org/10.1126/sciadv.1701239 (2017).
Ben-Zion, Y. & Zaliapin, I. Localization and coalescence of seismicity before large earthquakes. Geophys. J. Int. 223, 561–583 (2020).
Article ADS Google Scholar
Jaumé, S. & Sykes, L. Evolving towards a critical point: A review of accelerating seismic moment/energy release prior to large and great earthquakes. Pure appl. geophys. 155, 279–305 (1999).
Mignan, A. & Broccardo, M. Neural network applications in earthquake prediction (1994–2019): meta‐Analytic and statistical insights on their limitations. Seismol. Res. Lett. https://doi.org/10.1785/0220200021 (2020).
Asencio–Cortés, G., Morales–Esteban, A., Shang, X. & Martínez–Álvarez, F. Earthquake prediction in California using regression algorithms and cloud-based big data infrastructure. Comput. Geosci. https://doi.org/10.1016/j.cageo.2017.10.011 (2018).
Bergen, K. J., Johnson, P. A., de Hoop, M. V. & Beroza, G. C. Machine learning for data-driven discovery in solid earth geoscience. Science https://doi.org/10.1126/science.aau0323 (2019).
Rouet-Leduc, B. et al. Machine Learning Predicts Laboratory Earthquakes. Geophys. Res. Lett. https://doi.org/10.1002/2017GL074677 (2017).
Corbi, F. et al. Machine learning can predict the timing and size of analog earthquakes. Geophys. Res. Lett. https://doi.org/10.1029/2018GL081251 (2019).
Panakkat, A. & Adeli, H. Neural network models for earthquake magnitude prediction using multiple seismicity indicators. Int. J. Neural. Syst. https://doi.org/10.1142/S0129065707000890 (2007).
DeVries, P. M. R., Viégas, F., Wattenberg, M. & Meade, B. J. Deep learning of aftershock patterns following large earthquakes. Nature https://doi.org/10.1038/s41586-018-0438-y (2018).
Barnhart, W. D., Hayes, G. P. & Gold, R. D. The July 2019 Ridgecrest, California, earthquake sequence: Kinematics of slip and stressing in cross‐fault ruptures. Geophys. Res. Lett. https://doi.org/10.1029/2019GL084741 (2019).
West, M. E. et al. The 30 November 2018 Mw 7.1 Anchorage Earthquake. Seismol. Res. Lett. 91, 66–84 (2019).
Article Google Scholar
Zhuang, J., Ogata, Y. & Wang, T. Data completeness of the Kumamoto earthquake sequence in the JMA catalog and its influence on the estimation of the ETAS parameters. Earth Planets Space 69, 36 (2017).
Article ADS Google Scholar
Tape, C. et al. Earthquake nucleation and fault slip complexity in the lower crust of central Alaska. Nat. Geosci. 11, 536–541 (2018).
Article ADS CAS Google Scholar
Bowman, D. D. & King, G. C. P. Accelerating seismicity and stress accumulation before large earthquakes. Geophys. Res. Lett. https://doi.org/10.1029/2001GL013022 (2001).
Rowe, C. D. & Griffith, W. A. Do faults preserve a record of seismic slip: a second opinion. J. Struct. Geol. 78, 1–26 (2015).
Article ADS Google Scholar
Gudmundsson, A. Volcanotectonics: Understanding the Structure, Deformation and Dynamics of Volcanoes (Cambridge University Press, 2020).
Caine, J. S., Evans, J. P. & Forster, C. B. Fault zone architecture and permeability structure. Geology 11, 1025–1028 (1996).
Article Google Scholar
Dieterich, J. H. Preseismic fault slip and earthquake prediction. J. Geophys. Res. Solid Earth 83, 3940–3948 (1978).
Article Google Scholar
Sibson, R. H. Structural permeability of fluid-driven fault-fracture meshes. J. Struct. Geol. 18, 1031–1042 (1996).
Article ADS Google Scholar
Harris, R. A. Introduction to special section: stress triggers, stress shadows, and implications for seismic hazard. J. Geophys. Res. Solid Earth 103, 24347–24358 (1998).
Article Google Scholar
Yoshioka, N. & Iwasa, K. A laboratory experiment to monitor the contact state of a fault by transmission waves. Tectonophysics 413, 221–238 (2006).
Article ADS Google Scholar
Warren-Smith, E. et al. Episodic stress and fluid pressure cycling in subducting oceanic crust during slow slip. Nat. Geosci. 12, 475–481 (2019).
Article ADS CAS Google Scholar
Zoback, M. D. et al. New evidence on the state of stress of the San Andreas fault system. Science 238, 1105–1111 (1987).
Article ADS CAS PubMed Google Scholar
Lund Snee, J. E. & Zoback, M. D. Multiscale variations of the crustal stress field throughout North America. Nat. Commun. 11, 1951 (2020).
Article ADS CAS Google Scholar
Fialko, Y. Interseismic strain accumulation and the earthquake potential on the southern San Andreas fault system. Nature 441, 968–971 (2006).
Article ADS CAS PubMed Google Scholar
McKenzie, D. & Brune, J. N. Melting on fault planes during large earthquakes. Geophys. J. Int. 29, 65–78 (1972).
Article ADS Google Scholar
Drymoni, K., Browning, J. & Gudmundsson, A. Volcanotectonic interactions between inclined sheets, dykes, and faults at the Santorini Volcano, Greece. J. Volcanol. Geothermal Res. 416, 107294 (2021).
Field, E. H. & Milner, K. R. Candidate products for operational earthquake forecasting illustrated using the HayWired planning scenario, including one very quick (and not‐so‐dirty) hazard‐map option. Seismol. Res. Lett. 89, 1420–1434 (2018).
Article Google Scholar
Keilis-Borok, V. I. & Rotwain, I. M. Diagnosis of time of increased probability of strong earthquakes in different regions of the world: algorithm CN. Phys. Earth Planet. Inter. 61, 57–72 (1990).
Article ADS Google Scholar
Huang, H., Meng, L., Bürgmann, R., Wang, W. & Wang, K. Spatio-temporal foreshock evolution of the 2019 M 6.4 and M 7.1 Ridgecrest, California earthquakes. Earth Planet. Sci. Lett. 551, 116582 (2020).
Article CAS Google Scholar
Peresan, A., Kossobokov, V., Romashkova, L. & Panza, G. F. Intermediate-term middle-range earthquake predictions in Italy: a review. Earth Sci. Rev. 69, 97–132 (2005).
Article ADS Google Scholar
Dobrovolsky, I. P., Zubkov, S. I. & Miachkin, V. I. Estimation of the size of earthquake preparation zones. Pure Appl. Geophys. 117, 1025–1044 (1979).
Article ADS Google Scholar
Peresan, A., Kossobokov, V. G. & Panza, G. F. Operational earthquake forecast/prediction. Rend. Lincei 23, 131–138 (2012).
Article Google Scholar
Sugan, M., Kato, A., Miyake, H., Nakagawa, S. & Vuan, A. The preparatory phase of the 2009 Mw 6.3 L’Aquila earthquake by improving the detection capability of low‐magnitude foreshocks. Geophys. Res. Lett. 41, 6137–6144 (2014).
Article ADS Google Scholar
Iturrieta, P. et al. Evaluation of a Decade‐Long Prospective Earthquake Forecasting Experiment in Italy. Seismol. Res. Lett. https://doi.org/10.1785/0220230247 (2024).
Cameron, C. E. et al. Alaska volcano observatory alert and forecasting timeliness: 1989–2017. Front. Earth Sci. 6, 86 (2018).
Article ADS Google Scholar
Rundle, P. B. et al. Nonlinear network dynamics on earthquake fault systems. Phys. Rev. Lett. https://doi.org/10.1103/PhysRevLett.87.148501 (2001).
Dolan, J. F., Bowman, D. D. & Sammis, C. G. Long range and long-term fault interactions in Southern California. Geology https://doi.org/10.1130/G23789A.1 (2007).
Dempsey, D. E., Cronin, S. J., Mei, S. & Kempa-Liehr, A. W. Automatic precursor recognition and real-time forecasting of sudden explosive volcanic eruptions at Whakaari, New Zealand. Nat. Commun. 11, 3562 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Girona, T. & Drymoni, K. Abnomal low-magnitude seismicity preceding large-magnitude earthquakes -- dataset, scripts, machine learning models, and results. (1.0). Zenodo https://doi.org/10.5281/zenodo.13212238 (2024).

Download references

Acknowledgements

Earthquake catalog courtesy of the U.S. Geological Survey. This research project and publication was supported by the University of Alaska Fairbanks Geophysical Institute. K.D. acknowledges financial support from the European Research Council (ERC) Consolidator Grant MODERATE (10100165). T.G. appreciates discussions on the matter of this paper with Vicente Girona Sánchez, María del Rosario Hernández Gil, Warner Marzocchi, and Patricia Martínez Garzón.

Author information

Authors and Affiliations

Geophysical Institute, University of Alaska Fairbanks, Fairbanks, AK, USA
Társilo Girona
Earth and Environmental Sciences, Ludwig-Maximilians-Universität in Munich, Munich, Germany
Kyriaki Drymoni

Authors

Társilo Girona
View author publications
You can also search for this author inPubMed Google Scholar
Kyriaki Drymoni
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

T.G. conceived the project and led the design and implementation of the machine learning-based algorithm. K.D. led the FEM numerical modeling. T.G. and K.D. wrote the manuscript, designed figures, and discussed the models, results, and conclusions.

Corresponding author

Correspondence to Társilo Girona.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Miao Zhang and the other, anonymous, reviewer for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Movie 1

Supplementary Movie 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Girona, T., Drymoni, K. Abnormal low-magnitude seismicity preceding large-magnitude earthquakes. Nat Commun 15, 7429 (2024). https://doi.org/10.1038/s41467-024-51596-z

Download citation

Received: 03 October 2023
Accepted: 13 August 2024
Published: 28 August 2024
DOI: https://doi.org/10.1038/s41467-024-51596-z

This article is cited by

A Deep-Learning P-Wave Arrival Picker for Laboratory Acoustic Emissions: Model Training and Its Performance
- Tian Yang Guo
- Tiziana Vanorio
- Jihui Ding
Rock Mechanics and Rock Engineering (2025)