Introduction

The Gutenberg-Richter law1 is an empirical relation that describes the distribution of the magnitudes of seismic events:

$${{Log}}_{10}\left(N\right)=a-{bM}$$
(1)

where \(N\) is the number of events with magnitude \(\ge M\), and \(a\) and \(b\) are the parameters controlling the distribution. This relation corresponds to an exponential distribution with a single parameter, the b-value2. Earthquake kinematics and differential stress influence the b-value3,4,5. The b-value is a fundamental parameter for seismic hazard studies, particularly for probabilistic seismic hazard analyses, since it rules the percentage between small and large events6. Therefore, it is of primary importance to understand its variability across different portions of the Earth’s crust and its correlations with other geophysical quantities. Studies on regional catalogs correlate the b-value with the earthquakes’ depth: the shallower the event, larger the b-value7, apart from peculiar anomalies of high b-values in some subduction zones8. Regarding global catalogs, several studies have reported a b-value around 1.0 for regions undergoing strike-slip regime, b-value > 1 in extensional settings, and b-value < 1 in compressional ones9,10,11.

For assessing the relationship of the earthquake size distribution with the kinematics, two different strategies are generally used to quantify the b-value variations: a) analyzing the events according to the focal mechanisms; b) proceeding to a definition of a seismotectonic zonation and then analyzing the earthquakes of each zone. Both selection methods can potentially cause a mixture of geodynamics settings or faulting/tectonic styles. In the first case, a simplistic choice driven by focal mechanisms defining a normal-faulting earthquake class may mix two contrasting geodynamic settings: for example, seafloor spreading and back-arc extension. In the former, a new basaltic crust creates with the asthenosphere upwelling at the oceanic ridges in divergent plate boundaries. The extension in back-arc regions leads to the stretching and thinning of the continental or oceanic crust, forming basins filled with sediment and volcanic deposits. Notably, back-arc extension occurs near convergent boundaries behind the volcanic arc. Similarly, the thrust-faulting earthquake class mixes subduction-related and continental-convergent earthquakes. Though convergent settings characterize both, these geodynamic processes maintain substantial differences in fluid circulation, thermal state, and volcanic activity. These processes are expected to influence the b-value5.

In the second case, earthquakes are grouped using kinematic information from a regional seismotectonic model10,12. In this case, the accuracy of the b-values determination is mainly related to the correctness of the seismotectonic model. For example, a rough definition of a seafloor spreading seismotectonic zone overpopulates the extensional-earthquake group with the transform-fault earthquakes, resulting in a mixed faulting mechanism style, which is expected to influence the b-value. Consequently, the primary caution regarding this second approach is that there must be a reproducible and transparent definition of both the geographic boundaries of each seismogenic zone and its expected kinematics.

For this aim, geodetic measurements provide an unmatchable, still underrated source of information to be used for an alternative selection method in the b-value estimation. The geodetic strain rates capture multiple rheological behaviors of the lithosphere and asthenosphere, spanning from the interseismic elastic deformation of rock volumes bordering active faults13 to frictional sliding and viscoelastic relaxation induced by a strong earthquake14 or deglaciation15. However, numerical models tuned on interseismic geodetic horizontal velocities and stress orientation data records have proved to adequately describe the long-term dynamics16,17,18 and kinematics at the local scale19,20. In this sense, crustal deformation measurements do not need any subjective choice for kinematics descriptions: the elastic rebound theory suggests that crustal volumes experience deformation in the interseismic phase, suddenly released as coseismic slip when frictional forces can no longer prevent the fault from slipping. Thus, the tectonic regime shown by geodetic strain rates can approximate the tectonic regime of earthquake strains and can be used to determine their kinematics quantified by the ratio of the two principal components of the geodetic horizontal strain rate tensor (see Methods). Furthermore, geodetic-derived strain rates are largely independent of the information in the seismic catalog and allow to delimit crustal volumes undergoing extension or compression, increasing the overall reliability of possible differences in the b-values21.

We propose an approach based on the 2D strain-rate tensor derived by the joint inversion of geodetic measurements and interpolated stress direction data (Fig. 1) to group earthquakes according to the interseismic deformation pattern of the volumes surrounding active faults. We test our approach for the Italian seismicity, representing an optimal target due to an adequate geodetic benchmark distribution and an excellent instrumental catalog. We intentionally delimited the southern boundary of our model to the parallel N39.6°, because geodetic data in the Calabria region are influenced by two conflicting processes: the extension of the forearc and the compression along the subduction plane. In this case, the 2D strain rate tensor cannot be used to determine the long-term one without inserting the active faults and their rates19.

Fig. 1: Geodetic and stress orientation data.
figure 1

a Selected horizontal velocities from Serpelloni et al.22. b: Interpolated SHmax direction with the scheme of Carafa and Barba53 and Carafa et al.54 applied to the Mariucci and Montone23 dataset.

Results and discussion

Calculation of compressional and extensional zones

The interseismic strain rate tensor has been obtained by inverting geodetic horizontal velocities22 and SHmax orientations23. Notably, the 2D strain rate tensor, with \({\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{1h}}}}}}}\) and \({\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{2h}}}}}}}\) being its principal horizontal values, allows us to determine any interseismic kinematics of the upper crust as long as data covers the study area, which is not the case for the Adriatic Sea region(see Methods). Consequently, the b-value variability of the strike-slip regimes (possibly including deep and/or offshore earthquakes24) can not be further investigated in our study area. These (excluded) events are probably due to bending in the dipping Adriatic slab, and, therefore, might have different physics and/or more complex stress regimes than shallow earthquakes. For the same reasons, we fixed the seismicity depth cutoff to 15 km to exclude deeper earthquakes with mixed regimes occurring onshore on the eastern side of the Apennines chain25. The earthquake catalog used to estimate the b-values is the 2005–2022 instrumental catalog (with homogenized moment magnitudes26).

We defined the compressional and extensional areas considering adequate coverage of geodetic and stress data records; then, we drew the borders at \(|{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{1h}}}}}}}|\approx 6\cdot {10}^{-9}{{{{{{\rm{yr}}}}}}}^{-1}\) (where \({\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{1h}}}}}}}+{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{2h}}}}}}} \, < \, 0\)) and \(|{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{2h}}}}}}}|\approx 12\cdot {10}^{-9}{{{{{{\rm{yr}}}}}}}^{-1}\) (where \({\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{1h}}}}}}}+{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{2h}}}}}}} \, > \, 0\)), respectively. The surface projections of both regions are shown in Fig. 2. Notably, the extensional region doesn’t contain any volcanic area, which is expected to exhibit higher b-values.

Fig. 2: Greatest horizontal strain rate absolute value.
figure 2

Map (split according to the expected regime, a compression, b extension) of the greatest absolute value of the horizontal strain rate.

Comparison of expected and observed kinematics

First, we inspected the literature and found several lines of evidence (paleotrenches, fault ribbon exposure rates, historical reports on past earthquakes’ coseismic effects) unambiguously confirming our results for both extensional or compressional kinematics of the uppermost and frictional parts of the crust as shown in Fig. 227,28. Then, we determined that the lateral variation of the strain rate tensor in Italy coherently reproduces the spatial distribution of earthquake kinematics reported for the past century in Mariucci and Montone23 (see Methods). Finally, we explored whether the seismic moment release in each area is governed by a few earthquakes with different kinematics than the expected one (according to the geodetic data). To this end, we determined the ratio of the seismic moment of earthquakes with correct kinematics to the total seismic moment for both compressional and extensional areas. Kinematics of each earthquake was inferred by looking at the rake angles of the nodal planes. Table 1 shows the results of these tests: the ratio is about 0.96 for the compressional area, which raises to 0.99 for the extensional one (see Methods for details). We repeated the test for different borders of both regions to reject the possibility that the subjective choice of the bounding strain-rate isoline is conditioning the final b-values (see Supplementary Material). In detail, we changed the strain-rate isoline to \(|{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{1h}}}}}}}|\approx 9\cdot {10}^{-9}{{{{{{\rm{yr}}}}}}}^{-1}\) for compression and \(|{\dot{{{{{{\rm{\varepsilon }}}}}}}}_{{{{{{\rm{2h}}}}}}}|\approx 16\cdot {10}^{-9}{{{{{{\rm{yr}}}}}}}^{-1}\) for extension without finding any appreciable difference in the seismic moment ratio (see Table 1).

Table 1 Expected vs observed kinematics.

All these tests confirm that our two zones correctly delimit extension and compression for Italy and, more in general, that the geodetically-derived definition of the expected kinematics of earthquakes is a reliable and useful approach.

The spatial pattern of expected kinematics is consistent with the seismotectonic zonation of Stucchi et al.29 (hereafter St11). In contrast, it differs from Gulia and Wiemer10 (hereafter GW10) for the central-southern Apennines, where the authors draw a large transcurrent region based on the moment tensor of a few deep and easterly-located strike-slip earthquakes (Fig. 3). However, extensive paleo-seismological and Quaternary geology studies30,31,32,33, macroseismic data of historical earthquakes (1805 Matese earthquake, Mw = 6.8, see Guidoboni et al.34 and references therein), and recent low35 to moderate earthquakes (1984 Villetta Barrea earthquake, Mw = 5.836,37) unambiguously indicate that the region upper crust is undergoing extension, as correctly depicted by data used in our work. This overall coherence with independent information suggests that the 2D-strain rate tensor derived from geodetic measurements should be incorporated into geophysical data to be considered for a reliable seismotectonic model.

Fig. 3: Comparison of expected kinematics among different zonations.
figure 3

The borders of the two zones determined in this work are shown in both pictures (COMP: compressional region; EXTN: extensional region). a The colors refer to the expected kinematics from GW1010; b, the kinematics from the zonation of St1131 are reported. Focal mechanisms for earthquakes Mw > 5.8 are shown23.

b-value estimation

For both compressional and extensional regions, the b-value and its uncertainty are estimated by adequately considering the magnitude of completeness and the possible short-term incompleteness after a main event in an earthquake sequence since it can strongly influence the resulting b-value38,39,40. In detail, we used different levels of completeness to also include the events in the early stage of aftershock sequences using a recent developed b-value estimation method41.

In Fig. 4, we show the results of the b-value estimations with different magnitudes of completeness thresholds for both the compressional and extensional regions. The statistical difference between the b-values estimated in the two crustal volumes is then tested using the Utsu test42. The b-value of the volume in extension is always significantly larger than in compression, confirming that the differential stress likely influences the earthquake size distribution. However, the b-values of both kinematic regimes are adjacent and defined in a relatively small range: 0.92–1.16, considering the bounds of the 95% confidence intervals; 0.98 ± 0.03 for compression (978 events from Mw 2.5), 1.13 ± 0.01 for extension (5077 events from Mw 2.5).

Fig. 4: B-value estimation and comparison.
figure 4

a Extensional (red) and compressional (blue) zones, with the epicenters of events used for the b-value estimation. b: b-values (solid lines) for the extensional (red) and compressional (blue) zones, along with their 95% confidence intervals (dashed lines) for different magnitudes of completeness, from 2.5 to 2.7 (these thresholds are increased by 1.0 during short-term aftershocks incompleteness periods, see Methods for details); lower part of the panel represents the p value of the Utsu test (solid black line) in a log10 scale, the dashed line represents the 0.05 p value threshold. c survival function of the magnitude frequency distributions in a log10 scale (red dots for extension and blue circles for compression since we used different levels of completeness in the b-value estimation, this X-axis shows \(M-{M}_{C}\) instead of \(M\)63. d: Zones of the St1131 zonation (dashed polygons), colored according to the kinematics of the zones determined in our work (red for extension and blue for compression).

Comparison with previous studies and test of our method

The b-values we have determined are statistically different, but closer than expected or previously reported. As for the extensional zone, the b-value is similar to the one estimated by GW1010 (1.09 ± 0.02 vs 1.13 ± 0.01 for our analyses); however, it differs for the compressional region (0.75 ± 0.02/0.78 ± 0.02 vs 0.98 ± 0.03 for our analyses). Notably, their compressional zone is well aligned with known active seismogenic sources43 located in the northeast of Italy; instead, our compressional area is bigger (more than double the size of GW1010). These differences became smaller if we applied our method and our catalog to the GW1010 zones (Fig. S1 and Table S1 in Supplementary Information), suggesting that the discrepancies in the aforementioned estimations could be due to a different magnitude scale adopted (Mw in our case vs ML in the original GW1010 estimation).

As for the b-values determined by St1131, our extensional area overlies on nine of their zones, for which a b-value variability between 0.74 and 1.96 is reported (1.13 ± 0.01 in our case). Similarly, our compressional area overlaps four zones of St1131, characterized by a larger range of 1.04–1.35 than our estimates (0.98 ± 0.03). These differences remain large also if we apply our method and our catalog to the St1131 zones (Fig. S1 and Table S1 in Supplementary Information).

Hence, is the higher b-value variability of St1131 due to the limited number of earthquakes falling in the zones, or is it due to local earthquake characteristics (host rocks, thermal conditions, rheology parameters) in a homogenous tectonic regime? We performed a simple but rigorous two-steps experiment to investigate this point.

In the first step, we split the earthquake catalog into learning and testing datasets (25–75%, respectively). Using the learning dataset, we estimated the b-value for the compressional and extensional areas determined in our work. Similarly, we calculated the b-value for 13 zones (9 zones for the extensional setting and four zones for the compressional setting) of St1131 overlapping our extensional and compressional zones. Therefore, using the same dataset, we have determined 13 different b-values for St1131 zones and two b-values for the extensional and compressional areas determined with geodetic data. In the second step, we used the testing dataset to check whether the magnitude distribution in the 13 zones of St1131 is better described by their local b-value determined in the first step or with the b-value estimated for the larger zones determined in our work with the corresponding tectonic regime. As a comparison metric, we adopt the Bayes Factor44, already used in seismological comparison studies45,46. We found that two b-values of compressional and extensional areas better describe the magnitude distribution of the testing dataset in the 13 zones of St1131 than the 13 b-values determined with the learning dataset (Table 2, see Methods for details).

Table 2 Models comparison results.

Conclusions

Interseismic geodetic strain rates and horizontal stress directions can be used to define the extensional and compressional zones in the Italian region. The resulting b-value of the Gutenberg-Richter law in these zones is significantly larger in the extensional setting than in the compressional one, although these differences are much smaller than previously reported. We finally verify that our zones and their small b-value differences are coherent with observed seismicity.

Consequently, the large b-value differences seen in previous studies can appear as artifacts when the spatial zones of a seismotectonic model are too small; the undersampling of the earthquake size distribution affects the computation, resulting in an overdispersed representation of b-values. Similarly, some b-value variations of adjacent small regions are wrongly attributed to different kinematics when using subjective and sharp boundaries, but they average out when using independent and reproducible methods such as the geodetically derived strain rates. It follows that some large spatial b-value fluctuations are probably due to intrinsic stochastic variability more than a physical feature of the seismicity. These principles of interpreting apparent b-value variations (governed by undersampling more than by physical processes) could be valid for other applications by using an appropriate and meaningful resolution/selection46,47,48,49,50.

Some direct implications follow from our work for the probabilistic seismic hazard analysis. Before using b-values for earthquake forecasting, careful inspection of their variations is required because it could mainly derive from an excessive splitting of the study area in small (as in the case of Stucchi et al.29) and/or kinematically wrong zones (as in the case of some zones drew in Gulia and Wiemer10). At least for the Italian peninsula, where geodetic data density is adequate, using simple, reproducible (and falsifiable) rules based on the 2D strain rate tensor reduces the difference in b-values between extensional and compressional settings. Clearly, in global seismic catalogs that also include oceanic ridges (for extension) and subduction zones (for compression), these b-value differences can be much larger.

Methods

Definition of compressional and extensional domains by inverting GNSS-derived horizontal velocities and interpolated SHmax directions

The diagonalization of a \(2\times 2\) geodetically-derived horizontal-plane strain-rate tensor gives the principal horizontal values \({\dot{\varepsilon }}_{1{{{{{\rm{h}}}}}}}\) (more negative or compressional) and \({\dot{\varepsilon }}_{2{{{{{\rm{h}}}}}}}\) (more positive or extensional) and their associated principal axis directions. If we plot both values in a cartesian plane of the horizontal principal strain rates, it is always possible to define the kinematics of a given 2D geodetically-derived strain rate tensor. By definition of \({\dot{\varepsilon }}_{1{{{{{\rm{h}}}}}}}\) and \({\dot{\varepsilon }}_{2{{{{{\rm{h}}}}}}}\), only the area above the bisector of the 1st−3rd quadrants can be populated. In detail, the first quadrant unambiguously defines a normal-faulting tectonic regime, whereas the third quadrant delimitates the thrust-faulting one. The transition from transtension to pure strike-slip to transpression can be identified in the second quadrant, moving counterclockwise51.

The spatial distribution of the 2D strain rates is obtained with the NeoKinema code52, which has repeatedly been proven to be successful in modeling the active tectonics of the central Mediterranean13,19. The model strain rates are obtained by simultaneously inverting the GPS velocities of Serpelloni et al.22 and interpolated SHmax directions of Mariucci and Montone23 on a 15-km laterally-spaced 2-D finite element grid. The SHmax directions are interpolated with the procedure described in Carafa and Barba53 and Carafa et al.54. The general objective function of our inversion with the NeoKinema code is:

$$\Pi \equiv -{\left(\vec{p}-\vec{r}\right)}^{T}\left[{\widetilde{C}}_{{GPS}}^{-1}\right]\left(\vec{p}-\vec{r}\right)-\frac{1}{{A}_{0}}\mathop{\sum }\limits_{n=1}^{3}\mathop{\iint }\limits_{{area}}\,\frac{{\left({p}_{n}-{r}_{n}\right)}^{2}}{{\sigma }_{n}^{2}}{{{{{\rm{d}}}}}}a$$
(2)

where \(\overrightarrow{p}\) is the vector of (scalar) model predictions, and \(\overrightarrow{r}\) is the vector of (scalar) data; and \(n=1,2,3\) enumerates 3 target rates associated with three independent functionals of permanent strain-rate at each continuum point. Although the kinematics of both compressional and extensional remain stable for a large range of the dimensional parameter \({A}_{0}\), we set it to \({A}_{0}=1\cdot {10}^{8}{{{{{{\rm{m}}}}}}}^{2}\). In this case, the error (L1 norm) of the model predictions is 0.34 mm/yr relative to the geodetic data and 10° relative to the interpolated stress directions.

The target velocities derived from geodetic data are included in the first term of the objective function. In contrast, the second term contains the targets derived from interpolated stress-direction data and the stiff-continuum assumption. This second term describes the size and orientation of distributed permanent deformation rate tensors for each finite element. The elastic rebound theory suggests that the strain rate is high near active faults or plate boundaries and tends to zero in stable plate interiors. An appropriate formalism for this “microplate assumption” is to assign a zero target strain rate with a statistical uncertainty \(\mu\)51,55 that can be seen as the regional RMS of the interseismic strain rate. We determined \(\mu\) by iteration: we choose a starting \(\mu\) value as an input parameter in Eq. (2) and then compare it to the average interseismic strain rate from the model output. For our work, the value \(\mu =6.56\cdot {10}^{-9}{{{{{{\rm{yr}}}}}}}^{-1}\) minimizes the difference between the model input and output and is selected as optimal.

The resulting strain rate map (Fig. 2) derived from GNSS horizontal velocities shows two well-delimited and divided regions undergoing compression/transpression (Southern Alps) and extension/transtension (Apennines). Notably, the extensional region doesn’t contain any volcanic area, which is expected to exhibit higher b-values.

As for the transition in the second quadrant from extension/transtension to pure strike-slip to compression/transpression, we derived it from the ratio of \({\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\) and \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}}\) resulting from the kinematic inversion. When the principal values differ in sign, we can derive that the kinematic of a 2D strain-rate tensor is transtensional (thus undergoing extension) if \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}} > -(1-{k}_{1})\cdot {\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\); transpressional (compression) if \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}} < -\frac{1}{(1-{k}_{1})}\cdot {\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\), strike-slip otherwise. In Fig. 5, we show that for our study area, the definition of transtension and transpression in the second quadrant resulting from \({k}_{1}=0.25\) is consistent with the observed principal components of the 2D strain rate tensor for finite elements falling in both areas. For the extensional area, this observation is consistent with \({\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\) being negative (though marginal) and linked to the far-field Africa-Eurasia convergence17. Similarly, in the Southern Alps, the clockwise rotation of Adria relative to the Eurasia plate determines a lateral transition from pure thrust earthquakes to transpression to strike-slip ones56, thus \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}}\) being positive though marginal.

Fig. 5: Analysis of horizontal strain rate principal values.
figure 5

a Horizontal strain rate principal values (\({\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\) and \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}}\)) for finite elements falling in extensional and compressional areas versus the tectonic regime definition resulting from \({k}_{1}=0.25\). b IPSI earthquake kinematics versus the tectonic regime definition resulting from \({k}_{1}=0.25\).

To confirm the coherence between interseismic strain rates and earthquake kinematics, we used the IPSI focal mechanisms23 for earthquakes that occurred after 1900 AD, which are already grouped according to their tectonic regime. In the cartesian plane of the horizontal strain rates, we plotted each IPSI earthquake labeled with “TF” or “TS” and “NF” or “NS” and that fell in our compressional and extensional regions. The earthquake coordinates in the strain-rate cartesian plane are the principal values \({\dot{\varepsilon }}_{{{{{{\rm{1h}}}}}}}\) and \({\dot{\varepsilon }}_{{{{{{\rm{2h}}}}}}}\) of the element where the earthquake falls. The excellent overlap in the cartesian plane of each earthquake point with its expected tectonic regime resulting from \({k}_{1}=0.25\) confirms our basic assumption that interseismic strain rate can be used as a good proxy for the earthquake kinematics (Fig. 5).

Determining the ratio between the correctly determined and the total seismic moment for each area

We checked the consistency between our compressional and extensional areas using independent observations. These observations are the earthquakes of two catalogs: the regional CMT catalog57 (1997–2022), and the Italian CMT catalog58 (1976–2015).

For both catalogs, we selected all earthquakes falling in these two areas and calculated the ratio of the seismic moment between the earthquakes with matched kinematics with the total one. To select the correct kinematics, we used the rake angle of the first and the second nodal planes, available in the catalogs. The moment magnitudes of the catalogs are converted into seismic moments using the classical Kanamori equation. The resulting values are illustrated in Table 1.

Magnitude of completeness

Estimating the magnitude of completeness is a fundamental step in correctly analyzing the b-value. Here, we used the approach of Herrmann and Marzocchi38, based on the Lilliefors test. We consider the catalog complete above a threshold magnitude if it follows an exponential distribution, as the Gutenberg-Richter law states. We apply this method to the Italian instrumental HORUS catalog from 2005-4-16 to 2022-12-3126 (from the 16th of April 2005, there was a change in the Italian seismic network and increased data availability). We consider the events inside our two areas, with a maximum depth of 30 km. This depth is larger than the one finally used for the b-value estimation, 15 km, but since we also investigate this second threshold of depth in a sensitivity analysis we need to set a reliable completeness until 30 km. The resulting magnitude of completeness is Mw 1.8. We then carefully check the short-term aftershocks incompleteness problem (STAI effect40); for each earthquake with a magnitude larger than Mw 5.5, we consider that in the following 3 days, the completeness threshold is incremented by 1 (i.e., from 1.8 to 2.8). This increment should remove the STAI effect without losing all the events of the aftershock sequences. However, recent works suggest that modern statistical methods for the completeness estimation could also underestimate the real completeness thresholds39,59. Zhuang et al.60 suggest looking to the incremenal number vs magnitude plot to find possible incompleteness in the catalog: the incomplete parts are indicated by white empty spaces in the lower part of the plot or vertical alignment of dots. Taroni et al. (2021)61 showed that this method is also effective in the catalogs with a variable magnitude of completeness (e.g., during STAI periods), if Magnitude – Mc (i.e., the magnitude of the earthquake minus the corresponding magnitude of completeness) is plotted instead of the magnitude. In Fig. 6 these plots are shown: some incompleteness is present in the case of Mc = 1.8 (panel a) and Mc = 1.8 with an increase of 1.0 during STAI periods (panel b). Thus, we decided to use the value of Mc = 2.5. This value also has an operational meaning because it matches the communication threshold of the seismic surveillance room at INGV with the Italian Department of Civil Protection62. Thus, by definition, Mc = 2.5 ensures the detection of all seismic events in the Italian territory. Figure 6 (panel c) demonstrates that this 2.5 threshold shows no suspect incompleteness period.

Fig. 6: Temporal variation of the completeness magnitude.
figure 6

a Incremental number vs magnitude plot for the earthquakes in the catalog with magnitude ≥1.8. b Incremental number vs Magnitude – Mc plot for the earthquakes in the catalog with magnitude ≥1.8, with Mc increased by 1.0 during STAI periods. c Incremental number vs Magnitude – Mc plot for the earthquakes in the catalog with magnitude ≥2.5, with Mc increased by 1.0 during STAI periods.

B-value estimation and test

After the computation of the completeness, we selected the events in the extensional and compressional regions, with a maximum depth of 15 km (see main text for this choice of depth). Since we used different completeness thresholds to account for the STAI effect, we estimate the b-value (\(\hat{b}\)) using the Taroni41 approach:

$$\hat{b}=\frac{\frac{N-1}{N}}{\log (10)\left[\frac{{\sum }_{i=1}^{N}\left({M}_{i}-{M}_{i}^{c}\right)}{N}+\frac{\triangle M}{2}\right]}$$
(3)

Where \(N\) is the total number of events above the completeness, \({M}_{i}\) is the magnitude of the i-th event, \({M}_{i}^{c}\) is the completeness magnitude at the time of the i-th event, and \(\triangle M\) is the magnitude binning. This equation also contains the correction for an unbiased estimation (the \(\frac{N-1}{N}\) in the numerator) and the correction for the magnitude binning (the \(\frac{\triangle M}{2}\) in the denominator). We computed the 95% confidence interval using the normal approximation \([\hat{b}-1.96\hat{\sigma \,},\hat{b}+1.96\hat{\sigma }]\), where \(\hat{b}\) and \(\hat{\sigma \,}\) (sigma) are estimated with the Taroni (2021)41 approach. To test the significance of the b-value difference, we used the classical Utsu test42, which is still valid if the b-value is estimated with the Taroni (2021)41 approach. The Utsu test uses the information on the number of events \(N\) and the estimated b-value \(\hat{b}\) in each of the two sets of observations. The null hypothesis of this test is an equal distribution for the two sets of observations: in the case of a low p-value of the test (e.g., p-value < 0.05), the null hypothesis must be rejected and the two b-values can be considered significantly different. To avoid possible bias in the estimation due to undetected incompleteness, we also performed b-value estimation and test for higher thresholds of completeness (until Mw 2.7, and in this case, the completeness after strong earthquakes is Mw 3.7, i.e., 2.7 + 1). A straightforward sensitivity test with depth cutoff set to 25 km instead of 15 km obtained the same results, implying a marginal effect of depth on b-value estimation in this analysis (Fig. S2 in Supplementary Information). Very similar results are also obtained using Type 2 of compressional and extensional zones (see Table 1 for the definition of these zones, and Fig. S3 in Supplementary Information for the relative results).

Two-steps experiment on b-value comparison between different seismotectonic models

Here, we better explain our comparison check between the Stucchi et al.29 zonation approach and our model. We split the catalog into a learning dataset (20-25-33% of the data) and a testing dataset (80-75-67% of the data). Note that to have a meaningful experiment the testing dataset must contain more events than the learning dataset. To avoid possible temporal bias for the two subsets, we selected the learning dataset uniformly with respect to time (in the case of 25–75%, 1st, 5th, 9th, 13th, etc. events) and the same for the testing dataset (2nd, 3rd, 4th, 6th, 7th, etc. events). We used the events falling in the intersection of our two zones and the Stucchi et al.29 zonation (4 zones for our compressional region and 9 zones for our extensional region). We computed the b-value for each of the 13 zones and for our two larger regions; then, we used the events of the testing catalog inside each of the 13 zones to check if these events are better modeled by the local b-value of their own zone of by the global b-value of our two large regions. We used the Bayes Factor44, a simple, robust, and popular approach already used in the b-value comparison41. By aggregating the comparisons for all 13 zones, we obtain decisive/strong evidence favoring our approach (Table 2).