On the extreme hydrologic events determinants by means of Beta-Singh-Maddala reparameterization

Domma, Filippo; Condino, Francesca; Franceschi, Sara; De Luca, Davide Luciano; Biondi, Daniela

doi:10.1038/s41598-022-19802-4

Download PDF

Article
Open access
Published: 15 September 2022

On the extreme hydrologic events determinants by means of Beta-Singh-Maddala reparameterization

Filippo Domma¹,
Francesca Condino¹,
Sara Franceschi²,
Davide Luciano De Luca³ &
…
Daniela Biondi³

Scientific Reports volume 12, Article number: 15537 (2022) Cite this article

743 Accesses
1 Citations
Metrics details

Subjects

Abstract

In previous studies, beta-k distribution and distribution functions strongly related to that, have played important roles in representing extreme events. Among these distributions, the Beta-Singh-Maddala turned out to be adequate for modelling hydrological extreme events. Starting from this distribution, the aim of the paper is to express the model as a function of indexes of hydrological interest and simultaneously investigate on their dependence with a set of explanatory variables in such a way to explore on possible determinants of extreme hydrologic events. Finally, an application to a real hydrologic dataset is considered in order to show the potentiality of the proposed model in describing data and in understanding effects of covariates on frequently adopted hydrological indicators.

Changes in the distribution of hydro-climatic extremes in a non-stationary framework

Article Open access 30 May 2019

Origin and variability of statistical dependencies between peak, volume, and duration of rainfall-driven flood events

Article Open access 04 March 2021

A mixed distribution to fix the threshold for Peak-Over-Threshold wave height estimation

Article Open access 15 October 2022

Introduction

Over the last decades, growing attention have been addressed to the impact of hydrologic extreme events and to their possible relationship with climate change. It is indeed well known how the occurrence of extreme events, such as heavy rain, are responsible for a unduly large part of climate-related damages and hence are of great concern to the impact community and stakeholders^1,2. The update and the improvement of useful models for better exploring observed extremes, with an emphasis on flood quantiles, are therefore strategic activities for the assessment of current and future exposure to risks. In this context, the hydrologists need for the most suitable model which not only gives rise to a good fit of data but is also based on realistic return level. The most used approach for modelling extreme events is conventional frequency analysis by adopting several common probability distributions such as the log Pearson type III³, the three-parameter Lognormal (⁴, pp. 208–238), the Generalized Pareto (see e.g.^4,5,6, p. 615), the Generalized Logistic (see e.g.^7,8), the Generalized Extreme Value (see e.g.^9,10), the Generalized Gumbel (see e.g.^11,12), the Two Component Extreme Value (TCEV¹³), and the Generalized Lindley¹⁴. Recently, more flexible distributions were proposed (see e.g.^15,16,17): theoretically derived distributions of flood frequency account for the observed rainfall probability distribution and exploit rainfall-runoff models parameterized by means of geomorphological information (see e.g.^18,19,20); other approaches represent non-asymptotic distributions for the annual maxima, and explicitly accounts for the random nature of the number of events/year and the inter-annual variability of the distributions of the ordinary events in each year^21,22.

Moreover²³, proposed the use of a new distribution function, namely four parameters Beta-Singh-Maddala distribution (so called because it is obtained by setting a parameter equal to 1 in the five parameters Beta-Singh-Maddala distribution), showing, by means of an application on real data regarding river flow maxima, its potentiality in extreme events analysis. A specific connection with two of the three special case of Generalized Extreme Value distribution has been proved, since this distribution belongs to the Fréchet maximal domain of attraction and to the Weibull minimal domain of attraction. With reference to the different techniques used in the literature to expand the families of distribution functions, we highlight that the four parameters Beta Singh-Maddala distribution corresponds to a proportional reversed hazard model or to a Lehmann type I distribution and can also be referred to as exponentiated Singh -Maddala distribution.

An additional common problem in hydrology is the estimation of flood quantiles in catchments having short data records or ungauged. Indeed accurate estimates of various streamflow statistics are crucial for water infrastructures design and for flood risk assessment, and they are routinely needed for ungauged catchments that lack nearby streamflow-gauged stations from which streamflow statistics could be directly computed. In this context, regional flood frequency analysis (RFFA) has been proposed in many regions worldwide on the basis of the concept that regional flood flow characteristics are closely related to basin and climate characteristics^{24,25,26,27,28,29,30,31,32}. Moreover³³, stated that regionalization should always be used in statistical analysis of extreme hydrological events because of the large influence that higher moments exert on the shape of the tail of the distribution which are focused by practical applications. Direct regression, geostatistical procedures, and index-flood method³⁴ can be mentioned as example for RFFA³⁵. In particular, the index-flood method coupled with L-moments³⁶, has been extensively used worldwide^{37,38,39,40,41,42,43,44,45}.

As⁴⁶ emphasized, RFFA essentially consists of two principal steps: (i) identification of groups of hydrologically similar catchments, usually named homogeneous regions (HRs); (ii) development of prediction equations within each delineated region.

The identification of HRs often depends on subjective decisions⁴⁷; traditionally, geographic and administrative boundaries have been used for defining homogeneous regions. Nevertheless, regions purely based on these characteristics may lack in hydrological homogeneity^48,49. The method for identifying homogeneous regions in RFFA can be broadly divided in: (a) canonical correlation analysis, (b) cluster analysis, (c) hierarchical approach, (d) method of residuals, (e) region-of-influence (ROI), (f) canonical kriging and (g) flood seasonality regionalization (see e.g.^{50,51,52,53,54,55}). As reported in⁵⁶, cluster analysis can be considered as a state-of-art technique that can reduce the process subjectivity and regroup in a more appropriate way under a hydrological point of view. According to⁵⁷, algorithms used for cluster analysis in regionalization studies can be categorized in hard (e.g., hierarchical, partitional, or hybrid) and fuzzy clustering. Moreover, there was a recent increase in the use of artificial intelligence (AI) and such techniques often provide superior results when compared with partitional clustering algorithms^54,58. Whatever technique is adopted for HRs delineation, a critical point is the assessment of the plausibility of the obtained grouping and of the hypothesis of homogeneity for the proposed regions⁵⁹. Moreover, the estimates are not smooth (both in geographic or physiographic space) due to possible discontinuities. Consequently, approaches that do not define fixed-boundary regions^60,61 are receiving increasing attention: methods based on the interpolation of the hydrological variable in the descriptors space^24,62, or based on the so-called top-kriging⁶³.

As regards development of prediction equations, log-linear regressions techniques represent the most commonly used models. They allow to establish a relationship between hydrological variables and explanatory variables (such as drainage area, slope of the main channel, etc.). However, hydrological processes are naturally complex and consequently a simple log transformation could be insufficient for capturing this complexity. A recent work by⁶⁴ compared the performances of several RFFA methods with respect to variable selection, variable transformation and delineation of regions. In particular they proposed the use of a generalized additive model (GAM) for dealing with nonlinearity between the dependent and predictor variables showing that, on the basis of their data, this approach generally outperforms the other methods even without linking GAM with a neighbourhood/region-of-influence approach.

Therefore, there is a huge literature on extreme hydrologic events concentrating on the modeling of extreme events and on regression techniques separately, while methods which simultaneously allow these evaluations seem to be less investigated. Some examples can be found in^65,66,67 in the context of non-stationary series analysis. In this paper, we try to contribute by using the logic of the reparameterizations of the families of distribution functions and the one underlying the construction of the GAMLSS models. As known, the reparameterization techniques, when possible, allow to express the distribution function as a function of indicators of specific interest in the field of application, making it easier to interpret the behavior of the probability density function. The first contribution of this paper is to propose a new reparametrization of the Beta-Singh-Maddala, introduced by²³, which allows us to express the distribution as a function of indicators of specific interest in the field of hydrological studies such as, for example, median and return level of hydrological variables. The second contribution of the paper consists in specifying regressive models for the dependence of the indicators on a set of explanatory variables using appropriate link functions in a similar way to what was done in the GAMLSS models. Consequently, the proposed method allows to overcome the fragmentation characterizing the generally used approach, that involves a first step aimed at indicator estimates (such as return level, mean, median) and a second step regarding regression on them.

The Model

In this section, after a brief description of the Beta-Singh-Maddala distribution with four parameters (Beta-SM4), we present the general reparameterization assuming that there are four indicators, functions of the four parameters of Beta-SM4, of specific interest in the hydrological field and a set of explanatory covariates. Next, we study the particular case in which interest is placed on the median and on the return level.

Reparameterization of the four parameters Beta-Singh-Maddala distribution

The four Beta-Singh-Maddala (Beta-SM4) has been proposed in the context of hydrologic data analysis by²³, with the aim to properly describe some relevant aspects, such as the extent of return period and the amount and frequency of extreme values. Among the different properties demonstrated, we emphasize that the Beta-SM4 distribution turns out to be the distribution of the maximum of Singh-Maddala random variables, which belongs to the Frèchet maximal and to the Weibull minimal domain of attraction. Moreover, the authors highlight that this four-parameter distribution not only show a good overall fit on real data, but also a suitable representation of the extreme tails. Here, starting from this distribution, we propose its reparameterization, in order to make the parameters directly interpretable in terms of measures particularly relevant for hydrologic events description.

The Beta-SM4 distribution, in its original parameterization, has the following distribution function (df):

$$\begin{aligned} F_{Beta-SM4}(x;\varvec{\xi })=\left[ 1-\left( 1+\gamma _{2} x^{\gamma _{3}} \right) ^{-\gamma _{1}}\right] ^{a} \end{aligned}$$

(1)

where $\varvec{\xi }^{'}=(\gamma _{1},\gamma _{2},\gamma _{3},a )$, with $a>0$ and $\gamma _{k}>0$ for $k=1,2,3$. The probability density function (pdf) is given by

$$\begin{aligned} f_{Beta-SM4}(x;\varvec{\xi })= a\left[ F_{SM}(x;\varvec{\gamma }) \right] ^{a-1} f_{SM}(x;\varvec{\gamma }), \end{aligned}$$

(2)

where

$$\begin{aligned} F_{SM}(x;\varvec{\gamma })=1-\left( 1+\gamma _{2} x^{\gamma _{3}} \right) ^{-\gamma _{1}} \end{aligned}$$

(3)

and

$$\begin{aligned} f_{SM}(x;\varvec{\gamma })=\gamma _{1}\gamma _{2}\gamma _{3} x^{\gamma _{3}-1} \left( 1+\gamma _{2} x^{\gamma _{3}} \right) ^{-\gamma _{1}-1} \end{aligned}$$

(4)

are the df and pdf of Singh-Maddala distribution, where $\varvec{\gamma '}=(\gamma _1,\gamma _2,\gamma _3)$, see⁶⁸.

From the expression of the pth quantile

$$\begin{aligned} x_{Beta-SM4}(p)=\gamma _{2}^{-\frac{1}{\gamma _3}} \left[ (1-p^{\frac{1}{a}})^{-\frac{1}{\gamma _1}} -1 \right] ^{\frac{1}{\gamma _3}} \end{aligned}$$

(5)

it is immediate to obtain median of Beta-SM4 distribution:

$$\begin{aligned} me_{Beta-SM4}(p)=\gamma _{2}^{-\frac{1}{\gamma _3}} \left[ (1-0.5^{\frac{1}{a}})^{-\frac{1}{\gamma _1}} -1 \right] ^{\frac{1}{\gamma _3}}. \end{aligned}$$

(6)

Furthermore, fixed the return period $\pi _{x_0}=[1-F_{Beta-SM4}(x_0;\varvec{\xi })]^{-1}$, the corresponding return level $x_0$ is given by

$$\begin{aligned} x_0(\pi _{x_0})=\gamma _{2}^{-\frac{1}{\gamma _3}} \left\{ \left[ 1-\left( 1-\frac{1}{\pi _{x_0}}\right) ^{\frac{1}{a}}\right] ^{-\frac{1}{\gamma _1}} -1 \right\} ^{\frac{1}{\gamma _3}}. \end{aligned}$$

(7)

For further properties and details on Beta-SM4 distrinution, see²³.

Following the proposal of⁶⁹, we consider the possibility of reformulate the Beta-SM4$(\gamma _1,\gamma _2, \gamma _3, a)$ in terms of new parameters, $I_j, j=1,...,4$, that are indicators describing some peculiarities of hydrologic data distribution and such that there exist a one-to-one transformation of the kind $I_j=g_j(\gamma _{1},\gamma _{2},\gamma _{3},a), j=1,...,4$, so that the system

$$\begin{aligned} {\left\{ \begin{array}{ll} I_1= g_{1}(\gamma _{1},\gamma _{2},\gamma _{3},a) \\ I_2= g_{2}(\gamma _{1},\gamma _{2},\gamma _{3},a) \\ I_3= g_{3}(\gamma _{1},\gamma _{2},\gamma _{3},a) \\ I_4= g_{4}(\gamma _{1},\gamma _{2},\gamma _{3},a) \end{array}\right. } \end{aligned}$$

(8)

has a unique solution in terms of $\gamma _{1},\gamma _{2},\gamma _{3}$ and a:

$$\begin{aligned} {\left\{ \begin{array}{ll} \gamma _{1}= \gamma _{1}(I_1, I_2, I_3,I_4) \\ \gamma _{2}= \gamma _{2}(I_1, I_2, I_3,I_4) \\ \gamma _{3}= \gamma _{3}(I_1, I_2, I_3,I_4) \\ a= a(I_1, I_2, I_3,I_4) \end{array}\right. }. \end{aligned}$$

(9)

Substituting the solution (9) in (1), (2) and (5), it is possible to obtain, respectively, the expressions of the df, the pdf and pth quantile the in terms of the chosen indicators. So, for example, the distribution function of reparameterized Beta-SM4 is

$$\begin{aligned}&F_{R-Beta-SM4}(x; I_1, I_2, I_3, I_4)= \nonumber \\&\left[ 1-\left( 1+\gamma _{2}(I_1, I_2, I_3,I_4) x^{\gamma _{3}(I_1, I_2, I_3,I_4)} \right) ^{-\gamma _{1}(I_1, I_2, I_3,I_4)}\right] ^{a(I_1, I_2, I_3,I_4)}. \end{aligned}$$

(10)

Now, in order to evaluate how climatic or physic characteristics could affect the chosen indicators, we will express them as functions of a set of covariates that could have an effect separately and/or simultaneously. If for each sampled catchment i, $(i=1,...,n)$, an hydrological variable of interest (e.g. annual streamflow maximum) and a set of explanatory covariates are observed, indicators (8) can be reformulated by specifying their relationship with covariates. Denoting by $\varvec{w}_{1,i}$, $\varvec{w}_{2,i}$, $\varvec{w}_{3,i}$ and $\varvec{w}_{4,i}$ the vectors that, in general, affect the four indicators separately, the relationship between indicators and covariates can be specified as follow

$$\begin{aligned} I_{j,i}=h_j(\varvec{w}_{j,i},\varvec{\beta }_j) \end{aligned}$$

(11)

for $j=1,2,3,4$ and $i=1,...,n$, where $h(\cdot )$ is an appropriate link function chosen according to whether $I_{j,i}$ is positive or varies in (0, 1). Parameters $\varvec{\beta }_j$ indicate the regression coefficients associated with the covariates that need to be estimated on the basis of available observations by the maximum likelihood method. It is worth to note that this approach allow to take into account also the possible nonlinearity between the hydrological variable of interest and covariates since all the parameters of the conditional distribution of the response can be modelled through parametric linear or non-linear functions of explanatory variables. Moreover, we are able to obtain estimation of measures of interest, such as flood quantiles, also for ungauged catchments or for catchments having short data records.

Formulation in terms of median and return level

In the following subsection we propose a particular reparameterization of Beta-SM4 distribution, involving median and return level as indicators of interest. Both these indicators have a simple and direct interpretation in terms of hydrologic meaning and the inspection of the possible effect of some covariates on them could be of particular interest in many real contexts. This particular reparameterization is just a possible example, indeed different reformulations could be considered, depending on the features to be investigated.

The original parameters are substituted by the following one-to-one transformation

$$\begin{aligned} (\gamma _1, \gamma _2, \gamma _3,a) \mapsto (\tau , me, x_0, a) \end{aligned}$$

(12)

where $\tau =\frac{1}{\gamma _1}$, me is the median of distribution, $x_0$ is the return level, corresponding to a pre-fixed return period $\pi _{x_0}$, and parameter a remain unchanged.

The adopted reparameterization for $\gamma _1$ is similar to the one proposed in⁶⁹ for Dagum distribution and it is especially convenient when the feature of interest is transferable, since it is a direct indicator of concentration level. For the sake of generality, we mantain this reparemeterization as it could be usefull in other contexts of study.

From (6) and (7), after simple algebra, we obtain

$$\begin{aligned} {\left\{ \begin{array}{ll} \gamma _{1}= \frac{1}{\tau } \\ \gamma _{2}= [(1-0.5^{1/a})^{-\tau }-1] \cdot me^{-\frac{\log \left\{ \left[ 1-\left( 1-\frac{1}{\pi _{x_0}}\right) ^{1/a}\right] ^{-\tau }-1\right\} - \log \left\{ [1-0.5^{1/a}]^{-\tau }-1\right\} }{\log x_0- \log me}} \\ \gamma _{3}=\frac{\log \left\{ \left[ 1-\left( 1-\frac{1}{\pi _{x_0}}\right) ^{1/a}\right] ^{-\tau }-1\right\} - \log \left\{ [1-0.5^{1/a}]^{-\tau }-1\right\} }{\log x_0- \log me}\\ a= a \end{array}\right. }. \end{aligned}$$

(13)

From (10) and (13), it is immediate to obtain the new expression of df of Beta-SM4 r.v. in terms of median and return level:

$$\begin{aligned} F_{R1-Beta-SM4}(x;\tau ,me,x_0,a)= \left[ 1-\left( 1+\left[ \left( 1-0.5^{\frac{1}{a}}\right) ^{-\tau }-1\right] \right. \right. \nonumber \\ \left. \left. \left( \frac{x}{me}\right) ^{\frac{\log \left\{ \left[ 1-\left( 1-\frac{1}{\pi _{x_0}}\right) ^{1/a}\right] ^{-\tau }-1\right\} - \log \left\{ [1-0.5^{1/a}]^{-\tau }-1\right\} }{\log x_0- \log me}} \right) ^{-\frac{1}{\tau }}\right] ^{a} \end{aligned}$$

(14)

In order to identify a specific link function, we observe that all the indicators involved in the reparameterization (ie $\tau$, me, $x_0$ and a) are positive; in this context, it is usual to choose a log-linear link, ie $\exp ({\textbf {w}}'\varvec{\beta })$, where ${\textbf {w}}'\varvec{\beta }$ is the linear predictor.

Estimation

By specifying relationship between indicators and covariates as in (11), the general parameterization in Eq. (9) may be rewritten as

$$\begin{aligned} {\left\{ \begin{array}{ll} \breve{\gamma }_{1,i}= \gamma _{1}(I_{1,i}, I_{2,i}, I_{3,i},I_{4,i}) \\ \breve{\gamma }_{2,i}= \gamma _{2}(I_{1,i}, I_{2,i}, I_{3,i},I_{4,i}) \\ \breve{\gamma }_{3,i}= \gamma _{3}(I_{1,i}, I_{2,i}, I_{3,i},I_{4,i}) \\ \breve{a}_{i}= a(I_{1,i}, I_{2,i}, I_{3,i},I_{4,i}) \end{array}\right. }. \end{aligned}$$

(15)

Consequently the log-likelihood function expressed in terms of the unknow coefficients $\varvec{\beta }=(\varvec{\beta '}_{1},\varvec{\beta '}_{2},\varvec{\beta '}_{3},\varvec{\beta '}_{4})'$ is given by

$$\begin{aligned} \ell (\varvec{\beta };\varvec{x},\varvec{W})=\sum _{i=1}^n \ln {\breve{a}_i}+\sum _{i=1}^n (\breve{a}_i-1) \ln [F_{SM}(x_i;\breve{\varvec{\gamma }}_i)]+\sum _{i=1}^n\ln [f_{SM}(x_i;\breve{\varvec{\gamma }}_i)] \end{aligned}$$

(16)

where $\varvec{W}$ denotes the matrix containing the covariates for the four indicators and $\varvec{\breve{\gamma }'}_{i}=(\breve{\gamma }_1,\breve{\gamma }_2,\breve{\gamma }_3)$. Assuming that the vectors of regression coefficients are of size $p_j$ for $j=1,2,3,4$, the $(p_1+p_2+p_3+p_4)$ likelihood equations are

$$\begin{aligned} \frac{\partial \ell (\varvec{\beta };\varvec{x},\varvec{W})}{\partial {\beta _{j,r_j}}}&=\sum _{i=1}^n \frac{\dot{a}_{j,r_j,i}}{\breve{a}_i}\nonumber \\&\quad + \sum _{i=1}^n \biggl [\dot{a}_{j,r_j,i} \ln [F_{SM}(x_i;\breve{\varvec{\gamma }}_i)]+(\breve{a}_i-1) \frac{\dot{F}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}}{F_{SM}(x_i;\breve{\varvec{\gamma }}_i)} \biggl ] \nonumber \\&\quad + \sum _{i=1}^n \frac{\dot{f}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}}{f_{SM}(x_i;\breve{\varvec{\gamma }}_i)}=0 \end{aligned}$$

(17)

where the quantities $\dot{a}_{j,r_j,i}$, $\dot{F}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}$ and $\dot{f}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}$ denote the partial derivatives of $\breve{a}_i$, $F_{SM}(x_i;\breve{\varvec{\gamma }}_i)$ and $f_{SM}(x_i;\breve{\varvec{\gamma }}_i)$ with respect to the parameter $\beta _{j,r_j}$, for $j=1,2,3,4$ and $r_j=1,2,...,p_j$, i.e.

$$\begin{aligned} \dot{a}_{j,r_j,i}=\frac{\partial \breve{a}_i}{\partial \beta _{j,r_j}}=\frac{\partial \breve{a}_i}{\partial I_{j, i}}\times \frac{\partial I_{j, i}}{\partial \beta _{j,r_j}}, \end{aligned}$$

$$\begin{aligned} \dot{F}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}= & {} \frac{\partial F_{SM}(x_i; I_{1,i}, I_{2,i}, I_{3,i}, I_{4,i}) }{\partial \beta _{j,r_j}} \\= & {} \frac{\partial F_{SM}(x_i; I_{1,i}, I_{2,i}, I_{3,i}, I_{4,i}) }{\partial I_{j,i}} \times \frac{ \partial I_{j, i}}{\partial \beta _{j,r_j}} \end{aligned}$$

and

$$\begin{aligned} \dot{f}_{SM}(x_i;\breve{\varvec{\gamma }}_i)_{j,r_j,i}= & {} \frac{\partial f_{SM}(x_i; I_{1,i}, I_{2,i}, I_{3,i}, I_{4,i}) }{\partial \beta _{j,r_j}} \\= & {} \frac{\partial f_{SM}(x_i; I_{1,i}, I_{2,i}, I_{3,i}, I_{4,i}) }{\partial I_{j,i}} \times \frac{ \partial I_{j, i}}{\partial \beta _{j,r_j}}. \end{aligned}$$

The coefficients $\varvec{\beta }_j$ will highlight the impact of the corresponding covariates directly on indicators $I_{j,i}$ of interest, $j=1,2,3,4$.

The system of the likelihood equations in (17) does not admit any explicit solution therefore, the ML estimates $\hat{\beta }_{j,r_j}$, for $j=1,2,3,4$ and $r_j=1,2,...,p_j$, can only be obtained by means of numerical procedures. Under the usual regularity conditions, the known asymptotic properties of the maximum likelihood method ensure that $\sqrt{n}(\hat{\varvec{\beta }}_n-\varvec{\beta })\xrightarrow {d}N(\varvec{0},\varvec{\Sigma }_{\varvec{\beta }})$, where $\varvec{\Sigma }_{\varvec{\beta }}=[\lim _{n\rightarrow \infty }\varvec{I(\varvec{\beta })}/n]^{-1}$ is the $(p_1+p_2+p_3+p_4)\times (p_1+p_2+p_3+p_4)$ asymptotic variance-covariance matrix and $\varvec{I(\varvec{\beta })}$ is the Fisher information matrix, given by $\varvec{I(\varvec{\beta })}=-E\left( {\textbf {H}}\right)$ where ${\textbf {H}}$ is the Hessian matrix of the second partial derivatives of the log-likelihood function, ie $\frac{\partial ^2 \ell (\varvec{\beta };\varvec{x},\varvec{W})}{\partial {\beta _{j,r_j} \beta _{h,r_h }}}$. The elements of the Fisher information matrix can be determined in a similar way to what was done in⁶⁹ and are available on request.

Application

In this example, we consider real time series of annual streamflow maxima data relative to a set of 14 catchments located in Calabria or partially included in the Basilicata region, southern Italy (Fig. 1, left panel). Timeseries for the 14 stream gauges refer to the official and validated database of the “Centro Funzionale Multirischi” of the Calabria Region (data are available upon request at https://www.cfd.calabria.it/) that is the institution in charge for data collection and management. Data are annual maxima obtained from hourly or sub-hourly resolution discharge measurements. The series have different length, ranging from 7 to 59 observations and cover non-homogeneous periods, from 1925 to 2009 (Fig. 1, right panel). Selected catchments range in size from 27 to 1323 km², while mean elevation varies from about 300 m to more than 1300 m a.s.l.. Table 1 summarizes some geomorphoclimatic characteristics of the investigated catchments. The area is characterized by a Mediterranean climate, with rainy periods mainly coinciding with fall and winter months while summers are hot and dry, strongly affecting the seasonal runoff cycle of the streams. Nevertheless, there are considerable differences in temperature and rainfall heights between mountainous territory in interior areas and coastal zones. The mean annual precipitation over the catchment set averages approximately 1000 mm: it is usually greater than 1100 mm for elevations above of 500 m a.s.l. and decreases to 700 mm in the Ionian coast (east coast).

Table 1 Geomorphoclimatic characteristics of the investigated catchments: A is the catchment area, n is the record length in years; Hm is the mean catchment elevation; Ybar is the latitude of the center of the basin; LCV6 is the LCV of annual maxima of rainfall heights for a duration of six hours; ${\hbox {Q}}_{\hbox {m}}$ is the mean value of the annual streamflow maxima; S.D. is the standard deviation of annual maxima time series.

Full size table

In order to show the adequacy of the proposed model in describing this kind of data, we consider the reparameterization reported in (12), involving the four indicators $I_1=\tau$, $I_2=me$, $I_3=x_0$ and $I_4=a$, where $I_1$ is a direct indicator of distribution concentration, $I_2$ is the median of streamflow maxima, $I_3$ is chosen to be the 5-years return level and $I_4$ is equal to the parameter a in the original parameterization. First of all, we obtain the maximum likelihood estimates (MLEs) of the parameters under the homogeneity hypothesis of the catchments, i.e. the estimates obtained in absence of covariates effects. To this end, we consider the udometric coefficient, to take into account the different basin areas. The obtained MLEs and corresponding $95\%$ confidence interval (in brackets) are: $\hat{\tau }=1.87\times 10^{-5}$ $(1.59\times 10^{-5}; 2.20\times 10^{-5})$, $\hat{me}=0.449$ (0.393; 0.513), $\hat{x_0}=0.958$ (0.806; 1.139) and $\hat{a}=129.13$ (112.8; 147.8). The adequacy of the model to the analyzed data is graphically confirmed from the probability plot presented in Fig. 2: the trend appears to be linear by fitting a straight line through the points, suggesting that the Beta-SM4 is an appropriate model for these data. The details of the construction of the probability plot are given in the Appendix (see Supplementary Information).

Focusing on data set of udometric coefficient for Calabria, the obtained Beta-SM4 performances were compared, besides with the widely used Generalized Extreme Value (GEV) distribution, also with the Two-Component Extreme Value (TCEV) distribution^13,43,70, that is a 4-parameter probability function and it is widely adopted for RFFA in Calabria region. The underlying hypothesis for the TCEV formulation is the existence of two kinds of flood populations for series of annual maximum flows of many Italian rivers, and particularly in the Mediterranean area. This theoretical consideration can be reconducted to different physical interpretations of the event genesis: ordinary floods are generated by frontal-type rainfalls, which is the most frequent type of rainfall and produces smaller events; conversely, extraordinary floods are less frequent, more severe and mostly generated by heavy convective rainfall events. At this step, authors considered MLEs for the parameters under the homogeneity hypothesis of the catchments for GEV and TCEV. Moreover, as a global selection criterion, Akaike Information Criterion (AIC⁷¹), is evaluated. The result obtained for Beta-SM4 ($AIC=414,613$) suggests a better and a similar performance, when compared, respectively, with that for TCEV ($AIC=438,787$) and GEV ($AIC=413,667$). In order to better interpretate the obtained AIC values and to better quantify the information loss experienced by using Beta-SM4 or TCEV rather than GEV, it is convenient to rescale AIC values to the differences between each AIC value and their minimum (see e.g.⁷², pp. 270–271 and⁷³, section 2.6). By considering the frequently adopted rule of thumb for assessing the relative merits of a model, it can be concluded that the Beta-SM4 represents a good alternative to the GEV distribution (difference equal to 0.946 and therefore lower than 2), while the TCEV distribution have essentially no support (difference equal to 25.12 and therefore substantially greater than 10).

This conclusion is supported by the graphical comparison reported in an EV1 probabilistic plot (Fig. 3), from which it is clear that Beta-SM4 model performs better than TCEV and it is a valid alternative to the GEV model, especially in describing the extreme right tail. To explore more in depth the performance of these two models in describing extreme quantiles, we consider the resampling procedure reported in⁷⁴ and already recalled in²³. In this case, we follow the procedure fitting models on 1000 bootstrap samples of size 50 and extrapolating the $i-th$ extreme right-tail quantiles, corresponding to the empirical cumulative probability given by $\frac{i}{N+1}$ (where $N=397$ is the total number of observation and $i=394,395,396,397$). Figure 4 shows the sample characteristics of these extreme quantiles and the corresponding observed quantiles, depicted in red. The simulation shows that GEV model tends to overrepresent the extreme quantiles, while Beta-SM4 shows a better performance, confirming the findings already reported in²³ on different data. These preliminary evidences suggest that a regression procedure based on Beta-SM4 could be suitable for investigating the impact of some covariates on streamflow maxima distributions and its features. In particular, the reparameterization proposed in “The Model” section will allow to explore the possible effects on the median and 5-years return level for each catchment. To this end, we consider some catchments characteristics, such as the latitude of the centre of the basin (Ybar, in tens of km), the catchment area (A, in km²), the mean elevation (Hm, m.a.s.l.) and the sample values of the coefficient of L-variation of annual maxima of rainfall heigths with a duration of six hours (LCV6). We consider the reparameterization reported in (12) and since the indicators are all positive the most appropriate link function appears to be that the log-linear to relate the above mentioned characteristics to the indicators, ie $\tau =\exp (\varvec{w'}_{1} \cdot \varvec{\beta }_1)$, $me=\exp (\varvec{w'}_{2} \cdot \varvec{\beta }_2)$, $x_0=\exp (\varvec{w'}_{3} \cdot \varvec{\beta }_3)$ and $a=\exp (\varvec{w'}_{4} \cdot \varvec{\beta }_4)$.

Table 2 reports the obtained MLEs of the parameters, the corresponding standard errors (SE) and the results from Wald test (t and p value) for statistical significance of parameters related to the four indicators $I_j$ for $j=1,...4$. As we expected, the considered variables seem to have a significant influence on the median of annual streamflow maxima distribution and on return level. In particular both median and return level seem to be positively associated with Ybar, LCV6 and A, while streamflow decreases as the mean elevation Hm of basin increases. Having obtained the MLEs, it is also possible to look at the fitted distributions for each catchment $F_{Beta-SM4-I}(x;\hat{\tau }_i,\hat{me}_i,\hat{x}_{0_i},\hat{a}_i)$, for $i=1,...,14$, where the generic indicator is obtained substituting in (11) the estimates of regression coefficients and covariates values for the catchment. To obtain the MLE estimates we considered numerical procedure based on a quasi-Newton method (BFGS method implemented in R), imposing some constraints to ensure admissible results.

Observed versus estimated median and 5-years return level for different catchments is compared in Fig. 5. In this case, median values were rescaled by considering the catchment area. As it can be seen, fitted and observed values are similar, except for some catchment, confirming that this regression approach allows to adequately and simultaneously estimate substantial features of streamflow maxima in presence of heterogeneity.

Table 2 MLEs of the parameters (log-likelihood: − 2146.313).

Full size table

It is worth to note that the dependence find in the regressive structures is of general nature and not ensure a causal relationship between floods and covariates⁷⁵. The derivation of relationships intended for practical applications require interpretation from a hydrological perspective and further investigations related to the use of different descriptors, model robustness, model efficiency and associated uncertainties that are beyond the scope of this paper. Another aspect to be investigated for hydrological extremes should be the potential presence of long-range dependence or strong clustering (grouping) of similar values, or the Hurst phenomenon^{76,77,78,79,80,81}, which is quite common in natural processes. However, the annual maxima series usually tend to hide the Hurst behaviour, as explained in^20,82. In addition, for the selected case studies, this analysis cannot be easily carried out, because it would require datasets without missing data, while many “holes” are present in the investigated time series (Fig. 1, right panel). For the same reason, it clearly difficult to also evaluate the eventual existence of a compound effect, which indicates that if these catchments are close to each other, then the probability occurrence of an extreme value to one site may be overestimated if the same extreme storm event caused an extreme value in an adjacent catchment. In fact, missing data could make unreliable a multivariate analysis, as proposed in⁸³. As examples of application for specific catchments, the results for Amato at Marino, Ancinale at Razzona, Crati at Conca and Esaro at La Musica are reported in Fig. 6.

Conclusions

In this paper we proposed a new parameterization of the Beta-Singh-Maddala distribution in order to model extreme hydrologic events and simoultaneously investigate on their possible determinants. As shown in²³, this distribution is related to the Dagum distribution, recently considered in the context of analysis of hydrologic extreme events¹² and, owing to the fact that it can be viewed as a generalization of a Beta-p distribution, to other distributions frequently used in the specific literature. In presenting the general reparameterization it is assumed that there are four indicators, functions of the four parameters of the distribution, of specific interest in the hydrological field and a set of explanatory covariates. The particular case in which interest is placed on the median and on the return level is also presented. Finally, an application to a real hydrologic dataset is reported. The application results confirm that the proposed parameterization well describes the observed data and allows for an understanding on the effects of covariates on interest indicators, such as median and return level. The obtained findings suggest that the proposed reparameterization of Beta-Singh-Maddala distribution can be considered as a valid alternative to some classical models for extreme value analysis, simoultaneously allowing for a direct interpretation in terms of particular factors impact on aspects of hydrological interest.

References

Katz, R., Brush, G. & Parlange, M. Statistics of extremes: Modeling ecological disturbances. Ecology 86, 1124–1134 (2005).
Article Google Scholar
Tebaldi, C., Hayhoe, K., Arblaster, J. & Meehl, G. Going to the extremes. Clim. Change 79(3–4), 185–211 (2006).
Article ADS Google Scholar
Bobee, B. The log Pearson type 3 distribution and its application in hydrology. Water Resour. Res. 11(5), 681–689 (1975).
Article ADS Google Scholar
Johnson, N. L., Kotz, S. & Balakrishnan, N. Continuous Univariate Distributions 2nd edn, Vol. 1 (John Wiley and Sons, New York, 1994).
MATH Google Scholar
Hosking, J. R. M. & Wallis, J. R. Parameter and quantile estimation for the generalized Pareto distribution. Technometrics 29(3), 339–349 (1987).
Article MathSciNet MATH Google Scholar
Dargahi-Noubary, G. R. On tail estimation: An improved method. Math. Geol. 21(8), 829–842 (1989).
Article Google Scholar
Balakrishnan, N. & Leung, M. Y. Means, variances and covariances of order statistics, BLUEs for the Type-I generalized logistic distribution, and some applications. Commun. Stat. Simul. Comput. 17(1), 51–84 (1988).
Article MathSciNet MATH Google Scholar
Dyrrdal, A. V. Estimation of extreme precipitation in Norway and a summary of the state of the art. Report No. 08/2012, Climate, Norwegian Meteorological Institute (2012).
Bücher, A., Lilienthal, J., Kinsvater, P. & Fried, R. Penalized quasi-maximum likelihood estimation for extreme value models with application to flood frequency analysis. Extremes 24, 325–348 (2021).
Article MathSciNet MATH Google Scholar
Mujere, N. Flood frequency analysis using the Gumbel distribution. Int. J. Comput. Sci. Eng. 3(7), 2774–2778 (2011).
Google Scholar
Jeong, B. Y., Murshed, S. M., Seo, Y. A. & Park, J. S. A three-parameter kappa distribution with hydrological application: A generalized Gumbel distribution. Stoch. Environ. Res. Risk Assess. 28, 2063–2074 (2014).
Article Google Scholar
Murshed, S. M., Kim, S. & Park, J. S. Beta-k distribution and its application to hydrologic events. Stoch. Env. Res. Risk Assess. 25, 897–911 (2011).
Article MATH Google Scholar
Rossi, F., Fiorentino, M. & Versace, P. Two-component extreme value distribution for flood frequency analysis. Water Resour. Res. 20, 847–856 (1984).
Article ADS Google Scholar
Zakerzadeha, H. & Dolati, A. Generalized Lindley distribution. J. Math. Ext. 3(2), 13–25 (2009).
MathSciNet MATH Google Scholar
Domma, F. & Condino, F. The Beta-Dagum distribution: Definition and properties. Commun. Stat. Theory Methods 42(22), 4070–4090 (2013).
Article MathSciNet MATH Google Scholar
Hussain, T., Bakouch, H. S. & Iqbal, Z. A new probability model for hydrologic events: Properties and applications. Jo. Agricol. Biol. Environ. Stat. 23(1), 63–82 (2017).
Article MathSciNet MATH Google Scholar
Paranìba, P. F., Ortega, E. M. M., Cordeiro, G. M. & Pescima, R. R. The beta Burr XII distribution with application to lifetime data. Comput. Stat. Data Anal. 55(2), 1118–1136 (2011).
Article MathSciNet MATH Google Scholar
Icobellis, V. & Fiorentino, M. Derived distribution of floods based on the concept of partial area coverage with a climatic appeal. Water Resour. Res. 36(2), 469–482 (2000).
Article ADS Google Scholar
De Michele, C. A. & Salvadori, G. On the derived flood frequency distribution: Analytical formulation and the influence of antecedent soil moisture condition. J. Hydrol. 262, 245–258 (2002).
Article Google Scholar
Koutsoyiannis, D. Stochastics of Hydroclimatic Extremes—A Cool Look at Risk (Kallipos Open Academic Editions, Athens, 2021).
Google Scholar
Marani, M. & Ignaccolo, M. A metastatistical approach to rainfall extremes. Adv. Water Resour. 79, 121–126 (2015).
Article ADS Google Scholar
Zorzetto, E., Botter, G. & Marani, M. On the emergence of rainfall extremes from ordinary events. Geophys. Res. Lett. 43(15), 8076–8082 (2016).
Article ADS Google Scholar
Domma, F. & Condino, F. Use of the Beta-Dagum and Beta-Singh-Maddala distributions for modeling hydrologic data. Stoch. Environ. Res. Risk Assess. 31, 799–813 (2016).
Article Google Scholar
Chebana, F. & Ouarda, T. B. M. J. Depth and homogeneity in regional flood frequency analysis. Water Resour. Res. 44 (11) (2008).
Gruber, A. M. & Stedinger, J. R. Models of LP3 regional skew, data selection, and Bayesian GLS regression. In World Environmental & Water Resources Conference 12–16 (2008).
Micevski, T. & Kuczera, G. Combining site and regional flood information using a Bayesian Monte Carlo approach. Water Resour. Res. 45 (2009).
Gaume, E. et al. Bayesian MCMC approach to regional flood frequency analyses involving extraordinary flood events at ungauged sites. J. Hydrol. 394, 101–117 (2010).
Article ADS Google Scholar
Martel, B. et al. Regional frequency analysis of autumnal floods in the province of Quebec. Canada. Nat. Hazards 59, 681–698 (2011).
Article Google Scholar
Nezhad, M. K., Chokmani, K., Ouarda, T., Barbet, M. & Bruneau, P. Regional flood frequency analysis using residual kriging in physiographical space. Hydrol. Process. 24, 2045–2055 (2010).
Google Scholar
Nyeko-Ogiramoi, P., Willems, P., Mutua, F. & Moges, S. A. An elusive search for regional flood frequency estimates in the River Nile basin. Hydrol. Earth Syst. Sci. 16, 3149–3163 (2012).
Article ADS Google Scholar
Ahn, K. & Palmer, R. Regional flood frequency analysis using spatial proximity and basin characteristics: Quantile regression vs. parameter regression technique. J. Hydrol. 540, 515–526 (2016).
Article ADS Google Scholar
Ojha, R. & Tripathi, S. Using attributes of ungauged basins to improve regional regression equations for flood estimation: A deep learning approach. ISH J. Hydraul. Eng. 24(2), 239–248 (2018).
Article Google Scholar
Wallis, J. R., Matalas, N. C. & Slack, J. R. Just a moment!. Water Resour. Res. 10(2), 211–219 (1974).
Article ADS Google Scholar
Dalrymple T. Flood frequency analysis. U.S. geological survey. Water Supply Paper 1543-A (1960).
Smith, A., Sampson, C. & Bates, P. Regional flood frequency analysis at the global scale. Water Resour. Res. 51, 539–553 (2015).
Article ADS Google Scholar
Hosking, J. R. M. & Wallis, J. R. Some statistics useful in regional frequency analysis. Water Resour. Res. 29, 271–281 (1997).
Article ADS Google Scholar
Abida, H. & Ellouze, M. Probability distribution of flood flows in Tunisia. Hydrol. Earth Syst. Sci. 12, 703–714 (2008).
Article ADS Google Scholar
Hussain, T. & Pasha, G. Regional flood frequency analysis of the seven sites of Punjab, Pakistan, using L-moments. Water Resour. Manag. 23, 1917–1933 (2009).
Article Google Scholar
Noto, V. L. & La Loggia, G. Use of L-moments approach for regional flood frequency analysis in Sicily, Italy. Water Resour. Manag. 23, 2207–2229 (2009).
Article Google Scholar
Saf, B. Regional flood frequency analysis using L- moments for the West Mediterranean region of Turkey. Water Resour. Manag. 23, 531–551 (2009).
Article Google Scholar
Seckin, N., Haktanir, T. & Yurtal, R. Flood frequency analysis of Turkey using L-moments method. Hydrol. Process. 25, 3499–3505 (2011).
Article ADS Google Scholar
Laio, F., Ganora, D., Claps, P. & Galeati, G. Spatially smooth regional estimation of the flood frequency curve (with uncertainty). J. Hydrol. 408, 67–77 (2011).
Article ADS Google Scholar
Biondi, D., Claps, P., Cruscomagno, F., De Luca, D. L., Fiorentino, M., Ganora, D., Gioia, A., Iacobellis, V., Laio, F., Manfreda, S. & Versace, P. After the VAPI Project: Evaluation of the design maximum floods concerning the Calabria POR project (in Italian). In Proceedings of XXXIII Italian National Conference on Hydraulics and Hydraulic Engineering 10–15 September 2012, Brescia, Italy (2012).
Haddad, K. & Rahman, A. Regional flood frequency analysis in eastern Australia: Bayesian GLS regression-based methods within fixed region and ROI framework—quantile regression vs. parameters regression technique. J. Hydrol. 430–431(2012), 142–161 (2012).
Article Google Scholar
Aydoğan, D., Kankal, M. & Onsoy, H. Regional flood frequency analysis for Çoruh Basin of Turkey with L-moments approach. J. Flood Risk Manag. 9, 69–86 (2016).
Article Google Scholar
Ouarda, T. B. M. J. Hydrological frequency analysis, regional. Encycl. Environ. 3, 1311–1315 (2013).
Google Scholar
Farsadnia, F. et al. Identification of homogeneous regions for regionalization of watersheds by two-level self-organizing feature maps. J. Hydrol. 509, 387–397 (2014).
Article ADS Google Scholar
Burn, D. H., Zrinji, Z. & Kovvalchuck, M. Regionalisation of catchments for regional flood frequency analysis. J. Gydrol. Eng. 2(2), 76–82 (1997).
Google Scholar
Chebana, F. & Ouarda, T. B. M. J. Mulivariate L-moment homogeneity test. Water Resour. Res. 43, W08406 (2007).
Article ADS Google Scholar
Ouarda, T. B. M. J. et al. Data-based comparison of seasonality-based regional flood frequency methods. J. Hidrol. 330, 329–339 (2006).
Article Google Scholar
Ouarda, T. B. M. J., St-Hilaire, A. & Bobée, B. Synthgése des dévelopments récents en analyse régionale des extremes hydrologiques/A review of recent developments in regional frequency analysis of hydrological extremes. Revue des Sciences de l’eau/J Watr Sci 21, 219–232 (2008).
Article Google Scholar
Ouarda, T. B. M. J. et al. Intercomparison of regional flood frequency estimation methods at ungauged sites for a Mexican case study. J. Hydrol. 348, 40–58 (2008).
Article ADS Google Scholar
Haddad, K., Rahman, A., Zaman, M. & Shrestha, S. Applicability of Monte Carlo cross validation technique for model development and validation using generalised least squares regression. J. Hydrol. 482, 119–128 (2013).
Article ADS Google Scholar
Goyal, M. K. & Gupta, V. Identification of homogeneous rainfall regimes in Northeast Region of India using fuzzy cluster analysis. Water Resour. Manag. 28, 4491–4511 (2014).
Article Google Scholar
Saunders, K. R., Stephenson, A. G. & Karoly, D. J. A regionalisation approach for rainfall based on extremal dependence. Extremes 24, 215–240 (2021).
Article MathSciNet MATH Google Scholar
Cassalho, F. et al. Artificial intelligence for identifying hydrologically homogeneous regions: A state-of-the-art regional flood frequency analysis. Hydrol. Process. 33, 1101–1116 (2019).
Article ADS Google Scholar
Rao, A. R. & Srinivas, V. V. Introduction. In Regionalization of Watersheds: An Approach Based on Cluster Analysis (eds Rao, A. R. & Srinivas, V. V.) 1–16 (Springer Science+Business Media B.V, Dordrecht, 2008).
Google Scholar
Beskow, S. et al. Artificial intelligence techniques coupled with seasonality measures for hydrological regionalization of Q90 under Brazilian conditions. J. Hydrol. 541, 1406–1419 (2016).
Article ADS Google Scholar
Viglione, A., Laio, F. & Claps, P. A comparison of homogeneity tests for regional frequency analysis. Water Resour. Res. 43, W03428 (2007).
Article ADS Google Scholar
Stedinger, J. R. & Tasker, G. D. Regional hydrologic analysis: 1. Ordinary, weighted, and generalized least squares compared. Water Resour. Res. 21, 1421–1432 (1985).
Article ADS Google Scholar
Griffis, V. W. & Stedinger, J. R. The use of GLS regression in regional hydrologic analyses. J. Hydrol. 344, 82–95 (2007).
Article ADS Google Scholar
Chokmani, K. & Ouarda, T. Physiographical space-based kriging for regional flood frequency estimation at ungauged sites. Water Resour. Res. 40 (2004).
Skoien, J., Merz, R. & Bloschl, G. Top-kriging—geostatistics on stream networks. Hydrol. Earth Syst. Sci. 10, 277–287 (2006).
Article ADS Google Scholar
Rahman, A., Charron, C., Ouarda, T. B. M. J. & Chebana, F. Development of regional flood frequency analysis techniques using generalized additive models for Australia. Stoch. Env. Res. Risk Assess. 32, 123–139 (2018).
Article Google Scholar
Lee, T. & Ouarda, T. B. Long–term prediction of precipitation and hydrologic extremes with nonstationary oscillation processes. J. Geophys. Res. Atmos. 115(D13) (2010).
Marra, F., Armon, M., Adam, O., Zoccatelli, D., Gazal, O., Garfinkel, C. I., Rostkier–Edelstein, D., Dayan, U., Enzel, Y. & Morin, E. Toward narrowing uncertainty in future projections of local extreme precipitation. Geophys. Res. Lett. 48(5) (2021).
Ouarda, T. B., Yousef, L. A. & Charron, C. Non-stationary intensity-duration-frequency curves integrating information concerning teleconnections and climate change. Int. J. Climatol. 39(4), 2306–2323 (2019).
Article Google Scholar
Singh, S. K. & Maddala, G. A function for the size distribution and incomes. Econometrica 44, 963–970 (1976).
Article Google Scholar
Domma, F., Condino, F. & Giordano, S. A new formulation of the Dagum distribution in terms of income inequality and poverty measures. Phys. A 511, 104–126 (2018).
Article MathSciNet MATH Google Scholar
Versace, P., Ferrari, E., Gabriele, S. & Rossi, F. Valutazione delle Piene in Calabria (CNR-IRPI e GNDCI: Geodata, Cosenza, Italy, 1989). In Italian.
Google Scholar
Akaike, H. Information theory and an extension of the maximum likelihood principle. In Proceedings of the 2nd International Symposium on Information Theory (eds. Petrov, B. N. & Csaki,F.) 267–281 ( Akademiai Kiado, Budapest, 1973).
Burnham, K. P. & Anderson, D. R. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach 2nd edn. (Springer-Verlag, Berlin, 2002).
MATH Google Scholar
Burnham, K. P. & Anderson, D. R. Multimodel inference: Understanding AIC and BIC in model selection. Sociol. Methods Res. 33(2), 261–304 (2004).
Article MathSciNet Google Scholar
Wilks, D. S. Comparison of three-parameter probability distributions for representing annual extreme and partial duration precipitation series. Water Resour. Res. 29(10), 3543–3549 (1993).
Article ADS Google Scholar
Rosbjerg, D. et al. Prediction of floods in ungauged basins. In Runoff Prediction in Ungauged Basins, Synthesis across Processes Places and Scales (eds Blöschl, G. et al.) (Cambridge University Pres, Cambridge, UK, 2013).
Google Scholar
Hurst, H. E. Long-term storage capacity of reservoirs. Trans. Am. Soc. Civ. Eng. 116, 770–799 (1951).
Article Google Scholar
Beran, J., Feng, Y., Ghosh, S. & Kulik, R. Long-Memory Processes (Springer, Berlin, 2013).
Book MATH Google Scholar
Dimitriadis, P., Koutsoyiannis, D., Iliopoulou, T. & Papanicolaou, P. A global-scale investigation of stochastic similarities in marginal distribution and dependence structure of key hydrological-cycle processes. Hydrology 8(2), 59 (2021).
Article Google Scholar
Klemeš, V. The Hurst phenomenon: A puzzle?. Water Resour. Res. 10(4), 675–688 (1974).
Article ADS Google Scholar
Mandelbrot, B. B. & Wallis, J. R. Noah, Joseph, and operational hydrology. Water Resour. Res. 4(5), 909–918 (1968).
Article ADS Google Scholar
Koutsoyiannis, D. Climate change, the Hurst phenomenon, and hydrological statistics. Hydrol. Sci. J. 48(1), 3–24 (2003).
Article Google Scholar
Iliopoulou, T. & Koutsoyiannis, D. Revealing hidden persistence in maximum rainfall records. Hydrol. Sci. J. 64(14), 1673–1689 (2019).
Article Google Scholar
Serinaldi, F. & Kilsby, C. A blueprint for full collective flood risk estimation: Demonstration for European river flooding: Blueprint for Collective flood risk estimation. Risk Anal.https://doi.org/10.1111/risa.12747 (2016).
Article PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Department of Economics, Statistics and Finance ”Giovanni Anania”, University of Calabria, Arcavacata di Rende, CS, Italy
Filippo Domma & Francesca Condino
Department of Economics and Statistics, University of Siena, Siena, Italy
Sara Franceschi
Department of Informatics, Modelling, Electronics and System Engineering, University of Calabria, Arcavacata di Rende, CS, Italy
Davide Luciano De Luca & Daniela Biondi

Authors

Filippo Domma
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Condino
View author publications
You can also search for this author in PubMed Google Scholar
Sara Franceschi
View author publications
You can also search for this author in PubMed Google Scholar
Davide Luciano De Luca
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Biondi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

F.D.: study concepts; statistical analysis; interpretation of results. F.C. and S.F.: statistical analysis; interpretation of results. D.L.D.L. and D.B.: data acquisition; interpretation of results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Francesca Condino.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Domma, F., Condino, F., Franceschi, S. et al. On the extreme hydrologic events determinants by means of Beta-Singh-Maddala reparameterization. Sci Rep 12, 15537 (2022). https://doi.org/10.1038/s41598-022-19802-4

Download citation

Received: 08 July 2021
Accepted: 05 September 2022
Published: 15 September 2022
DOI: https://doi.org/10.1038/s41598-022-19802-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.