What is the probability distribution of the conditioned peak flows?

After application of the inverse NQT the conditioned peak flows are modelled through the EV1 distribution and compared to the unconditioned (observed) peak flows.

What is the standard normal quantile for each cumulative frequency?

(b) the cumulative frequency, e.g. FQmi , is computed via a Weibull plotting position, and (c) the standard normal quantile, e.g. NQmi , is obtained as the inverse of the standard normal distribution for each cumulative frequency, e.g. G−1 (FQmi ).

What is the way to compare the seasonal correlations of the catchments?

By focusing on HFS, one can notice that the catchments with higher seasonal correlation are characterized by larger catchment area; higher baseflow index and temperature with respect to the remaining catchments; and lower specific runoff, precipitation, and wetness.

What is the effect of increased wetness on seasonal memory?

In fact, their finding that increased wetness has a negative impact on seasonal memory of both high and low flows extends the above results to the seasonal scale and, interestingly, to both types of extremes.

What is the main conclusion of Kuentz et al. (2017)?

In this respect, Kuentz et al. (2017) found that topography exerts dominant controls over the flow regime in the larger European region, controlling the flashiness of flow and being a particularly important driver for other low-flow signatures too.

What are the six groups of potential drivers of seasonal correlation magnitude?

To attribute the detected correlations to physical drivers, the authors define six groups of potential drivers of seasonal correlation magnitude: basin size, flow indices, the presence of lakes and glaciers, catchment elevation, catchment geology, and hydroclimatic forcing.

(Open Access) A large sample analysis of seasonal river flow correlation and its physical drivers (2018) | Theano Iliopoulou

Q: What are the contributions mentioned in the paper "A large sample analysis of european rivers on seasonal river flow correlation and its physical drivers" ?

The authors investigate here how persistence propagates along subsequent seasons and affects low and high flows. The authors investigate the links between seasonal streamflow correlation and various physiographic catchment characteristics and hydro-climatic properties. The benefit of the suggested methodology is demonstrated by updating the frequency distribution of high and low flows one season in advance in a real-world case. Their findings suggest that there is a traceable physical basis for river memory which, in turn, can be statistically assimilated into highand low-flow frequency estimation to reduce uncertainty and improve predictions for technical purposes.

Q: What is the potential limitation of the seasonal correlations?

A potential limitation is the assumption of symmetrical extension of HFS around the peak month, along with the uniform selection of its length (3-month period).

Q: What is the description of the studied rivers?

1. A summary of the river basins under study, in terms of the selected descriptors, is also provided in Table 1, showing that the investigated rivers cover a wide range of catchment area sizes, flow regimes, and climatic conditions.

Hydrol. Earth Syst. Sci., 23, 73–91, 2019

https://doi.org/10.5194/hess-23-73-2019

the Creative Commons Attribution 4.0 License.

A large sample analysis of European rivers on seasonal river ﬂow

correlation and its physical drivers

Theano Iliopoulou

, Cristina Aguilar

, Berit Arheimer

, María Bermúdez

, Nejc Bezak

, Andrea Ficchì

6,a

Demetris Koutsoyiannis

, Juraj Parajka

, María José Polo

, Guillaume Thirel

, and Alberto Montanari

Department of Water Resources and Environmental Engineering, School of Civil Engineering,

National Technical University of Athens, Zographou, 15780, Greece

Fluvial Dynamics and Hydrology Research Group, Andalusian Institute of Earth System Research,

University of Córdoba, Córdoba, 14071, Spain

Swedish Meteorological and Hydrological Institute, 601 76 Norrköping, Sweden

Water and Environmental Engineering Group, Department of Civil Engineering,

University of A Coruña, 15071 A Coruña, Spain

Faculty of Civil and Geodetic Engineering, University of Ljubljana, Jamova 2, 1000 Ljubljana, Slovenia

Department of Geography and Environmental Science, University of Reading, Reading, RG6 6AB, UK

Vienna University of Technology, Institute of Hydraulic Engineering and Water Resources Management,

Karlsplatz 13/222, 1040 Vienna, Austria

IRSTEA, Hydrology Research Group (HYCAR), 92761, Antony, France

Department DICAM, University of Bologna, Bologna, 40136, Italy

formerly at: IRSTEA, Hydrology Research Group (HYCAR), 92761, Antony, France

Correspondence: Theano Iliopoulou (anyily@central.ntua.gr)

Received: 15 March 2018 – Discussion started: 3 April 2018

Revised: 16 November 2018 – Accepted: 6 December 2018 – Published: 7 January 2019

Abstract. The geophysical and hydrological processes gov-

erning river ﬂow formation exhibit persistence at several

timescales, which may manifest itself with the presence of

positive seasonal correlation of streamﬂow at several differ-

ent time lags. We investigate here how persistence propagates

along subsequent seasons and affects low and high ﬂows. We

deﬁne the high-ﬂow season (HFS) and the low-ﬂow season

(LFS) as the 3-month and the 1-month periods which usu-

ally exhibit the higher and lower river ﬂows, respectively. A

dataset of 224 rivers from six European countries spanning

more than 50 years of daily ﬂow data is exploited. We com-

pute the lagged seasonal correlation between selected river

ﬂow signatures, in HFS and LFS, and the average river ﬂow

in the antecedent months. Signatures are peak and average

river ﬂow for HFS and LFS, respectively. We investigate the

links between seasonal streamﬂow correlation and various

physiographic catchment characteristics and hydro-climatic

properties. We ﬁnd persistence to be more intense for LFS

signatures than HFS. To exploit the seasonal correlation in

the frequency estimation of high and low ﬂows, we ﬁt a bi-

variate meta-Gaussian probability distribution to the selected

ﬂow signatures and average ﬂow in the antecedent months

in order to condition the distribution of high and low ﬂows

in the HFS and LFS, respectively, upon river ﬂow observa-

tions in the previous months. The beneﬁt of the suggested

methodology is demonstrated by updating the frequency dis-

tribution of high and low ﬂows one season in advance in a

real-world case. Our ﬁndings suggest that there is a traceable

physical basis for river memory which, in turn, can be sta-

tistically assimilated into high- and low-ﬂow frequency es-

timation to reduce uncertainty and improve predictions for

technical purposes.

Published by Copernicus Publications on behalf of the European Geosciences Union.

74 T. Iliopoulou et al.: A large sample analysis of European rivers

1 Introduction

Recent analyses for the Po River and the Danube River high-

lighted that catchments may exhibit signiﬁcant correlation

between peak river ﬂows and average ﬂows in the previ-

ous months (Aguilar et al., 2017). Such correlation is the

result of the behaviours of the physical processes involved

in the rainfall–runoff transformation that may induce mem-

ory in river ﬂows at several different timescales. The pres-

ence of long-term persistence in streamﬂow has been known

for a long time, since the pioneering works of Hurst (1951),

and has been actively studied ever since (e.g. Koutsoyian-

nis, 2011; Montanari, 2012; O’Connell et al., 2016 and refer-

ences therein). While a number of seasonal ﬂow forecasting

methods have been explored in the literature (e.g. Bierkens

and van Beek, 2009; Dijk et al., 2013), attempts to explic-

itly exploit streamﬂow persistence in seasonal forecasting

through information from past ﬂows have been, in general,

limited. Koutsoyiannis et al. (2008) proposed a stochastic ap-

proach to incorporate persistence of past ﬂows into a predic-

tion methodology for monthly average streamﬂow and found

the method to outperform the historical analogue method (see

also Dimitriadis et al., 2016, for theory and applications of

the latter) and artiﬁcial neural network methods in the case

of the Nile River. Similarly, Svensson (2016) assumed that

the standardized anomaly of the most recent month will not

change during future months to derive monthly ﬂow fore-

casts for 1–3 months lead time and found the predictive skill

to be superior to the analogue approach for 93 UK catch-

ments. The above-mentioned persistence approach has also

been used operationally in the production of seasonal stream-

ﬂow forecasts in the UK since 2013, within the framework of

the Hydrological Outlook UK (Prudhomme et al. 2017). A

few other studies have included past ﬂow information in pre-

diction schemes along with teleconnections or other climatic

indices (Piechota et al., 2001; Chiew et al., 2003; Wang et al.,

2009). Recently, it was shown that streamﬂow persistence,

revealed as seasonal correlation, may also be relevant for pre-

diction of extreme events by allowing one to update the ﬂood

frequency distribution based on river ﬂow observations in the

pre-ﬂood season and reduce its bias and variability (Aguilar

et al., 2017). The above previous studies postulated that sea-

sonal streamﬂow correlation may be due to the persistence

of the catchments storage and/or the weather, but no attempt

was made to identify the physical drivers.

The present study aims to further inspect seasonal persis-

tence in river ﬂows and its determinants, by referring to a

large sample of catchments in six European countries (Aus-

tria, Sweden, Slovenia, France, Spain, and Italy). We focus

on persistence properties of both high and low ﬂows by in-

vestigating the following research questions: (i) what are the

physical conditions, in terms of catchment properties, i.e. ge-

ology and climate, which may induce seasonal persistence in

river ﬂow, and (ii) can ﬂoods and droughts be predicted, in

probabilistic terms, by exploiting the information provided

by average ﬂows in the previous months? These questions

are relevant for gaining a better comprehension of catchment

dynamics and planning mitigation strategies for natural haz-

ards. To reach the above goals, we identify a set of descrip-

tors for catchment behaviours and climate and inspect their

impact on correlation magnitude and predictability of river

ﬂows.

A few studies have analysed physical drivers of streamﬂow

persistence on annual and deseasonalized monthly and daily

time series (Mudelsee, 2007; Hirpa et al., 2010; Gudmunds-

son et al., 2011; Zhang et al., 2012; Szolgayova et al., 2014;

Markonis et al., 2018), but the topic has been less studied on

intra-annual scales relevant to seasonal forecasting of ﬂoods

and droughts.

To demonstrate the high practical relevance of the identi-

ﬁed seasonal correlations we present a technical experiment

for one of the studied rivers (Sect. 7) in which the frequency

distribution of both high and low ﬂows is updated one season

in advance by exploiting real-time information on the state

of the catchment.

2 Methodology

The investigation of the persistence properties of river ﬂows

focuses separately on both high and low discharges and is

articulated in the following steps: (a) identiﬁcation of the

high- and low-ﬂow seasons, (b) correlation assessment be-

tween the peak ﬂow in the high-ﬂow season (average ﬂow

in the low-ﬂow season) and average ﬂows in the previous

months, (c) analysis of the physical drivers for streamﬂow

persistence and its predictability through a principal compo-

nent analysis (PCA), and (d) real-time updating of the fre-

quency distribution of high and low ﬂows for a selected case

study with signiﬁcant seasonal correlation by employing a

meta-Gaussian approach. The above steps are described in

detail in the following sections.

2.1 Season identiﬁcation

Season identiﬁcation is performed algorithmically to identify

the high-ﬂow season (HFS) and low-ﬂow season (LFS) for

each river time series. For the estimation of HFS, we employ

an automated method recently proposed by Lee et al. (2015),

which identiﬁes the high-ﬂow season as the 3-month period

centred around the month with the maximum number of oc-

currences of peaks over threshold (POT), with the thresh-

old set to the highest 5 % of the daily ﬂows. To evaluate

the selection of HFS, a metric constructed as the percentage

of annual maximum ﬂows (PAMF) captured in the HFS is

used. The PAMFs are classiﬁed in the subjective categories

of “poor” (< 40 %), “low” (40 %–60 %), “medium” (60 %–

80 %), and “high” (> 80 %) values, denoting the probability

that the identiﬁed HFS is the dominant high-ﬂow season in

the record. If the identiﬁed peak month alone contains more

Hydrol. Earth Syst. Sci., 23, 73–91, 2019 www.hydrol-earth-syst-sci.net/23/73/2019/

T. Iliopoulou et al.: A large sample analysis of European rivers 75

than or equal to 80 % of the annual maxima ﬂows, a unimodal

regime is assumed and the identiﬁcation procedure is termi-

nated. In all other cases, the method allows for the search of

a second peak month and the identiﬁcation of a minor HFS,

but we do not further elaborate on this analysis here, because

we are only interested in the most extreme seasons for the

purpose of predicting high and low ﬂows.

The method proposed by Lee et al. (2015) has several ad-

vantages that make it suitable for the purpose of this research.

Most importantly, it is capable of handling conditions of bi-

modality, which is usually a major issue for traditional meth-

ods, e.g. directional statistics (Cunderlik et al., 2004). A po-

tential limitation is the assumption of symmetrical extension

of HFS around the peak month, along with the uniform selec-

tion of its length (3-month period). The degree of subjectiv-

ity in the evaluation of the second HFS is another limitation,

which is not relevant here, as we focus on the main HFS.

The LFS is herein identiﬁed as the 1-month period with

the lowest amount of mean monthly ﬂow. An alternative ap-

proach of estimating the relative frequencies of annual min-

ima of monthly ﬂow and selecting the month with the highest

frequency as the LFS is also considered.

2.2 Correlation analysis and physical interpretation

through principal component analysis

2.2.1 Correlation analysis

In the case of HFS, a correlation is sought between the max-

imum daily ﬂow occurring in the HFS period and the mean

ﬂow in the previous months, before the onset of HFS. For

LFS, correlation is computed between the mean ﬂow in the

LFS itself and the mean ﬂow in the previous months. We use

the mean ﬂow in the previous month as a robust proxy of

“storage” in the catchment that is expected to reﬂect the state

of the catchment, i.e. wetter or drier than usual. Since we are

interested in seasonal persistence, we compute the Pearson’s

correlation coefﬁcient for HFS lag up to 9 months and for

LFS lag up to 11 months.

2.2.2 Analysis of physical drivers

Catchment, geological, and climatic descriptors

An extensive investigation is carried out to identify physical

drivers of seasonal streamﬂow correlation, in terms of catch-

ment, geological, and climatic descriptors.

As catchment descriptors, we consider the basin area (A),

the baseﬂow index (BI), the mean speciﬁc runoff (SR), the

percentage of basin area covered by lakes (percentage of

lakes – PL) and glaciers (percentage of glaciers – PG), and

altitude as candidates for explanatory variables for stream-

ﬂow correlation.

The area A (km

) is primarily investigated, as it is repre-

sentative of the scale of the catchment, under the assumption

that in larger basins the impact of the climatological and geo-

physical processes affecting river ﬂow becomes more signif-

icant and may lead to a magniﬁed seasonal correlation.

The BI is considered based on the assumption that high

groundwater storage may be a potential driver of correla-

tion. BI is calculated from the daily ﬂow series of the rivers

following the hydrograph separation procedure detailed in

Gustard et al. (2008). Flow minima are sampled from non-

overlapping 5-day blocks of the daily ﬂow series, and turn-

ing points in the sequence of minima are sought and identi-

ﬁed when the 90 % value of a certain minimum is smaller or

equal to its adjacent values. Subsequently, linear interpola-

tion is used in between the turning points to obtain the base-

ﬂow hydrograph. The BI is obtained as the ratio of the vol-

ume of water beneath the baseﬂow separation curve versus

the total volume of water from the observed hydrograph, and

an average value is computed over all the observed hydro-

graphs for a given catchment. A low index is indicative of

an impermeable catchment with rapid response, whereas a

high value suggests high storage capacity and a stable ﬂow

regime.

SR (m

−1

−2

) is computed as the mean daily ﬂow

of the river standardized by the size of its basin area. It

may be an important physical driver, as it is an indica-

tor of the catchment’s wetness. PL (%) and PG (%) are

investigated for the Swedish and Austrian catchments, re-

spectively, as lakes and glaciers are expected to increase

catchment storage thus affecting persistence. Lake cover-

age data are based on cartography and are available from

the Swedish Water Archive (https://www.smhi.se/, last ac-

cess: 1 November 2016), while glacier coverage data are

estimated from the CORINE land cover database (https:

//www.eea.europa.eu/publications/COR0-landcover, last ac-

cess: 6 November 2016).

The effect of catchment altitude is also inspected us-

ing relief maps from the Shuttle Radar Topography Mis-

sion (SRTM) data (http://srtm.csi.cgiar.org/, last access:

28 July 2017). The data are available for the whole globe and

are sampled at 3 arcsec resolution (approximately 90 m). To-

pographic information is available for all catchments located

at latitudes lower than 60

◦

north, while a 1 km resolution dig-

ital elevation model is available for Austria.

As geological descriptors we consider the percentage of

catchment area with the presence of ﬂysch (percentage of ﬂy-

sch – PF) and karstic formations (percentage of karst – PK)

for Austrian and Slovenian catchments, respectively, where

this type of information is available. A subset of Austrian

catchments is characterized by the dominant presence of ﬂy-

sch, a sequence of sedimentary rocks characterized by low

permeability, which is known to generate a very fast ﬂow

response. Karstic catchments, characterized by the irregular

presence of sinkholes and caves, are also known for having

rapid response times and complex behaviour; e.g. initiating

fast preferential groundwater ﬂow and intermittent discharge

via karstic springs (Ravbar, 2013; Cervi et al., 2017). Ge-

ological features are also presumed to be linked to persis-

www.hydrol-earth-syst-sci.net/23/73/2019/ Hydrol. Earth Syst. Sci., 23, 73–91, 2019

76 T. Iliopoulou et al.: A large sample analysis of European rivers

tence properties, because geology is the main control for the

baseﬂow index across the European continent (Kuentz et al.,

2017). PK (%) and PF (%) are estimated from geological

maps of Slovenia and Austria, respectively.

As climatic descriptors, the mean annual precipitation P

(mm year

−1

) and the mean annual temperature T (

◦

C) are

selected. Corresponding gridded data are retrieved from the

WorldClim database (http://www.worldclim.org/, last access:

20 March 2017) at a spatial resolution of 10 arcminutes (ap-

proximately 18.55 km). We note that low mean temperature

regimes are also associated with snow, the presence of which

is also considered in the interpretation of the results. We also

adopt the De Martonne index (IDM; De Martonne, 1926) as

a climatic descriptor, which is given by IDM = P /(T + 10)

and enables classiﬁcation of a region into one of the fol-

lowing six climate classes, i.e. arid (IDM ≤ 5), semi-arid

(5 < IDM ≤ 10), dry subhumid (10 < IDM ≤ 20), wet subhu-

mid (20 < IDM ≤ 30), humid (30 < IDM ≤ 60), and very hu-

mid (IDM ≥ 60). Additionally, the Köppen–Geiger climatic

classiﬁcation (Kottek et al., 2006) of the rivers is assessed.

Principal component analysis

To identify which catchment, physiographic, and climatic

characteristics may explain river memory, we attempt to

regress the seasonal streamﬂow correlation on the physical

descriptors introduced above. We expect the presence of mul-

ticollinearity among the predictor variables, and therefore

PCA (Pearson, 1901; Hotelling, 1933) was applied to con-

struct uncorrelated explanatory variables. In essence, PCA

is an orthonormal linear transformation of p data variables

into a new coordinate system of q ≤ p uncorrelated variables

(principal components – PCs) ordered by decreasing degree

of variance retained when the original p variables are pro-

jected into them (Jolliffe, 2002). Therefore, the ﬁrst princi-

pal axis contains the greatest degree of variance in the data,

while the second principal axis is the direction which max-

imizes the variance among all directions orthogonal to the

ﬁrst principal axis, and each succeeding component in turn

has the highest variance possible while satisfying the condi-

tion of orthogonality to the preceding components. Speciﬁ-

cally, let x be a random vector with mean µ and correlation

matrix 6, and the principal component transformation of x

is then obtained as follows:

y = C

, (1)

where y is the transformed vector whose kth column is the

kth principal component (k = 1, 2, . .. , p), C is the p × p

matrix of the coefﬁcients or loadings for each principal com-

ponent, and x

is the standardized x vector. Standardization is

applied in order to avoid the impact of the different variable

units on selecting the direction of maximum variance when

forming the PCs. The y values are the scores of each obser-

vation, i.e. the transformed values of each observation of the

original p variables in the kth principal component direction.

PCA has useful descriptive properties of the underlying

structure of the data. These properties can be efﬁciently vi-

sualized in the biplot (Gabriel, 1971), which is the combined

plot of the scores of the data for the ﬁrst two principal com-

ponents along with the relative position of the p variables as

vectors in the two-dimensional space. Herein, the distance

biplot type (Gower and Hand, 1995), which approximates

the Euclidean distances between the observations, is used.

Variable vector coordinates are obtained by the coefﬁcients

of each variable for the ﬁrst two principal components. After

construction of the PCs, a linear regression model is explored

for the case of HFS and LFS lag-1 correlation.

2.3 Technical experiment: real-time updating of the

frequency distribution of high and low ﬂows

In order to evaluate the usefulness of the information pro-

vided by the 1-month-lag seasonal correlation for ﬂow signa-

tures in HFS and LFS, we perform a real-time updating of the

frequency distribution of high and low ﬂows based on the av-

erage river ﬂow in the previous month. A similar analysis for

the high ﬂows was carried out by Aguilar et al. (2017) for the

Po and Danube Rivers. In principle, this is a data assimila-

tion approach, since real-time information, i.e. observations

of the average river ﬂow, is used in order to update a prob-

abilistic model and inform the forecast of the ﬂow signature

of the upcoming season.

In detail, a bi-variate meta-Gaussian probability distribu-

tion (Kelly and Krzysztofowicz, 1997; Montanari and Brath,

2004) is ﬁtted between the observed ﬂow signatures, i.e. peak

ﬂow in the HFS, Q

, average ﬂow in the LFS, Q

, and the

average ﬂow in the pre-HFS and LFS months, Q

. The peak

HFS ﬂow and the average LFS ﬂow are the dependent vari-

ables and are extracted as the peak river discharge observed

in the previously identiﬁed HFS and the average river dis-

charge observed in the previously identiﬁed LFS, respec-

tively. The average ﬂow in the month preceding the HFS and

the LFS is the explanatory variable in both cases. In the fol-

lowing, random variables are denoted by an underscore and

their outcomes are written in plain form.

The normal quantile transform (NQT; Kelly and Krzyszto-

fowicz, 1997) is used in order to make the marginal probabil-

ity distribution of dependent and explanatory variables Gaus-

sian. This is achieved as follows: (a) the sample quantiles Q

are sorted in increasing order, e.g. Q

. . . Q

, (b) the

cumulative frequency, e.g. FQ

, is computed via a Weibull

plotting position, and (c) the standard normal quantile, e.g.

, is obtained as the inverse of the standard normal dis-

tribution for each cumulative frequency, e.g. G

−1

(FQ

Therefore, all sample quantiles are discretely mapped into

the Gaussian domain. To get the inverse transformation for

any normal quantile, we connect the points in the above map-

ping with linear segments. The extreme segments are ex-

tended to allow extrapolation outside the range covered by

the observed sample.

Hydrol. Earth Syst. Sci., 23, 73–91, 2019 www.hydrol-earth-syst-sci.net/23/73/2019/

T. Iliopoulou et al.: A large sample analysis of European rivers 77

In the Gaussian domain, a bi-variate Gaussian distribu-

tion is ﬁtted between the random explanatory variable NQ

and the dependent variables NQ

and NQ

by assuming the

stationarity and ergodicity of the variables. We deﬁne the

generic random variable NQ

to represent any dependent

ﬂow signature, i.e.; NQ

and NQ

in our case. Then, the pre-

dicted signature at time t can be written as

(t) = ρ(NQ

,NQ

)NQ

(t − h) + Nε(t), (2)

where ρ(NQ

, NQ

) is the Pearson’s cross-correlation coef-

ﬁcient between NQ

and NQ

, h is the selected correlation

lag with h = 1 in the present application, and Nε(t) is an

outcome of the stochastic process Nε, which is independent,

homoscedastic, stochastically independent of NQ

, and nor-

mally distributed with zero mean and variance 1 − ρ

(NQ

). Then, the joint bi-variate Gaussian probability dis-

tribution function is deﬁned by the mean (µ(NQ

) = 0

and µ(NQ

) = 0), the standard deviation (σ (NQ

) = 1 and

σ (NQ

) = 1) of the standardized normalized series, and the

Pearson’s cross-correlation coefﬁcient between the normal-

ized series, ρ(NQ

, NQ

). From the Gaussian bi-variate

probability properties, it follows that for any observed

(t − h) the probability distribution function of NQ

(t)

conditioned on NQ

is Gaussian, with parameters given by

µ(NQ

(t)) = ρ(NQ

,NQ

)NQ

(t − h), (3)

σ (NQ

(t)) = (1 − ρ

(NQ

,NQ

))

0.5

. (4)

To derive the probability distribution of Q

(t) conditioned

to the observed Q

(t − h), we ﬁrst apply the inverse NQT,

i.e. we use linear segments to connect the points of the pre-

vious discrete quantile mapping of the original quantiles into

the Gaussian domain, and accordingly, obtain Q

(t) for any

(t). Subsequently, we estimate the parameters of an as-

signed probability distribution for the obtained quantiles in

the untransformed domain. This is referred to as the up-

dated probability distribution of the considered ﬂow signa-

ture (NQ

and NQ

, in our case). We use the extreme value

type I distribution for the peak ﬂows and calculate the differ-

ences in the magnitude of estimated maxima for a given re-

turn period between the unconditioned and the updated distri-

bution. The latter is conditioned by the 95 % sample quantile

of the observed mean ﬂow in the previous month. To model

the low ﬂows we use the log-normal distribution, which was

found to exhibit the best ﬁt for the river in question among

other typical candidates for average ﬂows, i.e. the Weibull

and Gamma distribution. The low ﬂows are conditioned by

the lower 5 % sample quantile of the observed mean ﬂow in

the previous month.

3 Data and catchment description

The dataset includes 224 records spanning more than

50 years of daily river ﬂow observations from gauging sta-

tions, mostly from non-regulated streams. A few catchments

are impacted by regulation. Among the 224 rivers, 108 are

located in Austria, 69 in Sweden, 31 in Slovenia, 13 in

France, two in Spain, and one in Italy. Catchment areas vary

signiﬁcantly, the largest being the Po River basin in Italy

(70 091 km

) and the smallest being the Hallabäcken River

basin in Sweden (4.7 km

). The geographical location of the

river gauge stations as well as their climatic classiﬁcation are

shown in Fig. 1. Most of the examined rivers belong to either

a warm temperate (C) or a boreal or snow climate (D) with

a subset impacted by polar climatic conditions (E), accord-

ing to the updated world map of the Köppen–Geiger climate

classiﬁcation (Fig. 1) based on gridded temperature and pre-

cipitation data for the period 1951–2000 (Kottek et al., 2006).

More speciﬁcally, the majority of French and Slovenian and

approximately one third of the Swedish basins belong to the

warm temperate Cfb category characterized by precipitation

distributed throughout the year (fully humid) and warm sum-

mers. The rest of the Swedish catchments are impacted by a

Dfc climatic type, i.e. a snow climate, fully humid with cool

summers. The Austrian catchments belonging to the region

impacted by the European Alps have the most complicated

regime due to their topographic variability. At the lowest al-

titudes, Cfb is the prevailing regime, but as proximity to the

Alps increases, a Dfc regime dominates, and progressively, in

the highest altitude basins, the climate becomes a polar tun-

dra type (Et), characterized primarily by the very low temper-

atures present. The characteristics of all the climatic regimes

of the studied rivers are given in the legend of Fig. 1. A sum-

mary of the river basins under study, in terms of the selected

descriptors, is also provided in Table 1, showing that the in-

vestigated rivers cover a wide range of catchment area sizes,

ﬂow regimes, and climatic conditions.

It is relevant to note that 16 of the Austrian rivers are sub-

ject to regulation, which may alter the persistence proper-

ties of river ﬂows. This relates to generally “mild” forms of

regulation, i.e. upstream regulation with a very low degree

of ﬂow attenuation, hydropower operations, and ﬂow diver-

sions to and from the basin. A preliminary examination of

these rivers did not reveal any signiﬁcant change during time

of the ﬂow regime. The presence of regulation does not pre-

clude the exploitation of correlation for predicting river ﬂows

in probabilistic terms, but it may affect the analysis of phys-

ical drivers, as it may enhance or reduce persistence in the

natural river ﬂow regime. Given that detailed information is

generally lacking on the impact of regulation (Kuentz et al.

2017), we assume stationarity of the river ﬂows for all the

catchments herein considered and, additionally, assume that

river management does not signiﬁcantly affect the identiﬁca-

tion of the physical drivers.

www.hydrol-earth-syst-sci.net/23/73/2019/ Hydrol. Earth Syst. Sci., 23, 73–91, 2019

A large sample analysis of seasonal river flow correlation and its physical drivers

Figures

Citations

Understanding Hydrologic Variability across Europe through Catchment Classification

Selecting the Probability Distribution of Cone Tip Resistance Using Moment Ratio Diagram for Soil in Nasiriyah

References

World Map of the Köppen-Geiger climate classification updated

Long-Term Storage Capacity of Reservoirs

The Proof and Measurement of Association Between Two Things

The biplot graphic display of matrices with application to principal component analysis

Practical Nonparametric Statistics; Nonparametric Statistical Inference

Related Papers (5)

Defining high-flow seasons using temporal streamflow patterns from a global model

Flow Regime Classification and Hydrological Characterization: A Case Study of Ethiopian Rivers

Regionalization of patterns of flow intermittence from gauging station records

Linking Flood Frequency To Long-term Water Balance: Incorporating Effects of Seasonality

Flood risk reduction and flow buffering as ecosystem services: A flow persistence indicator for watershed health

Frequently Asked Questions (10)

Q1. What are the contributions mentioned in the paper "A large sample analysis of european rivers on seasonal river flow correlation and its physical drivers" ?

Q2. How is the selection of a seasonal metric used?

Q3. What is the potential limitation of the seasonal correlations?

Q4. What is the probability distribution of the conditioned peak flows?

Q5. What is the standard normal quantile for each cumulative frequency?

Q6. What is the description of the studied rivers?

Q7. What is the way to compare the seasonal correlations of the catchments?

Q8. What is the effect of increased wetness on seasonal memory?

Q9. What is the main conclusion of Kuentz et al. (2017)?

Q10. What are the six groups of potential drivers of seasonal correlation magnitude?