Академический Документы
Профессиональный Документы
Культура Документы
Journal of Hydrology
journal homepage: www.elsevier.com/locate/jhydrol
a r t i c l e i n f o s u m m a r y
Article history: Streamflow forecasting, especially the long lead-time forecasting, is still a very challenging task in hydro-
Received 30 November 2015 logic modeling. This could be due to the fact that the forecast accuracy measured in terms of both the
Received in revised form 4 February 2016 amplitude and phase or temporal errors and the forecast precision/reliability quantified in terms of the
Accepted 23 February 2016
uncertainty significantly deteriorate with the increase of the lead-time. In the model performance eval-
Available online 2 March 2016
This manuscript was handled by Andras
uation, the conventional error metrics, which primarily quantify the amplitude error and do not explicitly
Bardossy, Editor-in-Chief, with the account for the phase error, have been commonly adopted. For the long lead-time forecasting, the wave-
assistance of Fi-John Chang, Associate Editor let based neural network (WNN) among a variety of advanced soft computing methods has been shown
to be promising in the literature. This paper presented and compared WNN and artificial neural network
Keywords: (ANN), both of which were combined with the ensemble method using block bootstrap sampling (BB), in
Artificial neural network terms of the forecast accuracy and precision at various lead-times on the Bow River, Alberta, Canada.
Block bootstrap Apart from conventional model performance metrics, a new index, called percent volumetric error,
Forecast accuracy was proposed, especially for quantifying the phase error. The uncertainty metrics including percentage
Forecast precision of coverage and average width were used to evaluate the precision of the modeling approaches. The
Streamflow forecast results obtained demonstrate that the WNN-BB consistently outperforms the ANN-BB in both the cate-
Wavelet neural network
gories of the forecast accuracy and precision, especially in the long lead-time forecasting. The findings
strongly suggest that the WNN-BB is a robust modeling approach for streamflow forecasting and thus
would aid in flood management.
Ó 2016 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.jhydrol.2016.02.044
0022-1694/Ó 2016 Elsevier B.V. All rights reserved.
162 K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173
lead-time forecasting. Since then, a large number of studies have the entire hydrograph rather than focusing on the peak flow. For
applied WNN for both the long and short lead-time flow forecast- flood management purposes, a week (7-day) of lead-time is
ing (e.g., Shiri and Kisi, 2010). Many studies have demonstrated required to issue flood awareness, activate mitigation measures,
that WNNs often produce more consistent and accurate results and evacuate most vulnerable groups (Golding, 2009). Therefore,
when compared to the traditional ANNs (Nourani et al., 2009; the lead-times up to 7-day are investigated in the paper.
Adamowski and Sun, 2010; Seo et al., 2015). Furthermore, several
studies have quantified uncertainty in wavelet based flood fore-
2. Methodology
casting aiming to improve the model reliability (Tiwari and
Chatterjee, 2010; Sang et al., 2015).
In this paper, the streamflow forecasting is conducted using
In streamflow forecasting, the model accuracy, in general, deteri-
both WNN and ANN. The models are developed for forecasting
orates with the increase of the lead-time, which can be attributed to
streamflow at several lead-times including 1-day, 3-day, 5-day
the weak dependence between the modeled variable and input(s).
and 7-day. In the model development, BB is employed to sample
The forecasting error is generally classified into three categories
data to generate an ensemble of models. The following sections
including amplitude error, phase/temporal error and shape error
briefly describe the methods of ANN, WNN, and BB, respectively.
(Prakash et al., 2014). The amplitude error is mainly caused by the
presence of noise in input data or due to deficiencies in the model
structure. This would lead to either overestimation or underestima- 2.1. Artificial neural network
tion of streamflow (Shamseldin and O’Connor, 2001). The phase error
indicates the lag in timing of the simulated hydrograph, which is crit- An ANN is characterized as massively parallel interconnections
ical in flow forecasting (Prakash et al., 2014). The shape error is of simple neurons that function as a collective system. The network
mainly determined by the rate of flow change in the rising and falling topology consists of a set of nodes (neurons) connected by links that
limbs of a hydrograph. The evaluation of the model performance (in are usually organized in a number of layers. Each node in a layer
terms of the model accuracy) has often been conducted using the receives and processes weighted input from previous layer and
intuitive graphical representation as well as statistical measures transmits its output to nodes in the following layer through links.
such as root mean square error (RMSE) and Nash–Sutcliffe coefficient Each link is assigned a weight, which is a numerical estimate of
(EI), which are objective and quantitative in nature but without con- the connection strength. The weighted summation of inputs to a
sidering the temporal dimension of time series (i.e., hydrograph). In a node is converted to an output according to a transfer function (typ-
few studies, the temporal error, but only limited to the peak flow ically a sigmoid function). Most ANNs have three or more layers: an
rather the entire hydrograph, has been focused aiming to improve input layer, which is used to feed data to the networks; an output
the prediction accuracy of peak flow in terms of both magnitude layer, which is used to produce an appropriate response to the given
and/or timing (e.g., Liu et al., 2011). In such a situation, the model inputs; and one or more hidden layers, which are used to act as a
may not warrant the best possible solution when there are multiple collection of feature detectors. The multilayer perceptron (MLP)
peaks as the model performance often biases toward a particular network is one of the most popular ANN architectures in use today
peak flow. Furthermore, several researchers have made effort to (Maier et al., 2010). The plethora of works that used ANN has been
reduce the phase error by modifying the modeling approaches. For reported in the field of hydrology and water resources. Unfortu-
instance, Abrahart et al. (2007) applied correction factor when cali- nately, to date, there are no systematic rules for ANN training to
brating ANN models to minimize the phase error; however they con- determine the optimal network weights and biases. Among many
cluded that this method is only effective for the short lead-time available training techniques, this paper used the Levenberg–
forecasting but not for the long lead-time forecasting. Marquardt optimization algorithm to train the neural networks.
Besides, the model precision, often assessed in terms of the pre-
diction uncertainty, degrades as the lead-time of forecasting 2.2. Wavelet based neural network
increases (Kasiviswanathan et al., 2013). In the long lead-time fore-
casting, several possible reasons including the decrease of the Wavelet analysis is a signal processing technique employed for
influence of input(s) on the output, increased ranges of model the purpose of extracting the useful information in data series,
parameters, and inadequate model structure could lead to the which are either stationary or non-stationary. The advantage of
increase of prediction uncertainty. However to present, most stud- wavelet analysis lies in producing high resolution information of
ies have concentrated on discussing the model accuracy, but not data series in both time and frequency domains, which is not
the model precision except few (e.g., Alvisi and Franchini, 2011; otherwise available in other transformations. The wavelet trans-
Kasiviswanathan et al., 2013). form has been shown to be a more effective tool than the Fourier
In view of the above, the primary objective of this paper is to transform when dealing with non-stationary hydrologic time ser-
identify the robust modeling approach from two most popular ies (Partal and Kisßi, 2007). The width of the wavelet changes with
data-driven methods, namely ANN and WNN, especially for the each spectral component in the wavelet transform; whereas the
long lead-time forecasting, through assessing both modeling accu- Fourier transform has a constant width. The wavelet function gen-
racy and precision. To the authors’ best knowledge, no research erates the signal to be analyzed and the signal transform is com-
articles that compare ANN and WNN for forecasting from both puted for each segment generated. The scaling and shifting are
aspects of model accuracy and precision to present. The ensemble the two main parameters used in the wavelet transform to decom-
modeling approach has been demonstrated to have the capability pose the original time series signal into different resolutions. The
to improve the model accuracy in hydrologic modeling (Tiwari large-scale signal provides the detailed information, while the
and Chatterjee, 2010). Therefore, the combination of the data- small-scale signal compresses the original signal in order to pre-
driven modeling approaches and the block bootstrap (BB) sampling serve the global information about the signal (Cannas et al.,
method (named as ANN-BB and WNN-BB) are used and compared 2006). The time and frequency domains are responsible to main-
in a real application using data collected from the Bow River, tain the localization property.
Alberta, Canada. The advantage of BB lies in preserving the correla- This paper used the discrete wavelet transform (DWT) as the
tion and periodicity inherent in time series data (Ebtehaj et al., CWT necessitates a large amount of computation time and
2010). In addition, a new index, called percent volume error resources. The proper mother wavelet should be identified prior
(PVE), is proposed to quantify the phase error and it considers to analyzing the input time series signals when using DWT. In
K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173 163
the paper, the modified form of DWT, which is derived from CWT, (b) Determine the moving length (k) (i.e., 1 year) using the auto-
in Adamowski and Sun (2010) was adopted. The DWT operates two correlation. The moving length is then fixed throughout the
sets of function and consequently, the original time series data are iteration.
processed through two types of filters such as high-pass and low- (c) Initialize the block length ‘l’ (i.e., 1 year).
pass. The high pass filters are further separated into different levels (d) Sample the input–output from the entire calibration data
of details depending on the time scale required. The low pass filters set.
comprise the trend presented in the actual input time series signal (e) Train the model (here, WNN or ANN model) use the samples
and are named as approximation. More details of the mathematical and store the model parameters (i.e. weights and biases).
formulations of DWT are provided in Mallat (1989). (f) Evaluate the model performance by estimating RMSE and
A WNN has the architecture similar to that of the standard ANN, associated variability in the form of standard deviation of
but it is fed by the decomposed signals instead of the actual time ensemble mean.
series data. Therefore, the model development approach for WNN (g) Increase the block size (i.e. l = l + Dl, i.e., where, Dl = 1 year).
is same as that for ANN. (h) Repeat the steps from (d) to (f) and compute RMSE (updated
RMSE) of the ensemble mean and standard deviation.
2.3. Block bootstrap method (i) Continue the steps (g) to (h) if the updated RMSE is less than
the RMSE estimated in previous step (f); otherwise store the
In the deterministic modeling approach, the model parameters block length ‘l’ and estimate model performance and uncer-
are determined assuming that they represent the time invariant tainty indices such as percentage of coverage (POC) and
properties of a process to be modeled. However due to the uncer- average width (AW) as described by Eqs. (9) and (10) in
tainty mentioned above, the model parameters should be treated the following section.
as non-deterministic parameters, which consequently are
required to be contained within certain ranges. One of 3. Model performance evaluation
approaches for quantifying such ranges is the probabilistic
approach, in which the assumption of the prior distribution of The models developed are evaluated in terms of both the model
model parameters is required. In practice, it is however very dif- accuracy and the model precision using various metrics.
ficult, if it is not possible, to make such an assumption with sta-
tistical justification for the distribution selection. Furthermore, 3.1. Evaluation of model accuracy
the assumption might lead to poor convergence in the posterior
distribution of model parameters (Adkison and Peterman, The model results are evaluated with regard to accuracy using
1996). Alternatively, bootstrap resampling method, which can various error metrics including coefficient of determination (R2), EI,
avoid the drawback of the probabilistic approach, has been RMSE, mean bias error (MBE). In addition to these indices, persistence
applied in the model calibration to determine parameter ranges. index (PI) and extrapolation index (EXI) are also included. PI com-
Despite a variety of bootstrap resampling methods available in pares the model forecast with the forecast obtained by assuming that
the literature, the BB is utilized to sample the input–output pat- the process to be modeled is a Wiener process (variance increasing
terns for calibrating the models as the BB has an advantage of linearly with time), in which the best estimate for the future is given
preserving the correlation structure exhibited in time series data by the latest measurement (Kitanidis and Bras, 1980). EXI assesses
(Ebtehaj et al., 2010). whether the model just extrapolates the values from the recent mea-
The BB essentially requires two parameters including block surements. All selected indices (except the mean bias error and coef-
length and moving length. The block length refers to a total num- ficient of extrapolation) also can be found in Crochemore et al.
ber of data that each block contains and the moving length is the (2015), which summarizes various error metrics for evaluating mod-
overlap of the data between two blocks. In order to determine eled hydrograph. These indices are calculated by:
these two parameters, different approaches have been proposed 8 92
>
> h i >
>
in literature (Hall et al., 1995; Politis and White, 2004). The optimal >
> P >
>
< n
i¼1 y o
i
y o
y i
f
y f =
block length can be determined based on model performance (i.e. 2
R ¼ sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ð1Þ
> X f 2>
> Pn yo y
> >
n
RMSE) so that the model produces less error. In hydrologic model- >
>
: o 2 yi y f > ;
ing, an early study by Ebtehaj et al. (2010) applied a BB approach to i¼1 i
i¼1
improve the robustness of hydrologic parameter estimation. In
their approach, the pseudo time series with a fixed arbitrary block ( Pn o )
f 2
i¼1 yi y
length are sampled uniformly from independent and identical dis- EI ¼ 1 Pn 100 ð2Þ
tribution. However the block length should be determined consid-
o o 2
i¼1 yi y
ering the characteristics of the data used for the model
rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
development. Therefore in this paper, the optimal block length 1 Xn o f
2
was determined through the model calibration, so that the model RMSE ¼ y i y i ð3Þ
n i¼1
produces less error (i.e. RMSE). Besides, the moving length is also
n
required to be selected initially. The selected moving length should 1X
ensure that the sample data in each block preserve the statistical MBE ¼ yoi yif ð4Þ
n i¼1
characteristics, such as the auto correlation and heteroscedasticity,
of the original time series data. In this paper, the auto-correlation is 8 2 9
> Pn o >
used to determine the moving length. < i¼1 yi yi
f
=
The novel algorithm proposed to determine the block length is PI ¼ 1 ð5Þ
>
: P n
2>
;
described as follows: yo yo
i¼1 i ij
8 2 9
(a) Let ‘l’ and ‘k’ denote the block length and the moving length, > Pn o >
respectively. The input (xi) and output (yi) patterns of any
< i¼1 yi yi
f
=
EXI ¼ 1 Pn ð6Þ
bootstrap blocks are represented as ðxi ; yi Þ where (i.e. i = 1, >
:
2>
;
i¼1 yi yli
o o
2,. . ., l).
164 K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173
where n is the number of data points; yoi and yif are the observed (Case III) that the simulated hydrograph has a similar geometry (in
o and y
and forecasted values at a time, respectively; y f are the mean terms of both magnitude and shape) as the observed hydrograph,
of the observed and forecasted values; ‘j’ is the lead-time in the but there is a clear shift in time between the observed and simu-
lated hydrographs. Note that in streamflow forecasting, the simu-
forecasting; and ylif is the forecast corresponding to straight line fit-
lated hydrograph often shifts toward the right-hand side of the
ted to two most recent measurements, yoij and yoij1 .
observed hydrograph. In this case, the model overall underesti-
When conducting hydrograph evaluation (comparing modeled mates flows in the region of the rising limb of hydrograph, how-
and observed hydrographs), there are a large number of error met- ever it overestimates flows in the region of the falling limb of
rics currently available for selection and implementation. hydrograph (Case II). Besides the three cases, a model may produce
Crochemore et al. (2015) provided numbers of error metrics com- perfect forecasts, which do not have obvious amplitude and phase
monly used for this purpose. These error metrics, including the errors.
ones selected above in the paper, primarily estimate the amplitude In streamflow forecasting for flood management purposes, both
error in a variety of ways. The phase error, which commonly pre- the hydrograph geometry and the timing are two critical measures
sents in forecasts (e.g., flow forecasts here) and is obvious espe- to assess the model accuracy. Therefore, a new index, percent vol-
cially in long lead-time forecasting, has not been explicitly umetric error (PVE), is proposed aiming to quantitatively estimate
accounted when evaluating the model performance to present. the phase error for entire hydrograph. Fig. 2 illustrates a typical
Therefore, the hydrograph evaluation based on several existing example of observed and simulated hydrographs and the volume
error metrics might convey misleading information without suffi- error (VE) in a time interval, Dt (the shaded area in Fig. 2). PVE is
ciently assessing the modeled results from the aspect of phase calculated by Eqs. (7) and (8). The calculation of PVE for overesti-
error. As a result, an error metrics which explicitly considers the mation (negative VE) and underestimation (positive VE) is
phase error would be beneficial in better evaluating the developed separated.
model or minimizing the phase error in the model development.
8 9
Therefore in the paper, a new metrics is proposed especially for < yoi yif þ yoiþ1 yiþ1
f
=
quantifying the phase error by considering the second dimension VEiþ1 ¼ Dt ð7Þ
i
: 2 ;
– the time, when evaluating the model accuracy. Typically there
are three different possible cases in terms of the relative location
of a simulated hydrograph to an observed hydrograph (Fig. 1). Pu or o
VEiiþ1
The Case I represents that a model captures the dynamics or pre- PVEu or o ¼ Pi¼1
n 100 ð8Þ
i¼1 yi Dt
o
serves the shape of a hydrograph, but consistently underestimates
the observations. Thus the simulated hydrograph shows little to no where u and o indicate the numbers of positive and negative VE,
phase error (time lag). Note that a model can also overestimate the respectively.
observations while without an obvious phase error. In Case II, the In contrast to the conventional model accuracy measures, PVE
simulated hydrograph is right-skewed relative to the observed quantitatively accounts for the difference in the locations between
hydrograph; in addition, the time lags in the raising limb and the the observed and simulated hydrographs through adding the time
falling limb are not symmetric and the peak flow can be either into the calculation instead of concentrating on the magnitude of
overestimated or underestimated. There is also another possibility simulated results. As illustrated in Fig. 1, either PVEu or PVEo is zero
Fig. 1. Three typical relationships between observed (solid line) and simulated hydrographs (dotted line). Case I: no phase error, but consistent underestimation (or
overestimation – not shown in the graph); Case II: right-shifted with uneven phase errors in the rising and falling limbs; Case III: same shape, but evenly right-shifted in the
rising and falling limbs.
K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173 165
Observed the mountain snowpack melts. Flow in the Bow River at Calgary
often peaks after mid-June due to the contribution of both rain-
Simulated fall–runoff and mountain snowmelt and then slowly decreases
toward the winter. The City of Calgary, which is the most popu-
lated community within the river basin, is located in the foothills,
which is 1050 m above mean sea level. In the last decade, the City
Streamflow, m3/s
Fig. 3. Study area map (the extent of the figure is from the origin of the Bow River to the City of Calgary).
where N is the number of data to be decomposed. Hence, four levels The subscript ‘j’ denotes the forecast lead time.
of decomposition (L = 4) is determined herein. The decomposed sig-
nals, D1, D2, D3, D4 as well as the approximation signal, A4 and the 5.3. Network structure optimization
original flow data are plotted in Fig. 4, respectively.
In both the ANN-BB and the WNN-BB, the MLP ANN of a single
5.2. Input variable identification hidden layer was employed. The single layer of ANN is preferred
for reducing the inherent model structure uncertainty by minimiz-
The development of effective data-driven models largely ing number of network parameters (Kasiviswanathan et al., 2013).
depends on the identification of input variables that govern the The number of hidden neurons, which is responsible for capturing
underlying physical processes (Bowden et al., 2005a,b). This the dynamic and complex relationship between input and output
requires some degree of a priori knowledge of the system to be variables, is determined in the model calibration. Although com-
modeled (Thirumalaiah and Deo, 2000). It is, however, difficult to plex optimization procedures for determining the number of hid-
select the initial input variables based on the underlying physical den neurons are available in the literature (e.g., Maier et al.,
processes, especially in complex systems, and thereby analytical 2010), the simple trial and error method, which has been often
techniques such as auto- or cross-correlation are often employed employed (e.g., Srivastav et al., 2007; Kasiviswanathan and
(Sudheer et al., 2002). Despite that most such analytical techniques Sudheer, 2013), was adopted instead in this paper. The trial and
only take into account of the linear dependence between variables, error approach started with two hidden neurons initially, and the
they are mostly used for selecting input variables (Bowden et al., number of hidden neurons was increased by 1 neuron in each trial.
2005a,b). This paper employed a heuristic approach suggested by For each set of hidden neurons, the network was trained in a batch
Sudheer et al. (2002), in which the potential influencing variables mode (offline learning) to minimize the mean square error at the
corresponding to different time lags are identified through statisti- output layer. In order to avoid over-fitting in the model calibration,
cally analyzing the data series using cross-, auto-, and partial auto a cross validation was performed. The stopping criteria, mean
correlations between variables. In this paper, the output variable square error (0.0001) and/or maximum epochs (10,000), were
was correlated significantly with Qt (current flow) and Qt1 (ante- adopted in the training of ANN/WNN. The model calibration was
cedent flow with one-day time lag ‘t 1’), which were thus stopped when there is no significant improvement in the model
selected as the input variables. The selected variables were fed into performance. The optimal network architecture 2-2-1 (neurons in
the ANNs; while their decomposed signals as well as their approx- the input layer – neurons in the hidden layer – neurons in the out-
imate signals were the inputs of the WNNs. Thus, the ANN-BB and put layer) was determined for the ANN-BB; while the network
the WNN-BB are represented as follows, respectively. architecture 10-2-1 was selected for the WNN-BB.
Q tþj ¼ f ðQ t ; Q t1 Þ ð12Þ
5.4. Block bootstrap convergence
Q tþj ¼ f ðD1t ; D1t1 ; D2t ; D2t1 ; D3t ; D3t1 ; D4t ; D4t1 ; A4t ; A4t1 Þ
In order to assess the model precision, this paper applied BB to
ð13Þ sample different realizations of the input–output patterns during
K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173 167
Fig. 4. The daily flow, decomposed signals (D1, D2, D3 and D4), and the approximation signal (A4) based on data between 1912 and 2002.
Fig. 6. Variation of RMSE and its standard deviation with the block length. The dot and the line denote the mean and the standard deviation of RMSEs of the ensemble of the
WNN-BB and ANN-BB, respectively.
ment for both the WNN-BB and the ANN-BB. The results from the Table 1
model development suggest that an arbitrarily selected block Calculated various amplitude errors in the model validation.
length without justification based on the model performance Performance WNN-BB ANN-BB
might yield less optimal results. metrics
Flood Flood Flood Flood Flood Flood
I II III I II III
Fig. 7. Time series of 7-day lead-time forecasted and observed flows in Flood I (June 5–August 7, 2005), Flood II (May 9–August 25, 2007) and Flood III (May 24–August 7,
2010), respective, in the model validation.
ing. ANNs-based models are suspected to over-fit the data in the forecasted by the WNN-BB; but the phase error could result in
long lead-time forecasting (Prakash et al., 2014). In addition, the the inferior performance of the ANN-BB in reproducing the
WNN-BB produced more or less consistent performance in the observed hydrographs. These results suggest that the WNN-BB
three time periods; while the WNN-BB produced slightly better might be more efficient in capturing the dynamics of the
performance in Floods II and III compared to Flood I, which has observations.
the highest peak flow. This may be attributed to better model gen-
eralization of the developed model in the range from low to med- 6.2. Phase error in forecasting
ium flows, where contains a large portion of observations in the
calibration data set. The indices, PVEo and PVEu, were calculated for each Flood for 1-
In addition, the WNN-BB and the ANN-BB were compared in day, 3-day, 5-day and 7-day lead-time forecasting, respectively,
terms of reproducing the statistical characteristics of observations. and the results are presented in Table 3. It is under expectation
Descriptive statistics, including mean, standard deviation, and that the magnitudes of both PVEo and PVEu tend to increase with
skewness, calculated from the ensemble means of the WNN-BB the increase of the lead-time in both the WNN-BB and the ANN-
and the ANN-BB for each Flood are presented in Table 2. The results BB. This implies that the phase error increases with the increase
demonstrate that the forecasted flows by the WNN-BB reproduced of the lead-time. The WNN-BB, in general, yielded more or less
the statistical characteristics of the observations fairly well overall equivalent PVEo and PVEu in terms of magnitude but having differ-
compared to the ANN-BB. Both the means of forecasted flows ent signs in all cases, which suggests the WNN-BB can reproduce
obtained from the WNN-BB and the ANN-BB appear to be equiva- the shape of the observed hydrograph fairly well although there
lent and also closely match with observed means in all three is a phase error between forecasted and observed hydrographs
Floods; however there are considerable differences in other two (Case III in Fig. 1). The consistent forecast error in terms of PVEo
statistics, standard deviation and skewness, between these two and PVEu produced by WNN-BB might imply that the model can
models. Compared to the ANN-BB, the WNN-BB, in general, yielded be naive. However, the EXI values reported in Table 1 suggest that
standard deviation and skewness, which are closer to those of the the WNN-BB model does not just extrapolate the previous values.
observations. One of possible explanation of the results is that the In the forecasts produced by the ANN-BB, the magnitudes of PVEu
amplitude and the shape of the observed hydrographs are well are approximately two times of those of PVEo in Flood I and II (Case
170 K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173
Table 3
Estimates of percent volumetric error by both the WNN-BB and the ANN-BB for
various lead-times in the model validation.
Table 2
Comparison of the descriptive statistics between the observed flows and the forecasted flows by the WNN-BB and the ANN-BB for each Flood in the model validation.
Table 4 tendency to produce a large phase error in the long lead-time fore-
Prediction uncertainties estimated in each Flood in the 7-day lead-time forecasting by casting as discussed previously.
the WNN-BB and ANN-BB, respectively, in the model validation.
In water resource management, the decision makers always
Uncertainty WNN-BB ANN-BB desire accurate prediction with narrow prediction interval, when
index further analysis and management is needed based on modeling/-
Flood Flood Flood Flood Flood Flood
I II III I II III forecasting results. Based on the results in comparing the WNN-
POC (%) 53.13 52.78 33.33 32.81 35.19 21.33 BB and the ANN-BB, it can be concluded that the WNN-BB, in gen-
AW (m3/s) 97.34 23.03 12.94 50.00 27.02 16.40 eral, produces more accurate and precise forecasts, in particular in
the long lead-time forecasting.
The histograms of the forecasted peak flows along with the 6.4. Peak flow forecasting
observed peak flows for each Flood are presented in Fig. 9 for the
WNN-BB and the ANN-BB, respectively. It can be seen that the From the flood management perspective, for instant flood early
ranges of the forecasts by the WNN-BB encompass the observed warning, it is crucial to accurately forecast the timing of the peak
peak flows in all events; whereas the peak flows consistently fall flow besides the magnitude of the peak flow. Several errors were
outside of the interval bounds of forecasts by the ANN-BB. The calculated based on the observed and forecasted peaks. The peak
under-forecast of peak flows by the ANN-BB can also be seen from error (PE), which is the percentage of difference between the
Fig. 7. By closely observing Fig. 7, the under-forecast of peak flows observed peak and the ensemble mean of forecasted peak to the
by the ANN-BB might be ascribed to the fact that the ANN-BB has observed peak, is less than ±3.5% in Flood II and III in the WNN-
Fig. 9. Histograms of forecasted peak flows in each Flood by the WNN-BB and the ANN-BB, respectively. The blue points indicate the observed peak flows. (For interpretation
of the references to colour in this figure legend, the reader is referred to the web version of this article.)
172 K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173
6.5. Forecasting extremely high flow The authors would like to thank the University of Calgary (Eyes
High Program), Canada, for the financial support of this study. The
The foregoing discussions argue the applicability of the WNN- authors would also like to thank the anonymous reviewers for their
BB for the long lead-time forecasting. The developed WNN-BB insightful comments.
(for the 7-day lead-time forecasting) was also applied to forecast
the flows in 2013, when the highest peak flow of 1750 m3/s was References
recorded in Alberta’s history, and the results are illustrated in
Fig. 10. As shown in this figure, the WNN-BB forecasted the peak Abrahart, R.J., Heppenstall, A.J., See, L.M., 2007. Timing error correction procedure
applied to neural network rainfall—runoff modelling. Hydrol. Sci. J. 52, 414–
flow of 750 m3/s against the observed peak flow of 1750 m3/s, 431. http://dx.doi.org/10.1623/hysj.52.3.414.
which is an extreme outlier in the whole data set. Although the Adamowski, J., Sun, K., 2010. Development of a coupled wavelet transform and
model cannot forecast the extremely high flow, it can reproduce neural network method for flow forecasting of non-perennial rivers in semi-arid
watersheds. J. Hydrol. 390, 85–91. http://dx.doi.org/10.1016/j.
the low and medium flows reasonably. While observing the flows jhydrol.2010.06.033.
measured at Calgary over the historical period, the mechanisms Adkison, M., Peterman, R.M., 1996. Results of Bayesian methods depend on details
of generating peak flows, which can be rain-runoff, snow melting, of implementation: an example of estimating salmon escapement goals. Fish.
Res. 25, 155–170.
and the combination of rain-runoff and snow melting, vary from a
Alvisi, S., Franchini, M., 2011. Fuzzy neural networks for water level and discharge
year to a year. In addition as discussed previously, the lack of data forecasting with uncertainty. Environ. Model. Softw. 26, 523–537. http://dx.doi.
in the high flow region can also be blamed for the inferior perfor- org/10.1016/j.envsoft.2010.10.016.
mance of data-driven models in forecasting high flows, as the mod- Benaouda, D., Murtagh, F., Starck, J.L., Renaud, O., 2006. Wavelet-based nonlinear
multiscale decomposition model for electricity load forecasting.
els generally perform well within the range where they are Neurocomputing 70, 139–154. http://dx.doi.org/10.1016/j.neucom.2006.
sufficiently calibrated (Minns and Hall, 1996; Tokar and Johnson, 04.005.
K.S. Kasiviswanathan et al. / Journal of Hydrology 536 (2016) 161–173 173
Boucher, M.A., Lalibert́e, J.P., Anctil, F., 2010. An experiment on the evolution of an Mallat, S.G., 1989. A theory for multi-resolution signal decomposition: the wavelet
ensemble of neural networks for streamflow forecasting. Hydrol. Earth Syst. Sci. representation. IEEE Trans. Pattern Anal. Mach. Intell. 11 (7), 674–693.
14, 603–612. Minns, W., Hall, M.J., 1996. Artificial neural networks as rainfall-runoff models.
Bowden, G.J., Maier, H.R., Dandy, G.C., 2005a. Input determination for neural Hydrol. Sci. J. 41, 399–417.
network models in water resources applications. Part 1 – Background and Nayak, P.C., Sudheer, K.P., Rangan, D.M., Ramasastri, K.S., 2005. Short-term flood
methodology. J. Hydrol. 301, 75–92. http://dx.doi.org/10.1623/hysj.54.5.852. forecasting with a neurofuzzy model. Water Resour. Res. 41, 1–16. http://dx.doi.
Bowden, G.J., Maier, H.R., Dandy, G.C., 2005b. Input determination for neural org/10.1029/2004WR003562.
network models in water resources applications. Part 2 – Case study: Nourani, V., Kisi, Ö., Komasi, M., 2011. Two hybrid Artificial Intelligence approaches
forecasting salinity in a river. J. Hydrol. 301, 93–107. http://dx.doi.org/ for modeling rainfall–runoff process. J. Hydrol. 402, 41–59. http://dx.doi.org/
10.1016/j.jhydrol.2004.06.020. 10.1016/j.jhydrol.2011.03.002.
Cannas, B., Fanni, A., See, L., Sias, G., 2006. Data preprocessing for river flow Nourani, V., Komasi, M., Mano, A., 2009. A multivariate ANN-wavelet approach for
forecasting using neural networks: wavelet transforms and data partitioning. rainfall–runoff modeling. Water Resour. Manage. 23, 2877–2894. http://dx.doi.
Phys. Chem. Earth 31, 1164–1171. http://dx.doi.org/10.1016/j.pce.2006.03.020. org/10.1007/s11269-009-9414-5.
Chang, F.-J., Chen, P.-A., Lu, Y.-R., Huang, E., Chang, K.-Y., 2014. Real-time multistep- Partal, T., Kisßi, Ö., 2007. Wavelet and neuro-fuzzy conjunction model for
ahead water level forecasting by recurrent neural networks for urban flood precipitation forecasting. J. Hydrol. 342, 199–212. http://dx.doi.org/10.1016/j.
control. J. Hydrol. 517, 836–846. http://dx.doi.org/10.1016/j. jhydrol.2007.05.026.
jhydrol.2014.06.013. Politis, D.N., White, H., 2004. Automatic block-length selection for the dependent
Chen, P.A., Chang, L.C., Chang, F.J., 2013. Reinforced recurrent neural networks for bootstrap. Econ. Rev. 23, 53–70. http://dx.doi.org/10.1081/ETC-120028836.
multi-step-ahead flood forecasts. J. Hydrol. 497, 71–79. http://dx.doi.org/ Prakash, O., Sudheer, K.P., Srinivasan, K., 2014. Improved higher lead time river flow
10.1016/j.jhydrol.2013.05.038. forecasts using sequential neural network with error updating. J. Hydrol.
Crochemore, L., Perrin, C., Andréassian, V., Ehret, U., Seibert, S.P., Grimaldi, S., Gupta, Hydromech. 62, 60–74. http://dx.doi.org/10.2478/johh-2014-0010.
H., Paturel, J.-E., 2015. Comparing expert judgement and numerical criteria for Sang, Y.-F., Wang, Z., Liu, C., 2015. Wavelet neural modeling for hydrologic time
hydrograph evaluation. Hydrol. Sci. J. 60 (3), 402–423. series forecasting with uncertainty evaluation. Water Resour. Manage. http://
Ebtehaj, M., Moradkhani, H., Gupta, H.V., 2010. Improving robustness of hydrologic dx.doi.org/10.1007/s11269-014-0911-9.
parameter estimation by the use of moving block bootstrap resampling. Water Seo, Y., Kim, S., Kisi, O., Singh, V.P., 2015. Daily water level forecasting using wavelet
Resour. Res. 46, 1–14. http://dx.doi.org/10.1029/2009WR007981. decomposition and artificial intelligence techniques. J. Hydrol. 520, 224–243.
Golding, B., 2009. Long lead time flood warnings: reality or fantasy? Meteorol. Appl. http://dx.doi.org/10.1016/j.jhydrol.2014.11.050.
16, 3–12. Shamseldin, a.Y., O’Connor, K.M., 2001. A non-linear neural network technique for
Hall, P., Horowitz, J.L., Jing, B.Y., 1995. On blocking rules for the bootstrap with updating of river flow forecasts. Hydrol. Earth Syst. Sci. 5, 577–598. http://dx.
dependent data. Biometrika 82, 561–574. http://dx.doi.org/10.2307/2337534. doi.org/10.5194/hess-5-577-2001.
Han, D., Chan, L., Zhu, N., 2007. Flood forecasting using support vector machines. J. Shiri, J., Kisi, O., 2010. Short-term and long-term streamflow forecasting using a
Hydroinformatics 9 (4), 267–276. wavelet and neuro-fuzzy conjunction model. J. Hydrol. 394, 486–493. http://dx.
Kasiviswanathan, K.S., Cibin, R., Sudheer, K.P., Chaubey, I., 2013. Constructing doi.org/10.1016/j.jhydrol.2010.10.008.
prediction interval for artificial neural network rainfall runoff models based on Srivastav, R.K., Sudheer, K.P., Chaubey, I., 2007. A simplified approach to quantifying
ensemble simulations. J. Hydrol. 499, 275–288. http://dx.doi.org/10.1016/j. predictive and parametric uncertainty in artificial neural network hydrologic
jhydrol.2013.06.043. models. Water Resour. Res. 43, 1–12. http://dx.doi.org/10.1029/
Kasiviswanathan, K.S., Sudheer, K.P., 2013. Quantification of the predictive 2006WR005352.
uncertainty of artificial neural network based river flow forecast models. Sudheer, K.P., Gosain, A.K., Ramasastri, K.S., 2002. A data-driven algorithm for
Stoch. Environ. Res. Risk Assess. 27, 137–146. http://dx.doi.org/10.1007/ constructing artificial neural network rainfall-runoff models. Hydrol. Process.
s00477-012-0600-2. 16, 1325–1330. http://dx.doi.org/10.1002/hyp.554.
Kisi, O., Shiri, J., Tombul, M., 2013. Modeling rainfall-runoff process using soft Thirumalaiah, K., Deo, M.C., 2000. Hydrological forecasting using neural networks. J.
computing techniques. Comput. Geosci. 51, 108–117. http://dx.doi.org/ Hydrol. Eng. 5 (2), 180–189.
10.1016/j.cageo.2012.07.001. Tiwari, M.K., Chatterjee, C., 2010. Development of an accurate and reliable hourly
Kitanidis, P.K., Bras, R.L., 1980. Real-time forecasting with a conceptual hydrologic flood forecasting model using wavelet–bootstrap–ANN (WBANN) hybrid
model: 2. Applications and results. Water Resour. Res. 16, 1034–1044. http://dx. approach. J. Hydrol. 394, 458–470. http://dx.doi.org/10.1016/j.
doi.org/10.1029/WR016i006p01034. jhydrol.2009.12.013.
Liu, Y., Brown, J., Demargne, J., Seo, D.J., 2011. A wavelet-based approach to Tokar, B.A.S., Johnson, P.A., 1999. Rainfall-runoff modeling using artificial neural
assessing timing errors in hydrologic predictions. J. Hydrol. 397, 210–224. networks. J. Hydrol. Eng. 4 (3), 232–239.
http://dx.doi.org/10.1016/j.jhydrol.2010.11.040. Wang, W., Ding, J., 2003. Wavelet network model and its application to the
Maier, H.R., Jain, A., Dandy, G.C., Sudheer, K.P., 2010. Methods used for the prediction of hydrology. Nat. Sci. 1 (1), 67–71.
development of neural networks for the prediction of water resource variables Zhang, X.S., Liang, F.M., Srinivasan, R., Van Liew, M., 2009. Estimating uncertainty of
in river systems: current status and future directions. Environ. Model. Softw. 25, streamflow simulation using Bayesian neural networks. Water Resour. Res. 45.
891–909. http://dx.doi.org/10.1016/j.envsoft.2010.02.003. http://dx.doi.org/10.1029/2008wr007030.