Prediction and assessment of drought effects on surface water quality using artificial neural networks: case study of Zayandehrud River, Iran
© Safavi and Malek Ahmadi. 2015
Received: 21 December 2014
Accepted: 4 October 2015
Published: 8 October 2015
Although drought impacts on water quantity are widely recognized, the impacts on water quality are less known. The Zayandehrud River basin in the west-central part of Iran plateau witnessed an increased contamination during the recent droughts and low flows. The river has been receiving wastewater and effluents from the villages, a number of small and large industries, and irrigation drainage systems along its course. What makes the situation even worse is the drought period the river basin has been going through over the last decade. Therefore, a river quality management model is required to include the adverse effects of industrial development in the region and the destructive effects of droughts which affect the river’s water quality and its surrounding environment. Developing such a model naturally presupposes investigations into pollution effects in terms of both quality and quantity to be used in such management tools as mathematical models to predict the water quality of the river and to prevent pollution escalation in the environment.
The present study aims to investigate electrical conductivity of the Zayandehrud River as a water quality parameter and to evaluate the effect of this parameter under drought conditions. For this purpose, artificial neural networks are used as a modeling tool to derive the relationship between electrical conductivity and the hydrological parameters of the Zayandehrud River. The models used in this research include multi-layer perceptron and radial basis function. Finally, these two models are compared in terms of their performance using the time series of electrical conductivity at eight monitoring-hydrometric stations during drought periods between the years 1997–2012.
Results show that artificial neural networks can be used for modeling the relationship between electrical conductivity and hydrological parameters under drought conditions. It is further shown that radial basis function works better for the upstream stretches of the river while multi-layer perceptron is more efficient for the downstream stretches.
KeywordsDischarge Drought Temperature Electrical conductivity Artificial neural networks Multi layer perceptron Radial basis function
In recent decades, the available water has decreased to the extent that it barely, if at all, meets the human demands or the requirements for preserving the biological systems. Pollution and water scarcity are the two most important challenges facing most countries, especially those in arid and semi-arid regions. In this context, much attention has been focused on the physical availability of water resources at the expense of neglecting water quality which is also a main concern. Nowadays, an integrated and systematic approach to qualitative and quantitative management of water resources has gained a great significance due to the increasing components of these systems, the complex interrelationships, and their far reaching effects. For example, according to the Malaysia’s Department of Environment, many rivers experience a loss of quality, which in turn affects people’s health, the nation’s economy, and the environment . The main causes of river pollution are often associated with people’s attitudes and their lack of environmental awareness. This pollution is diffused due to development along the river .
On the other hand, periods of drought and low flows can have dramatic effects on aquatic systems by reducing the quantity of river flows . The impacts of drought conditions on river water quality may be substantial. Although the drought appeared to have significant adverse environmental effects, the actual impacts on water quality are not well understood. Typical effects are increases in total dissolved solids and their constituent ions and biochemical oxygen demand, and decreases in dissolved oxygen . There have been few studies evaluating impacts of droughts and low flow rivers on water quality or aquatic systems. These studies focused on modeling and discussing the possible impacts of drought and low flows on water quality [5–14]. Most of the models developed are complex and require a significant amount of field data to support analysis.
Recently, the neural networks approach has been applied in the areas of water engineering. Artificial neural networks are able to accurately approximate complicated non-linear input–output relationships. ANN model is flexible enough to accommodate additional constraints that may arise during its application. Moreover, the ANN model can reveal hidden relationships in historical data, thus facilitating the prediction and forecasting of water quality [2, 15, 16]. Many studies have been reported on water quality modeling and prediction by using ANNs [17–23]. Hence, motivated by successful applications in modeling non-linear system behaviors, ANNs are used in the present study for modeling and prediction of surface water quality in drought or low flow conditions.
The objective of this study is to predict and simulate electrical conductivity (EC) as a water quality parameter and to assess this parameter in drought conditions for the Zayandehrud River flows in west-central Iran. In this research, the relationship between electrical conductivity and hydrological parameters of the river investigated is obtained by artificial neural networks as the modeling tool hereinafter we can estimate the relation between hydrological parameters and water quality parameter. This modeling tool consists of the multi-layer perceptron (MLP) and the radial basis function (RBF). Finally, the two models are compared with respect to their performance.
Materials and methods
The Zayandehrud River is the most important river in the basin originating in the eastern slopes of the Zagross Mountain Range. The Zayandehrud storage dam with an efficient storage capacity of 1400 MCM is located 75 Km downstream the origin of the Zayandehrud River which has a natural average flow of about 900 MCM. To augment the water supply in the basin and to keep up with the increasing demand, inter-basin transfers have been implemented. Three tunnels have been constructed and are currently being operated which deliver an annual flow of 850 MCM into the basin. The flow downstream the dam supplies water for agricultural, municipal, and industrial uses. The total river length spans over a route of 350 Km to end in Gavkhooni wetland .
In recent decades, water has become increasingly scarce and the Zayandehrud basin has shown signs of salinization of agricultural land and increased pollution in the lower reaches of the river. While the river is subjected to multiple human impacts including water abstraction for domestic use in urban and rural areas, industrial and agricultural uses, and urban and agricultural runoff and drainage, it has also been receiving raw and treated sewages. Furthermore, the severe drought in recent years is a current phenomenon affecting water quantity and quality in the basin. Water quality generally shows a considerable spatial variability from upstream to downstream and deteriorates from Isfahan city downward the river’s course. The objective of this article is to evaluate the impact of droughts and low flows on the water quality of the Zayandehrud River.
Artificial neural networks
General concepts of artificial neural networks
An artificial neural network is created to mimic natural neural networks using computing processes. ANN models have been used to model wrapped non-linear input–output relationships in water resources management and environmental fields . ANNs receive a number of inputs in the processing units which are able to communicate by sending signals to each other through a large number of weighted connections. In each network, some basic features are presented such as a set of inputs, connections within each unit, an output from each unit, an external input called bias, the rule which determines the effective input from inputs, and an activation or transfer function (usually sigmoid) which computes the correlation between the sum and the output of the unit .
The main idea of neural networks is that parameters can be adjusted so that the network exhibits some desired or interesting behavior. Thus, we can train the network to do a particular job by adjusting the weight or bias parameters, or perhaps the network itself will adjust these parameters to achieve some acceptable end . The natural behavior of hydrological processes, and especially water quality, is appropriate for using the ANN approach. However, hydrological applications of ANN are still in their dehiscence stages .
The learning capability of ANNs is one of their interesting features. The purpose is to provide the network with a set of inputs for it to produce a certain set of outputs or at least to produce the desirable ones. The ANN processes sets of inputs and outputs in the vector phase. During periods of network learning, the weights gradually converge to desirable values. Actually, prediction error in learning a set is minimized by proper adjustment of weights. If the network learns properly, the model can produce outputs for unknown sets of inputs. There are two types of training used in ANNs: supervised and unsupervised [27, 28].
Multilayer perceptron neural network
Where, MSE is mean square error, N is the number of observations, T is the observation value, and Yi is the prediction or output value. Back-propagation learning rule may proceed in either of two ways: 1) the pattern or case by case mode; 2) the batch mode. In the former mode, calculations are performed after each case, while in the latter, updating the calculations and weights is performed after the whole training pattern is presented .
Generalizing multilayer perceptron neural network
After the learning stage is completed, the network enters the prediction stage in which the input vector which was not presented in the learning stage is applied to the network and the corresponding outputs are predicted. The ability of a network to predict such unknown outputs is called ‘interoperability’ or ‘generalization’. One of the obstacles against the learning stage is over-fitting or over-learning of ANN on training data by which is meant the error on the training data is reduced to a minimum, but the error is still high as a result of explicitly presenting unknown data as the set of inputs so that the network is not properly generalized. One solution proposed for generalizing the network is that the network is used in appropriate dimensions. Using the network with greater dimensions may result in over-fitting. A second solution for improved generalization of the network is regularization, which will not be further discussed in the present article.
In early stopping, the data is broken down into three categories. The first is the training data set that is used for adjusting weights and for training the network. The second category consists of the validation set. During the training process, routine training is supervised. The error of the validation set should decrease as with the training set errors. When the network is on the verge of over-fitting, the validation error begins to grow and training is stopped. The third category involves the test set. This set is not employed during the training and comparing processes if diverse models are performed by this set.
Radial basis function neural network
Where, μ is the center of the Gaussian function and d is the distance (radius) from the center of φ(x, μ) which gives a measure of the spread of the Gaussian curve.
Based on the type of neurons chosen from among those existing in the hidden layer, one of two methods may be employed for training the RBF-NN. The first is an exact design while the second is a more efficient design. In the first method, the numbers of hidden layer neurons are considered to be equal to the number of inputs. In the second method, one neuron is added each time to previous neurons individually till the minimum error is yielded . In this research, we used the more efficient error for modeling.
Method of presenting input and output for training the network
It is better to present and apply input and output sets to the network in a random manner. If the data in the input file are categorized and sorted or applied to the network in a specified sequence, the network may forget what it is to learn. In fact, the network learns relationships between the input and output data but when new data are presented to the network, the error value may increase. Random presentation of data is one of the efficient routes to escape local minimization .
Where, Oi and Ti represent the exact or real value of the output (observation) and the predicted (test) value, respectively. N is the number of observations and Ōi is the mean of the exact value.
If RMSE and MAE are close to zero, this will indicate that the prediction result is more accurate. R2 Anywhere close to 1 indicates that a better adoption was obtained through the exact and prediction values.
Water quality and hydrologic data
Variations at hydrometric stations for drought conditions
Modeling of water quality using neural networks
For modeling and predicting electrical conductivity (EC), we used MLP-NN and RBF-NN models. Hydrological parameters were used in the network as important factors affecting electrical conductivity to predict EC appropriately. Matlab software Ver. R2011b was used to build both networks with four input vectors. Discharge at present (t), discharge at a previous period (t-1), mean temperature at present (t), and electrical conductivity at a previous period (t-1) were fed as the sets of inputs to simulate electrical conductivity in the present time (t).
The performances of the models are evaluated using determination coefficient (R2), root mean square error (RMSE), and mean absolute error (MAE).
Results and discussion
Summary results for MLP-NN for simulating EC
Optimized number of neurons
Total number of data
First hidden layer
Second hidden layer
First hidden layer
Second hidden layer
Max absolute error prediction
Min absolute error prediction
Summary results for RBF-NN for simulating EC
Optimized number of neurons and spread
Total number of data
First hidden layer
Max absolute value of error prediction
Min absolute value of error prediction
Where, Yo represents the observed values and YP designates the predicted ones. This equation originates from a simple proportion that is commonly used for comparing two cases.
Summary of PEER results for MLP-RBF comparison
The low accuracy of classical methods and approaches such as linear regression for modeling environmental conditions and water quality, as well as the nonlinear nature of water quality problems for planning proper management systems have been discussed in numerous researches. A proper management plan is a comprehensive plan which has sufficient valence and reliability both in scientific terms and in empirical or industrial applications. ANN or the black-box model is a new technique for modeling water quality problems. It can accurately model problems involving water quality and hydrological processes provided that sufficient experimental data are available. It is also capable of discovering non-linear relations between hydrological and water quality parameters.
In this study, two different ANN models, namely the MLP and the RBF, were used to simulate and predict electrical conductivity in drought or low-flow conditions. Both networks were then compared with respect to their performance. It was found that electrical conductivity is associated with major water quality parameters and further that it is intensely depends on changes in discharge to the extent that the changes can be used as a proper water quality indicator. Significant changes in EC indicate abrupt changes in discharge or introduction of pollutants into the river. Obviously, river discharge is one of the parameters affected by hydrological droughts. Water from the Zayandehrud River is released from a regulating dam; discharge is, therefore, regulated at the downstream stations. When upstream discharge is low, a water deficit or drought conditions accrue, whereby evaporation is increased and the water stored in the dam reservoir declines. It is observed that EC increases severely at the last station near Gavkhuni Wetland where enormous biological disasters have been observed to occur which indicate the enormous agricultural activities upstream the Gavkhuni Wetland.
In this study, drought borders were determined and employed in the MLP and RBF neural networks. The results showed that when MAE is used as a criterion for comparing the networks in terms of their performances, the RBF-NN was found to outperform MLP-NN. However, based on the same criterion, both MLP-NN and RBF-NN were found to be equally reliable. According to the prediction error enhancement rate used as a criterion, the MLP-NN was found to be more efficient than the RBF-NN at Lenj, Mousian, and Chum-bridge stations. Obviously, these two criteria provided better results for MLP-NN at Lenj station. Nevertheless, both networks could be used for accurately modeling the situation at each station. Other decision making methods are suggested for investigation to validate the results obtained. Also, these neural network structures can be used as the basis for predicting and simulating water quality in diverse hydrological conditions, and for improving management approaches in river basins.
The authors wish to thank Isfahan Regional Water Company for providing the required data for this research.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Department of Environment (DOE). Water quality management in Malaysia. Kuala Lumpur, Malaysia: Federal Goverment Administrative Centre; 2003.
- Najah A, El-Shafie A, Karim OA, El-Shafie Amr H. Application of artificial neural networks for water quality prediction. Neural Comput & Applic. 2013;22(1):187–201.View ArticleGoogle Scholar
- Tallaksen LM, Madsen H, Clausen B. On the definition and modelling of stream flow drought duration and deficit volume. Hydrol Sci J. 1997;42:15–33.View ArticleGoogle Scholar
- Bruce CC, Robinson DP. Some effects of the 1982–83 drought on water quality and macro-invertebrate fauna in the Loer La Trobe River, Victoria. Aust J Mar Fresh Wat Res. 1987;38:289–99.View ArticleGoogle Scholar
- Attrill MJ, Power M. Modeling the effect of drought on estuarine water quality. Water Res. 2000;34(5):1584–94.View ArticleGoogle Scholar
- Caruso BS. Temporal and spatial patterns of extreme low flows and effects on stream ecosystems in Otago, New Zealand. J Hydrology. 2002;257:115–33.View ArticleGoogle Scholar
- Clair TA, Ehrman JM. Variations in discharge and dissolved organic carbon and nitrogen export from terrestrial basins with changes in climate: a neural network approach. Limnol Oceanogr. 1996;41:921–7.View ArticleGoogle Scholar
- Hrdinka T, Novicky O, Hanslik E, Rieder M. Possible impacts of floods and droughts on water quality. J Hydro Environ Res. 2012;6:145–50.View ArticleGoogle Scholar
- Murdoch PS, Baron JS, Miller TL. Potential effects of climate change on surface-water quality in North America. J Am Water Resour Assoc. 2000;36:347–66.View ArticleGoogle Scholar
- Nouri J, Mirbagheri SA, Farrikhian F, Jaafarzadeh N, Alesheikh AA. Water quality variability and eutrophic state in wet and dry years in wetlands of the semiarid and arid regions. Environ Earth Sci. 2010;59:1397–407.View ArticleGoogle Scholar
- Schindler DW. Widespread effects of climate warming on freshwater ecosystems in North America. Hydrologic Processes. 1997;11:225–51.View ArticleGoogle Scholar
- Sprague LA. Drought effects on water quality in the South Platte River Basin, Colorado. J Am Water Resour Assoc. 2005;41(1):11–24.View ArticleGoogle Scholar
- van Vliet MTH, Zwolsman JJG. Impact of summer droughts on the water quality of the Meuse River. J Hydrol. 2008;353:1–17.View ArticleGoogle Scholar
- Zielinski P, Gorniak A, Krzysztof Piekarski M. The effect of hydrological drought on chemical quality of water and dissolved organic carbon concentrations in Lowland Rivers. Polish J Ecol. 2009;57:373–84.Google Scholar
- Nasir MFM, Abdul Zali M, Juahir H, Hussain H, Zain SM, Ramli M. Application of receptor models on water quality data in source apportionment in Kuantan River Basin. J Environ Health Sci Eng. 2012;9:18.View ArticleGoogle Scholar
- Zare AH. Evaluation of multivariate linear regression and artificial neural networks in prediction of water quality parameters. J Environ Health Sci Eng. 2012;12:40.View ArticleGoogle Scholar
- Fu Y, Zhao Y, Zhang Y, Guo T, He Z, Chen J. GIS and ANN-based spatial prediction of DOC in river networks: a case study in Dongjiang, Southern China. Environ Earth Sci. 2013;68:1495–505.View ArticleGoogle Scholar
- Ha H, Stenstrom MK. Identification of land use with water quality data in stormwater using a neural network. Water Res. 2003;37(17):4222–30.View ArticleGoogle Scholar
- Maier HR, Dandy GC. Neural networks for the prediction and forecasting of water resources variables: a review of modelling issues and applications. Environ Model Softw. 2000;15(1):101–24.View ArticleGoogle Scholar
- Ogleni N, Topal B. Water quality assessment of the Mudurnu River, Turkey. Using Biotic Indices. Water Resour Manag. 2011;25(11):2487–508.View ArticleGoogle Scholar
- Rooki R, Doulati Ardejani F, Aryafar A, Bani AA. Prediction of heavy metals in acid mine drainage using artificial neural network from the Shur River of the Sarcheshmeh porphyry copper mine, Southeast Iran. Environ Earth Sci. 2011;64:1303–16.View ArticleGoogle Scholar
- Verma AK, Singh TN. Prediction of water quality from simple field parameters. Environ Earth Sci. 2013;69:821–9.View ArticleGoogle Scholar
- Zhang Y, Pulliainen J, Koponen S, Hallikainen M. Application of an empirical neural network to surface water quality estimation in the Gulf of Finland using combined optical data and microwave data. Remote Sens Environ. 2002;81(2–3):327–36.View ArticleGoogle Scholar
- Safavi HR, Khoshoei Esfahani M, Zamani AR. Integrated index for assessment of vulnerability to drought, case study: Zayandehrud River basin, Iran. Water Resour Manag. 2014;28(6):1671–88.View ArticleGoogle Scholar
- Safavi HR, Chakraei I, Kabiri-Samani A, Golmohammadi MH. Optimal reservoir operation based on conjunctive use of surface water and groundwater using neuro-fuzzy systems. Water Resour Manag. 2013;27(12):4259–75.View ArticleGoogle Scholar
- Haykin S. Neural networks: Comprehensive foundation. Upper Saddle River: Prentice-hall; 1999.Google Scholar
- Farmaki EG, Thomaidis NS, Constantinos EE. Artificial neural networks in water analysis; Theory and applications. Int J Environ An Ch. 2010;90(2):85–105.View ArticleGoogle Scholar
- Hagan MT, Demuth HB, Beale MH. Neural network design, MA. Boston: PWS Publishing; 1996.Google Scholar
- Broomhead DS, Lowe D. Multivariate functional interpolation and adaptive networks. Complex Systems. 1988;2:321–55.Google Scholar
- Bishop CM. Neural networks for pattern recognition. Oxford University Press. 1996
- El-Shafie A, Jaafer O, Akrami SA. Adaptive neuro-fuzzy inference system based model for rainfall forecasting in Klang River, Malaysia. Int J Phys Sci. 2011;6(12):2875–88.Google Scholar