Development of innovative computer software to facilitate the setup and computation of water quality index
© Nabizadeh et al.; licensee BioMed Central Ltd. 2013
Received: 6 April 2013
Accepted: 22 April 2013
Published: 26 April 2013
Developing a water quality index which is used to convert the water quality dataset into a single number is the most important task of most water quality monitoring programmes. As the water quality index setup is based on different local obstacles, it is not feasible to introduce a definite water quality index to reveal the water quality level.
In this study, an innovative software application, the Iranian Water Quality Index Software (IWQIS), is presented in order to facilitate calculation of a water quality index based on dynamic weight factors, which will help users to compute the water quality index in cases where some parameters are missing from the datasets.
A dataset containing 735 water samples of drinking water quality in different parts of the country was used to show the performance of this software using different criteria parameters. The software proved to be an efficient tool to facilitate the setup of water quality indices based on flexible use of variables and water quality databases.
The Water Quality Index (WQI) is introduced as a mathematical instrument to convert the water quality dataset into a single number which represents the water quality level while eliminating subjective assessments of water quality and biases of individual water quality experts . Application of water quality indices allows the assessment of changes in water quality over time and space and also the evaluation of the efficacy of domestic policies and international strategies designed to protect aquatic resources . Water quality indices are also used for the classification of water .
Ramakrishnaiah et al.  presented a groundwater WQI which was based on 12 parameters: pH, Total Hardness(TH), Ca++, Mg++, HCO3 −, Cl−, NO3 −, SO4 −, Total Dissolved Dolids (TDS), Fe++, Mn++, and F−. According to their presented method, the values of these 12 parameters should be monitored to calculate the WQI. Relative weight factors of the mentioned twelve parameters should also be calculated, and there is no way to calculate the WQI when parameters included in the computation of the index are missing from the datasets. In many countries, water monitoring programmes are decentralized and different water monitoring sectors include their choices of parameters in routine periodic sampling and analysis. Therefore, the use of water quality indices which are based on fixed parameters overlooks large data records during the process of computing the WQI, especially when the index is not defined according to available data in the database. In many areas, especially those with extensive use of agro-chemicals, it is necessary to consider pesticides as health-risk-based parameters. Furthermore, in industrialized areas with high levels of potentially harmful anthropogenic pollutants, the role of organic solvents such as carbon tetrachloride, trichloroethylene, and perchloroethylene as potential criteria pollutants should not be overlooked; otherwise, some particular water sources may receive good scores and yet have water quality impaired by parameters not included in the index.
According to the above mentioned cases, it is not practical to set up a WQI with definite criteria pollutants which could be effectively used in all cases. Therefore, software is needed to enable water quality experts to set up their own water quality indices. Furthermore, facilities should be presented for the efficient use of parameters in water quality datasets which contain missing values. In this study, a software named as the Iranian Water Quality Index Software (IWQIS) was developed to address these issues.
Materials and methods
Water quality index background
Two indices were calculated in 1988; the degree of contamination for health-risk -based parameters (F−,NO3−, UO22−, As, B, Ba, Cd, Cr, Ni, Pb, Rn, and Se), and the degree of contamination for technical-aesthetic parameters (pH, KMnO4 consumption, SO4 −, Cl−, Ag, Al, Cu, Fe, Mn, Na, and Zn) . In another study, nine variables were considered: nitrate, phosphate, chloride, TDS, biological oxygen demand, cadmium, chromium, nickel, and lead .
Stigter et al.  created a groundwater quality index (GWQI) with a method based on multivariate analysis for monitoring the influence of agriculture using parameters of groundwater chemistry and potability and tested its applicability in the south of Portugal. They included nitrate, sulphate, chloride, and calcium in their presented index. A groundwater quality index (GWQI) was also developed to assess water quality affected by a landfill site based on seven variables . In this study, creation of the index was based on Principal Component Analysis (PCA) and benchmarking analysis. They showed that seven variables, electric conductivity, TDS, salinity, nitrate, chemical oxygen demand, and iron, could be used as indicators. Simoes et al.  proposed a Water Quality Index for management purposes in the Medio Paranapanema Watershed in Sao Paulo State, Brazil, as a pollution indicator for aquaculture activity based on three parameters: turbidity, total phosphorus, and dissolved oxygen. They showed that the water quality degradation in the studied area due to aquaculture activity could be described with this simple index.
The groundwater quality in Sunamganj, Bangladesh, was studied based on different indices for irrigation and drinking uses. Parameters such as absorption ratio, soluble sodium percentage, residual sodium carbonate, electrical conductance, magnesium adsorption ratio, Kelly’s ratio, total hardness, permeability index, and residual sodium bi-carbonate were included to investigate the ionic toxicity .
Terrado et al.  selected the WQI of the Canadian Council of Ministers of the Environment (CCME WQI) as the most suitable index. It gives a number between 0 (worst quality) and 100 (best quality). They also performed a sensitivity analysis for the CCME WQI to select the best procedure for optimizing the WQI according to input data. Sharma and Patel  collected various seasonal groundwater samples for some consecutive years and the respective physiochemical analysis was carried out for five groundwater quality parameters (pH, TDS, chlorides, hardness, and electrical conductivity) which are essentially responsible for groundwater quality degradation in the studied area. They indicated that the groundwater of the study area needs to achieve a considerable degree of quality improvement by the most feasible approach such as artificial groundwater recharging. Yidana et al.  developed a groundwater classification scheme using a robust WQI modified for the case of the Keta basin and classified groundwater in their study area into ‘good’, ‘fair’, and ‘marginal’ water types using ordinary kriging developed from a well fitted linear semivariogram function. Recently, a global, country-level Water Quality Index (WATQI) was developed as a research and policy-making tool for the measurement and management of freshwater quality based on data from the UNEP GEMS/Water programme and the European Environment Agency (EEA) .
Omo-Irabor et al.  subjected the chemical data set to PCA/FA, and Hierarchic Cluster Analysis (HCA). The aim of this study was to determine the nature and spatial distribution of chemical pollutants in surface and groundwater resources in the western Niger Delta region. Yidana et al.  used the multivariate method to analyse surface water hydrochemical data from different locations along the Ankobra Basin, Ghana. They aimed to extract principal factors related to different sources of variation in the hydrochemistry, and therefore they combined PCA and CA to classify water samples into specific groups on the basis of hydrochemical characteristics. Banoeng-Yakubo et al.  calculated a WQI for samples using concentrations of Na+, Ca++, Mg++, Cl−, NO3 −, F−, and EC at various sample locations. R-mode HCA and factor analysis (using varimax rotation and the Kaiser Criterion) were used to find the significant sources of variation in the hydrochemistry. They classified the WQI values into five categories as follows (<50: excellent water; 50–100: good water; 100–200: poor water; 200–300 very poor water; >300: water unsuitable for drinking). Saeedi et al.  used a WQI to analyse the nature and rate of land use change and its associated impact on groundwater quality. In this study, a methodology based on multivariate analysis was developed to create a GWQI that aimed to identify the places with the best quality water for drinking within the Qazvin province in western central Iran. Al-Shami et al.  studied the abundance and diversity of benthic macroinvertebrates as well as physico-chemical parameters in five rivers of the Juru River Basin in northern Peninsula Malaysia. The physico-chemical parameters and calculated WQI were significantly different among the investigated rivers (ANOVA, p < 0.05). They concluded that the multivariate analysis (CCA) was highly satisfactory, explaining 43.32% of the variance for the assemblages of macroinvertebrates as influenced by 19 physical and chemical variables.
Bu et al.  studied the sampled water quality at 12 sampling sites in the Jinshui River of the South Qinling Mountains in China. It was confirmed that 25 studied water quality variables had significant temporal differences (p < 0.01) and spatial variability (p < 0.01). Based on the similarity of water quality variables and application of cluster analysis, the 12 sampling sites were classified into three pollution level groups (no pollution, moderate pollution, and high pollution). Razmkhah et al.  applied PCA and HCA methods to determine the water quality of Jajrood River (Iran) and to assess and discriminate the relative magnitude of anthropogenic and natural influences on the quality of river water. T, EC, pH, TDS, NH4 +, NO3 −, NO2 −, Turbidity, Total Hardness, Ca++, Mg++, Na+, K+, Cl−, SO4 −, and SiO2 were selected as the physico-chemical variables and total coliform and faecal coliform as the biochemical variables to be analysed in the water samples from 18 sampling stations.
In another study, parameters such as dissolved oxygen (DO), biochemical oxygen demand (BOD), pH, temperature, TDS, turbidity, faecal coliform, heterotrophic plate count, hardness, alkalinity, arsenic, lead, mercury, nickel, cadmium, chromium, total phosphorous, H2S, nitrate, and fluoride were selected to develop the quality of drinking water supplied to dairy cattle based on fuzzy logic using trapezoidal membership functions . In our recent study, we selected twenty parameters which were included based on their critical importance for the overall water quality and their potential impact on human health to assess the performance of the proposed index under actual conditions. The comparison of the outputs of the fuzzy-based proposed index with those of the NSF WQI and Canadian Water Quality Index (CWQI) showed similar results and were sensitive to changes in the level of water quality parameters .
Water quality index setup
The structure of variables, weights, mathematical relationships, and specific features of the GWQI presented in this study are described in this section. For different water quality indices, various variables may be selected according to the importance of the parameters and availability of data. In this study, we developed software which enables users to choose different parameters according to the desired criteria pollutants. In the software, the user can select up to 40 variables which are supposed to be responsible for water contamination based on the importance of the variables, the availability of data, and experts’ professional judgements. The most frequently used variables in other studies which are used in water monitoring programmes and in our national monitoring water activities are set as default parameters.
Criteria parameters, weight factors, and limit values considered for setting up the water quality index
Water quality classification based on WQI values
Water quality index values
Excellent water quality
Good water quality
Poor water quality
Very poor water quality
Unsuitable for drinking
The main concept and incentive for developing the IWQIS was to facilitate the computation of WQI with more flexibility and to make the calculation of the WQI feasible in cases where some data related to selected criteria pollutants are missing from the database. It is very common to find missing values in some records of water quality databases. As mentioned, all the previous water quality indices were based on the use of fixed parameters and their definite weights. The practical shortcoming of these indices appears when one or more parameters are not available in a record set. In these cases the other data could not be used for calculation of the index, since the weights are fixed and cannot be changed. In the method presented in this study, the weights are dynamic and in cases where users face a lack of data in records of water samples, the new relative weights are recalculated according to the available data.
As previously mentioned, quality scores are determined dynamically for parameters which have available data in the water quality dataset. The water quality index, a dimensionless number, is determined as the sum of all quality values for those constituents chosen by the user as criteria parameters.
In this study, user-friendly software has been developed according to the concept of dynamic weights allocation to make the computation of the WQI simple. This package is called the Iranian Water Quality Index Software (IWQIS) and can be effectively used to process water quality data according to the user’s choice of parameters, weights, and limit values. The authors provide access to the mentioned software (via: http://tums.ac.ir/ajaxplorer/data/public/ec21f02fcf2f681a89a2f7500c83d1e6.php?lang=en) in order to simplify the water quality assessment monitoring activities.
The report of IWQIS is generated as an Excel workbook with three worksheets, “Original Data”, “Quality Values”, and “Water Quality Index”. The first sheet, Original Data, includes the data which were previously entered in the database. The second sheet presents the calculated quality values for each parameter and the third sheet includes WQI and the related interpretations.
Spatial variability and principal component analysis
Recently, multivariate statistical methods have been used to characterize and evaluate surface and groundwater. Chemical, biological, and physical data were monitored at 12 locations along the Passaic River, New Jersey and analysed in a study performed by Bengraine and Marhaba . PCA was used to extract the factors related to the hydrochemical variability and to demonstrate the spatial and temporal changes in water quality. Singh et al.  used cluster analysis (CA), factor analysis (FA), PCA, and discriminant analysis (DA) of the dataset on water quality of the Gomti River (India). They concluded that 10 parameters (river discharge, pH, BOD, Cl, F, PO4, NH4–N, NO3 –N, TKN, and Zn) contributed to 97% correct assignations in the spatial analysis of three different regions in the basin. Zhou et al.  showed that multivariate statistical methods are useful for interpreting complex data sets in the analysis of temporal and spatial variations in water quality and could be used for the optimization of a regional water quality monitoring network.
In this study, the spatial variability in the dataset with 735 drinking water samples in the country was illustrated using box plots. After filtering records with missing values, PCA was performed to find the meaningful components. The retained components were used to perform a linear model. Finally, the fitness of predictions of the principal component model generated and the WQI computed by IWQIS was determined. It should be noted that PCA was performed using R software .
Results and discussion
Stambuk-Giljanovic  believes that lack of consent for the selection of quality evaluation parameters is the greatest obstacle to a broader index application in the world. Rickwood and Carr  published a list of all possible parameters, their associated WHO guidelines, and whether they were measured in 20%, 35%, and 50% of countries in all regions: Europe, Asia, Africa, Americas, and Oceania. The appropriate selection of criteria variables from the list for setting the quality index is still the most important task. In this study, the selection of variables was essentially based on the availability of data on the national scale. We tried to choose those parameters which are commonly measured in water monitoring programmes. In this stage, the objective of our study was to show a general picture of drinking water quality using widely selected water samples from around the country.
Standardized loadings (pattern matrix) based upon correlation matrix
In this study, the first component, which accounts for about 44% of the variance, has high positive loadings for magnesium, chloride, TDS, fluoride, and sulfate, and could be due to the dominant share of groundwater resources in supplying drinking waters. The second principal component accounts for about 13% of the hydrochemistry and has a high positive loading for calcium. This factor could be related to higher alkalinity of groundwater due to bicarbonate ions. The third principal component accounts for about 10% of the variance in the hydro-chemical data and has high positive loadings for NO3 − and turbidity. This could be attributed to the impact of domestic waste and agricultural activities. The fourth principal component represents about 8% of the variance in the hydrochemistry of drinking water in the country and has high positive loadings for ammonium, which is an indication of agricultural practice with excessive use of fertilizers.
Outputs for linear model of 4 retained principal components
Descriptive statistics of water quality index of drinking water samples
Confidence Level (95.0%)
The previous works done by researchers had revealed that water quality indices should be set according to generic water quality parameters as well as locally important variables which may not be of importance in other locations. The results of these researches showed that the WQI for the monitoring of water quality changes with time and location. Hence, the importance of the variables, availability of the data, and experts’ professional judgements should be considered as the main cornerstones of WQI development. In this study, the Iranian Water Quality Index Software (IWQIS) has been set, tested and proved to be an efficient tool to facilitate the setting up of water quality indices based on flexible use of variables and existing water quality databases. The software prepared in this work will help researchers and water quality monitoring experts to design and calculate their own water quality indices easily. The presented software can be used by other researchers and communities based on the following considerations.
The criteria parameters, weights, and limit values should be entered into the program according to local considerations.
If the data are previously available, IWQIS would be a helpful tool to calculate the desired WQI, especially if there are some missing values in the record set.
In cases where samples with many parameters have been collected, techniques such as PCA are useful to reduce the number of variables.
IWQIS can also be used to determine the sensitivity analysis of weights attributed to the parameters when the allocation of definite weight factors to some parameters is controversial.
This research was supported by a grant from Tehran University of Medical Sciences and Health Services.
- Stambuk-Giljanovic N: Water quality evaluation by index in Dalmatia. Water Res 1999, 33: 3423–3440. 10.1016/S0043-1354(99)00063-9View ArticleGoogle Scholar
- Rickwood CJ, Carr GM: Development and sensitivity analysis of a global drinking water quality index. Environ Monit Assess 2009, 156: 73–90. 10.1007/s10661-008-0464-6View ArticleGoogle Scholar
- Chaturvedi MK, Bassin JK: Assessing the water quality index of water treatment plant and bore wells, in Delhi, India. Environ Monit Assess 2010, 163: 449–453. 10.1007/s10661-009-0848-2View ArticleGoogle Scholar
- Ramakrishnaiah CR, Sadashivaiah C, Ranganna G: Assessment of water quality index for the groundwater in Tumkur Taluk, Karnataka State, India. E J Chem 2009, 6: 523–530. 10.1155/2009/757424View ArticleGoogle Scholar
- Backman B, Bodis D, Lahermo P, Rapant S, Tarvainen T: Application of a groundwater contamination index in Finland and Slovakia. Environ Geol 1998, 36: 55–64. 10.1007/s002540050320View ArticleGoogle Scholar
- Soltan ME: Evaluation of ground water quality in Dakhla Oasis (Egyptian Western Desert). Environ Monit Assess 1999, 57: 157–168. 10.1023/A:1005948930316View ArticleGoogle Scholar
- Stigter TY, Ribeiro L, Dill A: Application of a groundwater quality index as an assessment and communication tool in agro-environmental policies - Two Portuguese case studies. J Hydrol 2006, 327: 578–591. 10.1016/j.jhydrol.2005.12.001View ArticleGoogle Scholar
- Mohamad Roslan MK, Mohd Kamil Y, Wan Nor Azmin S, Mat Yusoff A: Creation of a ground water quality index for an open municipal landfill area. Malaysian J Math Sci 2007, 1: 181–192.Google Scholar
- Simoes FD, Moreira AB, Bisinoti MC, Gimenez SMN, Yabe MJS: Water quality index as a simple indicator of aquaculture effects on aquatic bodies. Ecol Indic 2008, 8: 476–484. 10.1016/j.ecolind.2007.05.002View ArticleGoogle Scholar
- Raihan F, Alam JB: Assessment of groundwater quality in Sunamganj of Bangladesh. Iranian J Environ Health Sci Eng 2008, 5: 155–166.Google Scholar
- Terrado M, Borrell E, de Campos S, Barcelo D, Tauler R: Surface-water-quality indices for the analysis of data generated by automated sampling networks. TrAC: Trends Analytic Chem 2010, 29: 40–52. 10.1016/j.trac.2009.10.001Google Scholar
- Sharma ND, Patel JN: Evaluation of groundwater quality index of the urban segments of Surat City, India. Int J Geol 2010, 4: 1–4.Google Scholar
- Yidana SM, Banoeng-Yakubo B, Akabzaa TM: Analysis of groundwater quality using multivariate and spatial analyses in the Keta basin, Ghana. J Afr Earth Sci 2010, 58: 220–234. 10.1016/j.jafrearsci.2010.03.003View ArticleGoogle Scholar
- Srebotnjak T, Carr G, de Sherbinin A, Rickwood C: A global water quality index and hot-deck imputation of missing data. Ecol Indic 2012, 17: 108–119.View ArticleGoogle Scholar
- Omo-Irabor OO, Olobaniyi SB, Oduyemli K, Alunna J: Surface and groundwater water quality assessment using multivariate analytical methods: a case study of the Western Niger Delta, Nigeria. Phys Chem Earth 2008, 33: 666–673. 10.1016/j.pce.2008.06.019View ArticleGoogle Scholar
- Yidana SM, Ophori D, Banoeng-Yakubo B: A multivariate statistical analysis of surface water chemistry data - The Ankobra Basin, Ghana. J Environ Manag 2008, 86: 80–87. 10.1016/j.jenvman.2006.11.023View ArticleGoogle Scholar
- Banoeng-Yakubo B, Yidana SM, Emmanuel N, Akabzaa T, Asiedu D: Analysis of groundwater quality using water quality index and conventional graphical methods: the Volta region, Ghana. Environ Earth Sci 2009, 59: 867–879. 10.1007/s12665-009-0082-9View ArticleGoogle Scholar
- Saeedi M, Abessi O, Sharifi F, Meraji H: Development of groundwater quality index. Environ Monit Assess 2010, 163: 327–335. 10.1007/s10661-009-0837-5View ArticleGoogle Scholar
- Al-Shami SA, Rawi CSM, Ahmad AH, Hamid SA, Nor SAM: Influence of agricultural, industrial, and anthropogenic stresses on the distribution and diversity of macroinvertebrates in Juru River Basin, Penang, Malaysia. Ecotoxicol Environ Saf 2011, 74: 1195–1202. 10.1016/j.ecoenv.2011.02.022View ArticleGoogle Scholar
- Bu H, Tan X, Li S, Zhang Q: Temporal and spatial variations of water quality in the Jinshui River of the South Qinling Mts., China. Ecotoxicol Environ Saf 2010, 73: 907–913. 10.1016/j.ecoenv.2009.11.007View ArticleGoogle Scholar
- Razmkhah H, Abrishamchi A, Torkian A: Evaluation of spatial and temporal variation in water quality by pattern recognition techniques: a case study on Jajrood River (Tehran, Iran). J Environ Manag 2010, 91: 852–860. 10.1016/j.jenvman.2009.11.001View ArticleGoogle Scholar
- Gharibi H, Sowlat MH, Mahvi AH, Mahmoudzadeh H, Arabalibeik H, Keshavarz M, Karimzadeh N, Hassani G: Development of a dairy cattle drinking water quality index (DCWQI) based on fuzzy inference systems. Ecol Indic 2012, 20: 228–237.View ArticleGoogle Scholar
- Gharibi H, Mahvi AH, Nabizadeh R, Arabalibeik H, Yunesian M, Sowlat MH: A novel approach in water quality assessment based on fuzzy logic. J Environ Manag 2012, 112: 87–95.View ArticleGoogle Scholar
- Bengraine K, Marhaba TF: Using principal component analysis to monitor spatial and temporal changes in water quality. J Hazard Mater 2003, 100: 179–195. 10.1016/S0304-3894(03)00104-3View ArticleGoogle Scholar
- Singh KP, Malik A, Sinha S: Water quality assessment and apportionment of pollution sources of Gomti river (India) using multivariate statistical techniques - a case study. Anal Chim Acta 2005, 538: 355–374. 10.1016/j.aca.2005.02.006View ArticleGoogle Scholar
- Zhou F, Liu Y, Guo H: Application of multivariate statistical methods to water quality assessment of the watercourses in northwestern new territories, hong kong. Environ Monit Assess 2007, 132: 1–13. 10.1007/s10661-006-9497-xView ArticleGoogle Scholar
- R Core Team: R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2012. ISBN 3–900051–07–0, URL http://www.R-project.org/ Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.