Relationship between benthic macroinvertebrate bio-indices and physicochemical parameters of water: a tool for water resources managers
© Yazdian et al.; licensee BioMed Central Ltd. 2014
Received: 25 April 2013
Accepted: 10 December 2013
Published: 10 January 2014
The ecosystem health of rivers downstream of dams has become a focus of attention for many researchers, particularly in recent years. This paper addresses the question of how the environmental health of a river ecosystem can be incorporated into water resources planning and management studies. In this study, different parameters affecting the ecosystem of river-reservoir systems, as well as various biological components of river ecosystems, have been studied, and among them benthic macro-invertebrates have been selected. Among various bio-indices, biodiversity indices have been selected as the evaluation tool. The case study of this research is the Aboulabbas River in Khuzestan province, Iran. The relationship between the biodiversity indices and physicochemical parameters has been studied using correlation analysis, Principal Component Analysis (PCA), and Genetic Programming (GP). The Margalef index was selected as the appropriate bio-index for the studied catchment area. The relationship found in this study for the first time between the Margalef bio-index and the physicochemical parameters of water in the Aboulabbas River has proved to be a useful tool for water resources managers to assess ecosystem status when only the physicochemical properties of water are known.
Keywords: Bio-diversity index; Physicochemical parameter; Genetic programming; Margalef index
Bio-indices have been recognized as suitable criteria for understanding the quality of aquatic environments. They are numerical expressions that combine quantitative values of species diversity with qualitative information on the ecological sensitivity of each taxon. Ecologists use various metrics and indices for the ecological assessment of river ecosystems. These indices can be used to predict the response of an ecosystem to different water resources management practices and environmental conditions. Given the importance of river ecosystems and the role of bio-indices in basin-scale water resources planning and management, most rivers in developed countries are constantly evaluated, and their physical, chemical and biological characteristics are monitored.
Various bio-indices have been proposed and used by ecologists in different countries. The most commonly used indices in the biological evaluation of rivers include species richness, evenness, diversity and dominance indices, BMWP (Biological Monitoring Working Party), ASPT (Average Score per Taxon) and the EPT index (the total number of Ephemeroptera, Plecoptera and Trichoptera taxa).
Although the literature on bio-indices and on criteria for understanding the quality of aquatic environments is rich, there is a gap between these studies and those related to water resources planning and management. Most previous studies in the field of water resources planning and management have focused on the socio-economic aspects of water allocation to different users, while some have also considered physicochemical water quality constraints. Bio-indices have not been used in these studies, mostly because water resources modelers are unfamiliar with these indices and because limnological measurements cover only limited periods. Previous studies, some of which are cited later in this section, show that limnological information is available only for very short periods (mostly one or two years) and for very few rivers, especially in underdeveloped countries, whereas water resources planning and management studies require long records of data (usually longer than 30 years). One approach to closing this gap, which is the focus of this study, is to find a mathematical relationship between the physicochemical properties of water and an ecological index that can reflect the overall environmental condition of a river in the study area. Since extensive databases of the physicochemical characteristics of water bodies exist for many basins around the world, finding this relationship can help in determining the quality of aquatic environments wherever no record of the quantity or diversity of species is available.
Several studies, including the following, have shown consistency between variations in biotic indices and fluctuations in the physicochemical characteristics of water. Czerniawska-Kusza studied the correlation of bio-indices and diversity indices, computed at the family level of benthic macro-invertebrates, with physicochemical variables of the Nysa Klodzka River in southern Poland, using Spearman's correlation coefficient.
Yap et al. studied variations of the benthic oligochaete Limnodrilus sp. and physicochemical parameters of water in a river in Malaysia from March 1998 to February 1999. They showed that the density and distribution of this benthic macro-invertebrate were negatively correlated with DO and pH, and positively correlated with electrical conductivity, BOD, NO3, NH3, TSS, COD, Cc and Zn.
Azrina et al. studied the correlation of the richness and diversity indices of benthic macro-invertebrate communities with physicochemical parameters of water in the Langat River, Malaysia, for four consecutive months (March–June 1999), and showed that these indices are mainly affected by the TSS and EC of the river water. They showed that the richness index has a strong negative correlation with TSS, river width and temperature, while the Simpson diversity index is strongly correlated with TSS and the electrical conductivity of water.
Latha and Thanga, in a study in India, examined variations in the Shannon diversity and evenness indices over a period of two years at six stations on the Veli and Kadinamkulam lakes. They showed that species diversity and distribution are clearly related to water quality: the more contaminated the water, the lower the diversity index. Their study also showed that the Shannon index fluctuated in a similar way to the abundance index.
Kennen et al. studied benthic macro-invertebrates in 67 small and medium-sized catchment areas in the northeastern United States and demonstrated a relationship between the EPT species richness index and the hydrological characteristics of flow.
In Iran, Nemati et al. calculated various biotic indices from samples of benthic macro-invertebrates collected in the Zayandeh-rud River. They studied the correlation between these indices and physicochemical parameters of water and concluded that the BMWP index has a significant correlation with the physicochemical parameters of water.
Monk et al.  reviewed the 22-year long-term statistics of samples collected from 14 rivers in England. They computed BMWP, EPT and Life Score biotic indices and studied their variations with respect to changes in Indicators of Hydrologic Alteration (IHA) and observed the strongest relation between biotic indices and hydrological parameters in frequency and intensity of current flow groups.
Ogleni and Topal studied the impacts of pollutants on water quality at 15 stations along the Mudurnu River, Turkey, over a 12-month period (2006 to 2007), together with biotic indices based on different organisms in the water. They showed that, out of 100 biotic indices, 60% use benthic macro-invertebrates, and that the modified ASPT and BMWP indices appear to have the strongest correlation with water quality parameters.
The above studies show that different types of bio-indices have statistically significant relationships with hydrological indicators of flow and physicochemical characteristics of water. All of the aforementioned studies have used descriptive statistics to assess this relationship. Although these types of assessments can be useful for many environmental planning and management purposes, they cannot be included in the operation management models of river-reservoir systems. The questions this study tries to answer are: 1) When modeling river-reservoir systems, which bio-index should be chosen? and 2) How can the relationship between the chosen bio-index and the physicochemical characteristics of water be quantified?
The case study of this research is Aboulabbas River in Khuzestan Province in Iran. Genetic Programming (GP) has been used in this study to obtain a quantitative relationship between biodiversity index and physicochemical characteristics of water.
Materials and methods
Using benthic macro-invertebrates for calculating bio-index
Different biotic indices have been defined and used in different regions of the world for bio-monitoring programs, some of which are accurate enough to be used in other regions too. Biological assessments can be used to identify weaknesses in ecosystems caused by pollutants or degradation of habitats. In some cases they are even more effective than physical and chemical measurements, because they are economical and take less time to evaluate.
Among the various components of aquatic ecosystems, including plants, birds, fish and macrobenthic organisms (macrobenthos), the last provides one of the best and most efficient means of biological assessment. Macrobenthos act as a link in food chains, transferring the energy stored by plants to larger animals such as fish. In river food chains, aquatic invertebrates are the primary consumers of plant material such as algae, diatoms, mosses and decaying leaves. They enter the production cycle of fish either by being consumed directly by secondary consumers or, once mature, by emerging as flying adults.
Macrobenthos are invertebrates which can be seen with the naked eye. They spend at least part of their lives in river beds. Benthic macroinvertebrates are widely used in biological monitoring studies because they are basic components of the aquatic food chains of rivers, are ubiquitous in all aquatic ecosystems, and have limited mobility, long lifespans and high species richness with varying sensitivity to pollution [5, 12–14].
The use of benthic macro-invertebrates is based on the assumption that streams and rivers unaffected by pollutants support a wider array of benthic taxa, with pollution-sensitive species dominant, whereas in polluted waters the taxa that are less tolerant of pollutants are found less often.
Parameters affecting the ecosystem of rivers
The first step in choosing an appropriate bio-index and obtaining its possible mathematical relationship with the physicochemical characteristics of river water is to identify the parameters with considerable effects on the ecosystem of the river being studied. The second step is to study the mathematical relationship between variations in the biotic indices and these physicochemical characteristics. In this step, the biotic index that shows the strongest statistical relationship with the physicochemical parameters can be selected. Some of the physicochemical characteristics of river water bodies that most influence ecosystems are listed below:
River discharge is the most important hydrologic characteristic of rivers. It has direct and indirect impacts on ecosystem health. While river discharge directly satisfies the needs of species in rivers, it also indirectly changes the physical and chemical quality of water.
Water velocity is among the major characteristics affecting river ecosystems. It has significant effects on the morphology of river beds and the movement of sediments, both of which have impacts on various species. Floods and all types of hydrologic alterations can significantly change ecosystem health one way or another.
In addition to the hydrological conditions of the river, water quality parameters also play a major role in ecosystem health. Any change in water quality can lead to variations in the composition of plant and animal species. The most important water quality parameters in terms of impact on aquatic ecosystems include temperature, salinity, acidity (pH), turbidity, Total Dissolved Solids (TDS), DO and BOD5. Many physical processes and chemical and biological transformations are sensitive to temperature variations. An increase in salinity in freshwater ecosystems generally decreases biodiversity and may reduce the available food resources. Generally, lower pH (that is, higher acidity) leads to reduced biodiversity and altered species composition of various invertebrate communities. Increased turbidity reduces light penetration depth and thus limits the growth of aquatic species. Since oxygen is needed for aerobic respiration of aquatic species, low DO concentration is harmful to plants and aquatic organisms [15–17].
Various bio-indices have been proposed and used by ecologists in different countries, such as species richness index, evenness index, species diversity index, dominance index, and BMWP, EPT and ASPT indices.
The evenness index describes how individuals are distributed among the species of a community. The more even the species distribution is (i.e. the more similar the numbers of individual organisms or the abundances of the species are), the higher the stability, which results in greater biodiversity. Species richness indicates the presence of various species and is calculated as the number of species in an area. An increasing number of taxa can be due to habitat diversity, the suitability of the water or its improved quality. The dominance index reflects the abundance of some species over others and is used as an index in biodiversity assessments. The species diversity index combines species richness and evenness into a single quantity. Higher biodiversity indices indicate less stress in ecosystems, higher abundance and a more even distribution of species in the ecosystem. Various studies have also shown this point, some of which are cited in this paper.
Considering the various biotic indices, diversity indices appear to be more appropriate for river ecosystem health assessment [18–20]. It has been stated that the diversity index increases with an increased number of species or an increased total number of organisms in populations; when the population of the various species is distributed evenly, the diversity index increases as well.
The Shannon, Simpson, and Margalef diversity indices have been used by several researchers to assess biodiversity. These indices have also been used in this study and are therefore introduced in more detail in the following sections.
Shannon diversity index
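The Shannon index H′ is commonly written in the standard form below. The natural logarithm is an assumption here, since the logarithm base is a convention that is not stated in this text:

```latex
H' = -\sum_{i=1}^{s} P_i \ln P_i
```

where: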
P_i: relative abundance of the ith taxon in the sample.
s: total number of taxa in the sample.
It has been emphasized in the literature that the Shannon diversity index is a fast and reliable tool for identifying major changes in the community structure of benthic species. It has also been shown that the seasonal patterns of the Shannon diversity index and of species richness and evenness are similar to seasonal changes in species abundance and composition.
Simpson diversity index
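One form of the Simpson index that is consistent with the description below, in which zero corresponds to the lowest diversity, is the Gini-Simpson form. Choosing this form is an assumption, with P_i and s defined as for the Shannon index:

```latex
D_S = 1 - \sum_{i=1}^{s} P_i^{2}
```

For s equally abundant taxa, this form reaches its maximum value of 1 − 1/s.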
In this index, lower weights are assigned to rare species and higher weights to common species. The index values are in the range of zero (lowest diversity) to (highest diversity).
Margalef diversity index
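The Margalef index is commonly defined in the standard form below, with s the total number of taxa in the sample as in the Shannon index (the symbol D_Mg is chosen here for illustration):

```latex
D_{Mg} = \frac{s - 1}{\ln N}
```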
Where N is the total number of individuals.
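As a minimal sketch, the three diversity indices can be computed from a list of individual counts per taxon. The function names and the sample counts are hypothetical, the Simpson index is assumed to be in the Gini-Simpson form (1 minus the sum of squared relative abundances), and natural logarithms are assumed throughout.

```python
import math


def shannon_index(counts):
    """Shannon index: -sum(P_i * ln P_i) over taxa with nonzero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def simpson_index(counts):
    """Gini-Simpson form 1 - sum(P_i^2); an assumed form, 0 = lowest diversity."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


def margalef_index(counts):
    """Margalef index (s - 1) / ln N, with s taxa present and N individuals."""
    s = sum(1 for c in counts if c > 0)
    n = sum(counts)
    if n <= 1:
        raise ValueError("Margalef index needs more than one individual")
    return (s - 1) / math.log(n)


# Hypothetical sample: individuals counted per taxon (not data from the study)
sample = [10, 10, 10, 10]
```

For the perfectly even hypothetical sample above, the Shannon index equals ln 4 and the Gini-Simpson index equals 1 − 1/4, which illustrates how evenness drives both indices toward their maxima for a fixed number of taxa.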
In order to find the relationship between the bio-indices and the physicochemical characteristics of the river, GP has been used in this study. This technique is briefly described in the following section.
Genetic algorithm for programming (Genetic programming)
To obtain a formula expressing the relationship between the biotic index and the qualitative and quantitative characteristics of water, GP has been used. GP was first proposed by Koza in 1992. The first step in GP is to randomly generate an initial population built from two kinds of elements, namely functions and terminals. Depending on the type of problem, functions can be basic operations such as addition, subtraction, multiplication and division, logical functions such as AND, OR and NOT, or any other function. Terminals include variables and, if desired, constants.
In GP, functions and terminals are randomly selected, and each member of the population is represented as a tree with functions at its root and branches, which ultimately end at the terminals. After a random initial population has been generated, which serves as the parent population for the first generation, each member is evaluated; this evaluation can be carried out in different ways depending on the type of problem. From the initial population, a new population is formed using selection methods such as roulette wheel or tournament selection. The GP operators, namely "reproduction", "crossover" and "mutation", are then applied to this new population.
GP has proved to be a useful tool especially when the relationship between variables is unknown or the size and form of relationship is complex and difficult to formulate, as well as when no approach can be presented by analytical and mathematical methods for establishing relationship between variables [25, 26].
In applying GP to determine the relationship between the bio-indices and the physicochemical characteristics of water, all parameters have first been standardized to the range [0, 1] to avoid differences in magnitude between the parameters. The basic mathematical operators of addition, subtraction, multiplication, and division have been used as functions. Because GP, like other evolutionary methods, starts from randomly generated initial solutions, the estimated equation can differ from run to run. Various relationships between the dependent variable (biotic index) and the independent variables (qualitative and quantitative parameters) have therefore been obtained from 100 runs of GP, and the best relationship has been selected based on the highest correlation coefficient. It is worth mentioning that, since the GP algorithm uses random operators, the literature suggests choosing the final result from several runs.
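The standardization and run-selection steps described above can be sketched as follows. This is an illustrative outline only: min-max scaling is assumed as the way of mapping parameters to [0, 1], the Pearson coefficient is assumed as the correlation measure, and the function names are hypothetical.

```python
import math


def min_max_scale(values):
    """Scale a list of numbers linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant series carries no information
    return [(v - lo) / (hi - lo) for v in values]


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def best_run(observed, runs):
    """Return (index, r) of the run whose estimates correlate best with observations.

    `runs` holds one list of estimated index values per GP run.
    """
    scores = [pearson_r(observed, estimates) for estimates in runs]
    best = max(range(len(runs)), key=lambda i: scores[i])
    return best, scores[best]
```

With 100 GP runs, `runs` would contain 100 lists of estimates and `best_run` would return the position of the equation to retain.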
To formulate a relationship between a bio-diversity index and the physicochemical parameters of water, GP has been applied again after removing the discharge variable from the independent variables. 80% of the available dataset has been used for training and 20% for validation. The GP parameters used in this study are as follows:
mutation rate = 0.1;
population size = 300;
maximum number of generations = 500;
function set = addition, subtraction, multiplication and division.
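To make the procedure concrete, the following is a minimal tree-based GP sketch for symbolic regression that uses the parameter values listed above as defaults. It is not the implementation used in the study: tournament selection of size 3, elitist reproduction, the protected division, the node limit and all function names are assumptions made for illustration.

```python
import math
import operator
import random


def protected_div(a, b):
    """Division that returns 1.0 when the denominator is (near) zero."""
    return a / b if abs(b) > 1e-9 else 1.0


FUNCTIONS = [operator.add, operator.sub, operator.mul, protected_div]


def random_tree(n_vars, depth, rng):
    """Grow a random tree: (func, left, right), ("x", index) or ("c", constant)."""
    if depth == 0 or rng.random() < 0.3:
        if rng.random() < 0.7:
            return ("x", rng.randrange(n_vars))
        return ("c", rng.uniform(-1.0, 1.0))
    return (rng.choice(FUNCTIONS),
            random_tree(n_vars, depth - 1, rng),
            random_tree(n_vars, depth - 1, rng))


def evaluate(tree, row):
    """Evaluate a tree on one row of (scaled) input variables."""
    if tree[0] == "x":
        return row[tree[1]]
    if tree[0] == "c":
        return tree[1]
    return tree[0](evaluate(tree[1], row), evaluate(tree[2], row))


def mse(tree, X, y):
    """Mean squared error of a tree; non-finite results count as infinitely bad."""
    total = 0.0
    for row, target in zip(X, y):
        diff = evaluate(tree, row) - target
        total += diff * diff
    err = total / len(y)
    return err if math.isfinite(err) else float("inf")


def nodes(tree, path=()):
    """Yield (path, subtree) pairs; a path is a tuple of child positions (1 or 2)."""
    yield path, tree
    if tree[0] not in ("x", "c"):
        yield from nodes(tree[1], path + (1,))
        yield from nodes(tree[2], path + (2,))


def replace_at(tree, path, new):
    """Return a copy of tree with the subtree at path replaced by new."""
    if not path:
        return new
    parts = list(tree)
    parts[path[0]] = replace_at(tree[path[0]], path[1:], new)
    return tuple(parts)


def crossover(a, b, rng):
    """Copy a random subtree of b into a random position of a."""
    path, _ = rng.choice(list(nodes(a)))
    _, donor = rng.choice(list(nodes(b)))
    return replace_at(a, path, donor)


def mutate(tree, n_vars, rng):
    """Replace a random subtree with a freshly grown one."""
    path, _ = rng.choice(list(nodes(tree)))
    return replace_at(tree, path, random_tree(n_vars, 2, rng))


def run_gp(X, y, pop_size=300, generations=500, mutation_rate=0.1,
           max_nodes=60, seed=0):
    """Evolve an expression for y; returns (best_tree, best_mse)."""
    rng = random.Random(seed)
    n_vars = len(X[0])
    pop = [random_tree(n_vars, 3, rng) for _ in range(pop_size)]
    best, best_fit = None, float("inf")
    for _ in range(generations):
        fit = [mse(t, X, y) for t in pop]
        i_best = min(range(pop_size), key=fit.__getitem__)
        if best is None or fit[i_best] < best_fit:
            best, best_fit = pop[i_best], fit[i_best]

        def pick():  # tournament selection of size 3 (an assumed choice)
            return pop[min(rng.sample(range(pop_size), 3), key=fit.__getitem__)]

        new_pop = [best]  # reproduction: carry the best individual over unchanged
        while len(new_pop) < pop_size:
            child = crossover(pick(), pick(), rng)
            if rng.random() < mutation_rate:
                child = mutate(child, n_vars, rng)
            if sum(1 for _ in nodes(child)) > max_nodes:
                child = pick()  # reject oversized offspring
            new_pop.append(child)
        pop = new_pop
    return best, best_fit
```

Calling `run_gp` with different `seed` values on the training rows (scaled physicochemical parameters in `X`, the bio-index in `y`) would mimic repeated independent runs; the smaller `pop_size` and `generations` arguments are useful for quick experiments.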
In Iran, very few studies on aquatic ecosystems can be found and very little information is available. In recent years, some efforts have been made to further characterize and assess aquatic environments in some catchment areas.
It is worth mentioning that no sampling has been carried out after 2007. Since no major development or land use change has happened in the basin, it is assumed that the results of this study are still valid for water resources planning purposes.
Results and discussions
Correlation between bio-indices and physicochemical parameters of the Aboulabbas river data
To conduct a more accurate analysis, the available data for all of the stations, including the biotic indices and the qualitative and quantitative parameters, have been clustered using the K-means clustering technique. K-means is a simple clustering method with low computational complexity that can easily be applied to many practical problems. The K-means algorithm belongs to the category of squared error-based clustering. For all three selected bio-indices, the winter data fell into one cluster and the data for the rest of the year into another. For this reason, the cluster containing the spring, summer and autumn data has been used in GP to establish a relationship between the bio-indices and the physicochemical parameters. Since the TDS and EC parameters are highly correlated, only EC has been used as an independent variable.
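The clustering step can be illustrated with a basic K-means sketch (Lloyd's algorithm) for two clusters. This is a generic illustration, not the authors' implementation; the function name, the random initialization from the data points and the default of two clusters are assumptions.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k=2, iterations=100, seed=0):
    """Cluster points into k groups; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Assumed initialization: k distinct data points chosen at random
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

Each point here would be one sampling record made of scaled bio-index and physicochemical values; the returned labels can then be compared with the sampling season of each record.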
One hundred GP runs provided equations for estimating each of the biotic indices with various degrees of accuracy. The results presented in Table 2 show how often each of the physicochemical parameters appears in the equations obtained for calculating each of the biotic indices. The results show that, for all of the bio-indices, only a small percentage of the obtained equations include the river discharge. Moreover, the DO and BOD5 parameters appear most frequently in the obtained equations.
Table 2 Percent of presence of each physicochemical parameter in the equations obtained from GP for calculating the bio-indices
Table 3 Rotated component matrix
Table 4 Comparison of statistics (mean, standard deviation and correlation coefficient) between observed and calculated values of the Margalef index (columns include estimation error (%) and mean square error)
Correlation analysis has been carried out between the values of the three bio-indices calculated from the observations and the values estimated using the equations obtained from GP. As mentioned earlier, 100 GP runs have been carried out for each index. The results of the correlation analysis show that the values estimated by GP for the Margalef diversity index have a higher correlation with the observation-based values than those for the other two indices. Therefore, the Margalef index is chosen in this study.
MI: Margalef diversity index,
DO: dissolved oxygen (mg L⁻¹),
T: water temperature (°C),
EC: electrical conductivity of the water (μmhos cm⁻¹), and
BOD5: five-day biochemical oxygen demand (mg L⁻¹).
To assess the accuracy of this relationship, summary statistics of the observed and estimated values of the index are presented in Table 4.
The results indicate that the obtained relationship is reasonably accurate for both the training and validation datasets. Table 4 indicates that the error of the equation in estimating the average value of the Margalef index is about 3.8% and 5.04% for the training and validation datasets, respectively. The standard deviations of the calculated and observed values of the index differ by 6.6% and 11.60% in the training and validation datasets, respectively. The correlation coefficients between the observed and estimated values of the Margalef index for the training and validation datasets show that the accuracy of the proposed relationship is acceptable. The mean square errors for the training and validation datasets are relatively close, which suggests that no overfitting has occurred.
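The summary statistics discussed above can be computed along the following lines. This is a sketch under assumptions: the estimation error is taken as the absolute percentage difference relative to the observed statistic, the sample standard deviation (n − 1 denominator) is used, and the function name and dictionary keys are hypothetical.

```python
import math


def compare_observed_estimated(observed, estimated):
    """Percent errors in mean and standard deviation, correlation and MSE."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_e = sum(estimated) / n
    # Sums of squared deviations and cross-deviations about the means
    ss_o = sum((o - mean_o) ** 2 for o in observed)
    ss_e = sum((e - mean_e) ** 2 for e in estimated)
    s_oe = sum((o - mean_o) * (e - mean_e) for o, e in zip(observed, estimated))
    sd_o = math.sqrt(ss_o / (n - 1))
    sd_e = math.sqrt(ss_e / (n - 1))
    return {
        "mean_error_pct": 100.0 * abs(mean_e - mean_o) / abs(mean_o),
        "sd_error_pct": 100.0 * abs(sd_e - sd_o) / sd_o,
        "correlation": s_oe / math.sqrt(ss_o * ss_e),
        "mse": sum((o - e) ** 2 for o, e in zip(observed, estimated)) / n,
    }
```

Applying the function separately to the training and validation subsets and comparing the two `mse` values gives a simple check on overfitting of the kind described above.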
The aim of this study has been to provide a tool for assessing the biodiversity of river ecosystems that can be used by water resources planners and reservoir operators. The major obstacle in this study has been the lack of long-term data. The accuracy of the proposed equation could be significantly improved if long-term observations were available. Despite this limitation, the novelty of this work lies in the methodology used to choose the biotic index and the physicochemical parameters for estimating it. The equation proposed in this study for estimating the Margalef index is based on the environmental conditions of the study region, and we do not claim that it would work as well in other regions as it does for the Aboulabbas River, because the diversity and even the abundance of benthic macroinvertebrates depend on various physicochemical properties of water and on the specific environmental conditions of each ecosystem. A larger dataset could also lead to more accurate mathematical relationships between ecological target indices and various water quality parameters.
Further research can be dedicated to finding similar equations for other rivers in the region, especially the headwaters of the Aboulabbas River, to assess whether the same conclusions about the choice of bio-index and physicochemical parameters are valid for them. For larger datasets, it would also be worth investigating whether the accuracy of the relationship can be increased by making it sensitive to the overall pollution level of the river water.
The authors are grateful to Mahab Ghodss Consulting Company for providing the database used in this study.
References
1. Czerniawska-Kusza I: Comparing modified biological monitoring working party score system and several biological indices based on macroinvertebrates for water quality assessment. Limnol 2005, 35: 169–176. doi:10.1016/j.limno.2005.05.003
2. Wetzel RG: Lake and river ecosystems. Third Edit: Limnology; 2002.
3. Jager HI, Brennant S: Sustainable reservoir operation: can we generate hydropower and preserve ecosystem values? River Res Appl 2008, 24: 340–352. doi:10.1002/rra.1069
4. Yap CK, Rahim Ismail A, Azrina MZ, Ismail A, Tan SG: The influential of physico-chemical parameters on the distributions of oligochateas (Limnodrilus sp.) at the polluted downstream of the tropical Langat river, peninsular Malaysia. J Appl Sci Environ Mgt 2006, 10: 135–140.
5. Azrina MZ, Yap CK, Ismail AR, Ismail A, Tan SG: Anthropogenic impacts on the distribution and biodiversity of benthic macroinvertebrates and water quality of the Langat River, Peninsular Malaysia. Ecotoxicol Environ Saf 2006, 64: 337–347. doi:10.1016/j.ecoenv.2005.04.003
6. Latha C, Thanga VSG: Macroinvertebrate diversity of Veli and Kadinamkulam lakes, South Kerala, India. J Environ Biol 2010, 547: 543–547.
7. Kennen JG, Riva-Murray K, Beaulieu KM: Determining hydrologic factors that influence stream macroinvertebrate assemblages in the northeastern US. Ecohydrol 2010, 106: 88–106.
8. Nemati Varnosfaderany M, Ebrahimi E, Mirghaffary N, Safyanian A: Biological assessment of the Zayandeh Rud river, Iran, using benthic macroinvertebrates. Limnol 2010, 40: 226–232. doi:10.1016/j.limno.2009.10.002
9. Monk WA, Wood PJ, Hannah M, Wilson DA: Short communication selection of river flow indices for the assessment of hydroecological change. River Res Appl 2007, 122: 113–122.
10. Ogleni N, Topal B: Water quality assessment of the Mudurnu river, Turkey, using biotic indices. Water Resour Manage 2011, 25(11):2487–2508.
11. Karr JR: Rivers as sentinels: using the biology of rivers to guide landscape management. Springer New York: Final report for USEPA; 1998:28.
12. Taylor BR, Bailey RC: Technical evaluation on methods for benthic invertebrate data analysis and interpretation, final report. AETE project 2.1.3. Ottawa: National Resources Canada, CANMET; 1997:26.
13. Rosenberg DM, Davies IJ, Cobb DG, Wiens AP: Protocols for measuring biodiversity: Benthic macroinvertebrates in fresh waters. Canada: Report for the Ecological Monitoring and Assessment Network (EMAN) Biodiversity Science Board (BSB); 1998:43.
14. Iliopoulou-Georgudaki J, Kantzatis V, Katharios P, Kaspiris T, Montesantou B: An application of different bioindicators for assessing water quality: a case study in the rivers Alfeios and Pineios. Ecol Indic 2003, 2: 345–360. doi:10.1016/S1470-160X(03)00004-9
15. Boulton AJ, Brock MA: Australian freshwater ecology. Processes and Management: Gleneagles Publishing; 1999.
16. EPA: Draft state environment protection policy (waters of Victoria). Melbourne: Environment Protection Authority Victoria; 2001.
17. ANZECC & ARMCANZ: Australian and New Zealand guidelines for fresh and marine water quality. National Water Quality Management Strategy Paper No 4. Canberra: Australian and New Zealand Environment and Conservation Council & Agriculture and Resource Management Council of Australia and New Zealand; 2000.
18. Wilsey B, Stirling G: Species richness and evenness respond in a different manner to propagule density in developing prairie microcosm communities. Plant Ecol 2007, 190: 259–273. doi:10.1007/s11258-006-9206-4
19. Gallardo B, Gascón S, Quintana X, Comín FA: How to choose a biodiversity indicator? – redundancy and complementarity of biodiversity metrics in a freshwater ecosystem. Ecol Indic 2011, 11(5):1177–1184. doi:10.1016/j.ecolind.2010.12.019
20. Pielou EC: Ecological diversity. New Jersey, USA: John Wiley & Sons Inc; 1975.
21. Pettersson M: Monitoring a freshwater fish population: statistical surveillance of biodiversity. Environ 1998, 9(2):139–150.
22. Kuo SR, Lin HJ, Shao KT: Seasonal changes in abundance and composition of the fish assemblage in Chiku Lagoon, southwestern Taiwan. Bull Mar Sci 2001, 68(1):85–99.
23. Koza J: Genetic programming: on the programming of computers by natural selection. Cambridge, Massachusetts, USA: The MIT Press; 1992.
24. Yang YCE, Cai X, Herricks EE: Identification of hydrologic indicators related to fish diversity and abundance: a data mining approach for fish community analysis. Water Resour Res 2008, 44: W04412. doi:10.1029/2006WR005764
25. Sivapragasam C, Maheswaran R, Venkatesh V: Genetic programming approach for flood routing in natural channels. Hydrol Processes 2008, 628: 623–628.
26. Rezapour OM, Shui LT, Dehghani AA: Review of genetic programing in water resource engineering 1. Sci 2010, 4(11):5663–5667.
27. Nasseri M, Zahraie B: Application of simple clustering on space-time mapping of mean monthly rainfall pattern. Int J Climatol 2011, 31(5):732–741. doi:10.1002/joc.2109
28. Ouyang Y: Evaluation of river water quality monitoring stations by principal component analysis. Water Res 2005, 39(12):2621–2635. doi:10.1016/j.watres.2005.04.024
29. Shrestha S, Kazama F: Assessment of surface water quality using multivariate statistical techniques: a case study of the Fuji river basin, Japan. Environ Modell Softw 2007, 22: 464–475. doi:10.1016/j.envsoft.2006.02.001
30. Noori R, Karbassi AR, Sabahi MS: Evaluation of PCA and Gamma test techniques on ANN operation for weekly solid waste predicting. J Environ Manag 2010, 91(3):767–771. doi:10.1016/j.jenvman.2009.10.007
31. Noori R, Karbassi A, Khakpour A, Mohammadi H, Badam K, Vesali- M: Chemometric analysis of surface water quality data: case study of the Gorganrud river basin, Iran. Environ Model Assess 2012, 17(4):411–420. doi:10.1007/s10666-011-9302-2
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.