Modelling water quality parameters using model tree, random forest, and non-linear regression for Mula-Mutha River, Pune, India.


Journal

Environmental monitoring and assessment
ISSN: 1573-2959
Titre abrégé: Environ Monit Assess
Pays: Netherlands
ID NLM: 8508350

Informations de publication

Date de publication:
12 Oct 2024
Historique:
received: 18 07 2024
accepted: 30 09 2024
medline: 12 10 2024
pubmed: 12 10 2024
entrez: 12 10 2024
Statut: epublish

Résumé

Evaluation of vital water-quality indicators, especially biological and chemical demand of oxygen (BOD and COD), is important for environmental factors, human health, and agricultural output. In the recent past, data-driven techniques (DDT) offer the ability to automate water quality assessment with more reliable and rapid evaluation. The present study thus aims to utilize various DDTs: random forest (RF), model tree (MT), and non-linear-regression (NLR) to predict vital water quality indicators such as BOD and COD for the three stretches of Mula-Mutha River, Pune, India. Since the river has three stretches: Mutha, Mula, and Mula-Mutha respectively, BOD-COD models have been developed separately for each using MT, RF, and NLR. Data analysis using a violin diagram is done to understand the data characteristics. Further, the models developed were developed using the appropriate input parameters for predicting BOD and COD. Error measures including coefficient of correlation (R), mean absolute error (MAE), and root mean square error (RMSE) were used to evaluate the constructed models. The Taylor diagram, scatter plot, and hydrograph were also used for visual performance analysis. The findings suggest that the MT and RF techniques exhibit a stronger connection between the actual and anticipated levels of BOD and COD, with NLR following closely behind. Practical acceptance of these approaches is increased by RF in the form of trees, MT with an output in the form of a sequence of equations, and NLR with a single equation. These findings help us gain insight into DDT's water quality assessment model, which will also help future researchers and water quality professionals make decisions.

Identifiants

pubmed: 39395072
doi: 10.1007/s10661-024-13206-9
pii: 10.1007/s10661-024-13206-9
doi:

Substances chimiques

Water Pollutants, Chemical 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1047

Informations de copyright

© 2024. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Références

Abolfathi, S., Yeganeh-Bakhtiary, A., Hamze-Ziabari, S. M., & Borzooei, S. (2015). Wave run up prediction using M5 model tree algorithm. OceanEng, 112, 76–81. https://doi.org/10.1016/j.oceaneng.2015.12.016
doi: 10.1016/j.oceaneng.2015.12.016
Antanasijević, D., Pocajt, V., Povrenović, D., & Perić-Grujić, R. (2013). Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo simulation uncertainty analysis. Journal of Hydrology, 519, 1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009
doi: 10.1016/j.jhydrol.2014.10.009
Basant, N., Gupta, S., Malik, A., & Singh, K. P. (2010). Linear and nonlinear modelling for simultaneous prediction of dissolved oxygen and biochemical oxygen demand of the surface water—A case study. Chemometrics and Intelligent Laboratory Systems, 104, 172–180. https://doi.org/10.1016/2Fj.chemolab.2010.08.005
doi: 10.1016/2Fj.chemolab.2010.08.005
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
doi: 10.1023/A:1010933404324
Emamgholizadeh, S., Kashi, H., Marofpoor, I., & Zalaghi, E. (2014). Prediction of water quality parameters of Karoon River (Iran) by artificial intelligence-based models. International Journal of Environmental Science and Technology, 11, 645–656. https://doi.org/10.1007/s13762-013-0378-x
doi: 10.1007/s13762-013-0378-x
Granata, F., Papirio, S., Esposito, G., Gargano, R., & De Marinis, G. D. (2017). Machine learning algorithms for the forecasting of wastewater quality indicators. Water, 9(2), 105–117. https://doi.org/10.3390/w9020105
doi: 10.3390/w9020105
Hafeez, S., Wong, M. S., Ho, H. C., Nazeer, M., Nichol, J., Abbas, S., & Pun, L. (2019). Comparison of machine learning algorithms for retrieval of water quality indicators in case-II waters: A case study of Hong Kong. Remote Sensing, 11(6), 617–630. https://doi.org/10.3390/rs11060617
doi: 10.3390/rs11060617
Hashmi, S., Halawani, S. M., Barukab, O. M., & Ahmad, A. (2015). “Model trees and sequential minimal optimization based support vector machine models for estimating minimum surface roughness value. Applied Mathematical Modelling, 39, 1119–1136. https://doi.org/10.1016/J.APM.2014.07.026
doi: 10.1016/J.APM.2014.07.026
Heddam, S., & Kisi, O. (2018). “Modelling daily dissolved oxygen concentration using least square support vector machine”, Multivariate Adaptive Regression Splines and M5 model Tree. Journal of Hydrology, 559, 499. https://doi.org/10.1016/J.JHYDROL.2018.02.061
doi: 10.1016/J.JHYDROL.2018.02.061
Hem, J. D. (1989). Study and interpretation of the chemical characteristic of natural water. In U.S. geological survey of water-supply paper (3rd ed., p. 2254).
Hore, A., Dutta, S., Datta, S., & Bhattacharjee, C. (2008). Application of an artificial neural network in wastewater quality monitoring: Prediction of water quality index. International Journal of Nuclear Desalination, 3(2), 160–174. https://doi.org/10.1504/IJND.2008.020223
doi: 10.1504/IJND.2008.020223
Ji, X., & Lu, J. (2018). Forecasting riverine total nitrogen loads using wavelet analysis and support vector regression combination model in an agricultural watershed. Environmental Science and Pollution Research, 25, 26405–26422. https://doi.org/10.1007/s11356-018-2698-3
doi: 10.1007/s11356-018-2698-3
Keshtegar, B., Heddam, S., & Hosseinabadi, H. (2019). The employment of polynomial chaos expansion approach for modeling dissolved oxygen concentration in river. Environment and Earth Science, 78, 34. https://doi.org/10.1007/s12665-018-8028-8
doi: 10.1007/s12665-018-8028-8
Legates, D. R., & McCabe, G. J. (1999). Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Resources Research, 35, 233–241. https://doi.org/10.1029/1998WR900018
doi: 10.1029/1998WR900018
Olyaie, E., Zare, A. H., & Danandeh, M. A. (2017). A comparative analysis among computational intelligence techniques for dissolved oxygen prediction in Delaware River. Geoscience Frontiers, 8, 517–527. https://doi.org/10.1016/2Fj.gsf.2016.04.007
doi: 10.1016/2Fj.gsf.2016.04.007
Palani, S., Liong, S., & Tkalich, P. (2008). An ANN application for water quality forecasting. Marine Pollution Bulletin, 56(09), 1586–1597. https://doi.org/10.1016/j.marpolbul.2008.05.021
doi: 10.1016/j.marpolbul.2008.05.021
Pengcheng, L., Xianguo, W., Hongyu, C., & Tiemei, Z. (2020). August, “Prediction of compressive strength of high-performance concrete by random forest algorithm.” In IOP conference series: Earth and environmental science, 552(9), 012020. https://doi.org/10.1088/1755-1315/552/1/012020
doi: 10.1088/1755-1315/552/1/012020
Quinlan, J. R. (2014). Learning with continuous classes. In Proceedings of the fifth australian joint conference on artificial intelligence, Hobart, Australia, 16–18. Singapore, p. 343–348. https://doi.org/10.4236/ojas.2014.43017 .
Radtke, D. B., Davis, J. V., & Wilde, F. D. (2005). Specific electrical conductance, techniques of water-resources. 9th ed. Supersedes USGS Techniques of Water-Resources Investigations, 1–22. https://doi.org/10.3133/twri09A6.3 .
Ranković, V., Radulović, J., Radojević, I., Ostojić, A., & Čomić, L. (2010). Neural network modeling of dissolved oxygen in the Gruža reservoir, Serbia. Ecol Modell, 221, 1239–1244. https://doi.org/10.1016/j.ecolmodel.2009.12.023
doi: 10.1016/j.ecolmodel.2009.12.023
Sattari, M. T., Joudi, A. R., & Kusiak, A. (2016). Estimation of water quality parameters with data-driven model. Journal-American Water Works Association, 108(4), 232–239. https://doi.org/10.5942/jawwa.2016.108.0012
doi: 10.5942/jawwa.2016.108.0012
Sihag, P., Sahar, M. K., & Angelaki, A. (2019). Random forest, M5P and regression analysis to estimate the field unsaturated hydraulic conductivity. Applied Water Science, 9(129), 1–9. https://doi.org/10.1007/s13201-019-1007-8
doi: 10.1007/s13201-019-1007-8
Singh, K. P., Basant, A., Malik, A., & Jain, G. (2009). Artificial neural network modeling of the river water quality a case study. Ecological Modelling, 220, 888–895. https://doi.org/10.1016/j.ecolmodel.2009.01.004
doi: 10.1016/j.ecolmodel.2009.01.004
Solomatine, D. P., & Xue, Y. (2004). M5 model trees compared to neural networks: Application to flood forecasting in the upper reach of the Huai River in China. Journal of Hydrologic Engineering, 9, 491–501. https://doi.org/10.1061/(ASCE)1084-0699(2004)9:6(491)
doi: 10.1061/(ASCE)1084-0699(2004)9:6(491)
Tao, H., Sulaiman, S.O., Yaseen, Z.M., Asadi, H., Meshram, S.G., Ghorbani, M.A., (2018), “What is the potential of integrating phase space reconstruction with SVM-FFA data-intelligence model? Application of rainfall forecasting over regional scale”, Water Resource Management, 32(12), 3935– 3959. https://link.springer.com/article/ https://doi.org/10.1007/s11269-018-2028-z
Tyralis, H., Georgia, P., & Langousis, A. (2019). A brief review of random forests for water scientists and practitioners and their recent history in water resources. Water, 11(5), 1–37.
doi: 10.3390/w11050910
Verma, A. K., & Singh, T. N. (2013). Prediction of water quality from simple field parameters. Environmental Earth Sciences, 69, 821–829. https://doi.org/10.1007/s12665-012-1967-6
doi: 10.1007/s12665-012-1967-6
Xiang, S. L., Liu, Z. M., & Ma, L. P. (2006). Study of multivariate linear regression analysis model for groundwater quality prediction. Guizhou Science, 24, 60–62.
Zain, M. F. M., & Abd, S. M. (2009). “Multiple regression model for compressive strength prediction of high performance concrete. Journal of Applied Science, 9, 155–160. https://doi.org/10.3923/jas.2009.155.160
doi: 10.3923/jas.2009.155.160
Zhang, C. (2010) Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics. 2010; 38, 894–942. https://doi.org/10.1214/09-AOS729 [ https://www.cms.waikato.ac.nz/~ml/weka/ ].

Auteurs

Pali Sahu (P)

Civil Department, Oriental College of Technology (OCT), Bhopal, India. palisahu18@gmail.com.

Shreenivas N Londhe (SN)

Civil Department, Vishwakarma Institute of Information Technology (VIIT), Pune, India.

Preeti S Kulkarni (PS)

Civil Department, Vishwakarma Institute of Information Technology (VIIT), Pune, India.

Articles similaires

Vancomycin Polyesters Anti-Bacterial Agents Models, Theoretical Drug Liberation
Animals Lung India Sheep Transcriptome
India Carbon Sequestration Environmental Monitoring Carbon Biomass
Rivers Turkey Biodiversity Environmental Monitoring Animals

Classifications MeSH