Accurate and interpretable water quality prediction is crucial for environmental monitoring and public health. This study evaluates six machine learning models—Random Forest, Long Short-Term Memory (LSTM), K-Nearest Neighbors (KNN), Linear Regression, Ridge Regression, and Support Vector Regression (SVR)—using real-world groundwater data from ARPAE. Model performance was assessed via Mean Absolute Error (MAE) and Mean Squared Error (MSE), while SHAP values were employed for feature-level interpretability. Results indicate that Random Forest outperforms all models in both accuracy and explainability, whereas SVR demonstrates poor predictive capability and lacks meaningful interpretability. The study highlights the trade-offs between predictive power and transparency, offering insights for selecting appropriate models in water quality monitoring systems.

A Comparative Study of Machine Learning Algorithms for Water Quality Prediction Using SHAP-based Explainability / Cabri, G.; Rahimi, A.. - (2025), pp. 1-6. ( 2025 33rd International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) Catania, Italy July 23rd-25th, 2025) [10.1109/WETICE67341.2025.11091841].

A Comparative Study of Machine Learning Algorithms for Water Quality Prediction Using SHAP-based Explainability

Cabri G.;Rahimi A.
2025

Abstract

Accurate and interpretable water quality prediction is crucial for environmental monitoring and public health. This study evaluates six machine learning models—Random Forest, Long Short-Term Memory (LSTM), K-Nearest Neighbors (KNN), Linear Regression, Ridge Regression, and Support Vector Regression (SVR)—using real-world groundwater data from ARPAE. Model performance was assessed via Mean Absolute Error (MAE) and Mean Squared Error (MSE), while SHAP values were employed for feature-level interpretability. Results indicate that Random Forest outperforms all models in both accuracy and explainability, whereas SVR demonstrates poor predictive capability and lacks meaningful interpretability. The study highlights the trade-offs between predictive power and transparency, offering insights for selecting appropriate models in water quality monitoring systems.
2025
2025 33rd International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)
Catania, Italy
July 23rd-25th, 2025
1
6
Cabri, G.; Rahimi, A.
A Comparative Study of Machine Learning Algorithms for Water Quality Prediction Using SHAP-based Explainability / Cabri, G.; Rahimi, A.. - (2025), pp. 1-6. ( 2025 33rd International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE) Catania, Italy July 23rd-25th, 2025) [10.1109/WETICE67341.2025.11091841].
File in questo prodotto:
File Dimensione Formato  
WETICE25water_dalsito.pdf

Accesso riservato

Tipologia: VOR - Versione pubblicata dall'editore
Dimensione 1.36 MB
Formato Adobe PDF
1.36 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

Licenza Creative Commons
I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11380/1403188
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact