Source of Publication
Water has become intricately linked to the United Nations' sixteen sustainable development goals. Access to clean drinking water is crucial for health, a fundamental human right, and a component of successful health protection policies. Clean water is a significant health and development issue on a national, regional, and local level. Investments in water supply and sanitation have been shown to produce a net economic advantage in some areas because they reduce adverse health effects and medical expenses more than they cost to implement. However, numerous pollutants are affecting the quality of drinking water. This study evaluates the efficiency of using machine learning (ML) techniques in order to predict the quality of water. Thus, in this paper, a machine learning classifier model is built to predict the quality of water using a real dataset. First, significant features are selected. In the case of the used dataset, all measured characteristics are chosen. Data are split into training and testing subsets. A set of existing ML algorithms is applied, and the results are compared in terms of precision, recall, F1 score, and ROC curve. The results show that support vector machine and k-nearest neighbor are better according to F1-score and ROC AUC values. However, The LASSO LARS and stochastic gradient descent are better based on recall values.
Computer Sciences | Environmental Sciences
water quality, machine learning, sustainability, supervised machine learning, drinking water
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Kaddoura, Sanaa, "Evaluation of Machine Learning Algorithm on Drinking Water Quality for Better Sustainability" (2022). All Works. 5387.
Indexed in Scopus
Open Access Type
Gold: This publication is openly available in an open access journal/series