Indicating if water is safe for human consumption using an enhanced machine learning approach
Abstract
Predicting water quality accurately is critically important in real-life water resource management. This work proposes an approach based on supervised machine learning to predict water quality. Motivated, by the success of the non-smooth loss function for supervised learning problems [22], we reformulate the learning problem as a regularized optimization problem whose fidelity term is the hinge loss function and the hypothesis space is a polynomial approximation. To deal with the non-differentiability of the loss function, a special smoothing function is proposed. Then, the obtained optimization problem is solved by an improved conjugate gradient algorithm. Finally,some experiments results are presented.References
Rochmi, MN. "Access to clean water is still very far from the target". Accessed from: https://beritagar.id/article/editorial/hapuskan-perda-causing-economy-high-cost. 2016.
UNESCO. "Global Climate Change". Accessed from: www.unesco.org. August 19, 2021.
Elysia, V. “Water and Sanitation where is Indonesia's position”. National Seminar on the Role of Mathematics, Science, and Technology in achieving sustainable development goals/SDGs, FMIPA Open University. 2018. Pages 157-159.
Farooqui, A. “Invertigation of a community outbreak of typhoid fever associated with drinking water”. BMC Public Health. 2009. 9:476.
Abrischamchi, A., Ebrahimian, A., Tajrishi, M., and Marino, M. “Case Study: Application of Multicriteria Decision Making to Urban Water Supply”. J Water Resources Plan Manage. 2005. 131(4) Pages 326-335.
Cronin, A.A. “Monitoring source and domestic water quality in parallel with sanitary risk identification in Northern Mozambique to prioritise protection interventions”. J Water Health. 2006. Volume 4 Pages 333-345.
Shyamala, R. “Physicochemical Analysis of Borwell Water Samples of Telungu Palayam Area in Coimvatore District, Tamilnadu, India”. E-Journal of Chemistry. 2008. Volume 5(4), Pages 924-929.
Momba, M.N.M. “Abundance of pathogenic eschericia coli, Salmonella typhimurium and vibrio cholerae in Nkonkobe drinking water source”. J Water Health. 2006. Pages 289-296.
Eschol, J. Is fecal contamination of drinking water after collection associated with household water handling and hygiene practices? A study of urban slum households in Hyderabad, India”. Journal of Water and Health. 2009. Pages 145-154.
C. Veesommai and Y. Kiyoki. “Critical Contaminate Detection, Classification of Multiple-water-quality-parameters Values and Real-time Notification by rSPA Processes”. Surabaya ; IEEE International Electronics Sysposium, 2015.
A. Sarkar and P. Pandey, "River Water Quality Modeling Using Artificial Neural Network Technique". Aquatic Procedia, Vol.4, 2015, Pages 1070-1077.
Y. R. Ding, Y. J. Cai, P. D. Sun and B. Chen. “The Use of Combined Neural Networks and Genetic Algorithms for Prediction of River Water Quality” Journal of Applied Research and Technology. Vol.12, Issue 3, June 2014, Pages 493-499.
Noori, Roohollah, Zhiqiang Deng, Amin Kiaghadi, and Fatemeh Torabi Kachoosangi. "How Reliable Are ANN, ANFIS, and SVM Techniques for Predicting Longitudinal Dispersion Coefficient in Natural Rivers?" Journal of Hydraulic Engineering, 2015: 04015039.
Shuxiu Liang, Songlin Han, Zhaochen Sun, “Parameter optimization method for the water quality dynamic model based on data-driven theory”. China, 2011.
Yue Liao, Jianyu Xu, Wenjing Wang, “A method of Water Quality Assessment Based on Biomonitoring and Multiclass Support Vector Machine”. International Conferences on ESIAT, 2011.
Cortes C, Vapnik V. “Support-vector networks”. Machine learning. 1995;20(3):273–97.
Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, et al. “Top 10 algorithms in data mining”. Knowledge and information systems. 2008;14(1):1–37.
McCulloch WS, Pitts W. “A logical calculus of the ideas immanent in nervous activity”. The bulletin of mathematical biophysics. 1943;5(4):115–33.
Webb G.I. (2011) Naïve Bayes. In: Sammut C., Webb G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA.
Rosasco L., De Vito E., Caponnetto A., Piana M., Verri A. Are Loss Functions All the Same? Neural Computation. 16 (5), 1063-1076 (2004).
Lyaqini S., Quafafou M., Nachaoui M., Chakib A. Supervised learning as an inverse problem based on non-smooth loss function. Knowledge and Information Systems. 62, 3039-3058 (2020).
Lyaqini S., Nachaoui M., Quafafou M. Non-smooth classification model based on new smoothing technique. Journal of Physics: Conference Series. 1743, 012025 (2021).
S Lyaqini, M Nachaoui, A Hadri "An efficient primal-dual method for solving non-smooth machine learning problem" Chaos, Solitons & Fractals 155, 111754 (2022).
Nachaoui M. Parameter learning for combined first and second order total variation for image reconstruction. Advanced Mathematical Models and Applications. 5 (1), 53-69 (2020).
El Mourabit I., El Rhabi M., Hakim A., Laghrib A., Moreau E. A new denoising model for multi-frame super-resolution image reconstruction. Signal Processing. 132, 51-65 (2017).
Tikhonov, A. N., Arsenin, V. Y. Solutions to ill-posed problems. Winston-Wiley, New York (1977).
Hadamard, Jacques. Lectures on Cauchy's problem in linear partial differential equations. Courier Corporation, 2003.
S. Smale and D. Zhou. Shannon sampling II : Connections to learning theory. preprint, 2004.
Fletcher, R., and Reeves, C. M. (1964). Function minimization by conjugate gradients. The computer journal, 7(2), 149-154.
Lyaqini, S., Nachaoui, M., Quafafou, M. (2021). Non-smooth classification model based on new smoothing technique. In Journal of Physics: Conference Series (Vol. 1743, No. 1, p. 012025). IOP Publishing.
Lyaqini, S., Nachaoui, M. (2021, February). Identification of Genuine from Fake Banknotes Using an Enhanced Machine Learning Approach. In International Conference on Numerical Analysis and Optimization Days (pp. 59-70). Springer, Cham.
https://www.kaggle.com/adityakadiwal/water-potability.
Webb, G. I., Keogh, E., Miikkulainen, R. (2010). Naïve Bayes. Encyclopedia of machine learning, 15, 713-714.
Zhang, H., Jiang, L., Yu, L. (2021). Attribute and instance weighted naive Bayes. Pattern Recognition, 111, 107674.
Otchere, D. A., Ganat, T. O. A., Gholami, R., Ridha, S. (2021). Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: Comparative analysis of ANN and SVM models. Journal of Petroleum Science and Engineering, 200, 108182.
Khorshid, S. F., Abdulazeez, A. M. (2021). breast cancer diagnosis based on k-nearest neighbors: A review. PalArch's Journal of Archaeology of Egypt/Egyptology, 18(4), 1927-1951.
Ayyad, S. M., Saleh, A. I., Labib, L. M. (2019). Gene expression cancer classification using modified K-Nearest Neighbors technique. Biosystems, 176, 41-51.
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).