Identifikasi Kelayakan Air Minum Dengan Metode Analisis Komponen Utama Berbasis Entropi

  • Thommy willay Universitas Widya Dharma Pontianak
  • Jimmy Tjen Universitas Widya Dharma Pontianak
  • Paskalia Kartini Universitas Widya Dharma Pontianak
  • Riyadi Jimmy Iskandar Universitas Widya Dharma Pontianak
DOI: https://doi.org/10.52158/jacost.v5i2.815
I will put the dimension here
Keywords: Analisis Komponen Utama, Air bersih, Entropi, Potabilitas air

Abstract

The need for clean water is a fundamental requirement that must be met by humans, as water constitutes 60 to 70% of the total human body weight. Therefore, it is important to be able to determine the quality of the water entering the body, as consuming unsafe water will bring various diseases, such as diarrhea, and in severe cases might lead to death. This study aimed to investigate the factors which determine the potability of drinking water. Specifically, this research aims to produce a fault detection algorithm that can detect the potability of water samples based on Principal Component Analysis (PCA) and entropy-based subset selection methods. This paper addresses the linearity problem that commonly occurred in PCA by finding a subset of data that has a good entropy relation among the parameters contained in the subset, thus maintaining linearity in the data. There were 8 parameters considered in this reseach: pH, hardness, total dissolved solids, chloramines, sulfate, conductivity, organics carbon, trihalomethanes and turbidity. The experiment was conducted with 811 water samples, where 645 samples were used to train the model and the rest for validating the model predictive accuracy. Based on experiments conducted, it is confirmed that the proposed algorithm can determine the potability of drinking water samples from synthetic data sourced from India with an accuracy of over 98% for potable water data and 100% for non-potable water data.

Downloads

Download data is not yet available.

References

Daftar Pustaka

F. A. Padder and A. Bashir, “Scarcity of water in the twenty-first century: Problems and potential remedies,” MEDALION JOURNAL: Medical Research, Nursing, Health and Midwife Participation, vol. 4, no. 1, pp. 1–5, 2023.

K. de Mello et al., “Multiscale land use impacts on water quality: Assessment, planning, and future perspectives in Brazil,” J Environ Manage, vol. 270, p. 110879, Sep. 2020, doi: 10.1016/j.jenvman.2020.110879.

M. Salehi, “Global water shortage and potable water safety; Today’s concern and tomorrow’s crisis,” Environ Int, vol. 158, p. 106936, Jan. 2022, doi: 10.1016/j.envint.2021.106936.

A. C. Johnson et al., “Identification and Quantification of Microplastics in Potable Water and Their Sources within Water Treatment Works in England and Wales,” Environmental Science & Technology, vol. 54, no. 19, pp. 12326–12334, Aug. 2020, doi: 10.1021/acs.est.0c03211.

Biro Pusat Statistik (BPS), “persentase rumah tangga menurut provinsi tipe daerah dan sumber air minum layak.” Accessed: Jan. 06, 2023. [Online]. Available: https://www.bps.go.id/indicator/29/854/1/persentase-rumah-tangga-menurut-provinsi-tipe-daerah-dan-sumber-air-minum-layak.html.

H. M. Gomes, J. Read, A. Bifet, J. P. Barddal, and J. Gama, “Machine learning for streaming data: state of the art, challenges, and opportunities,” ACM SIGKDD Explorations Newsletter, vol. 21, no. 2, pp. 6–22, Nov. 2019, doi: 10.1145/3373464.3373470.

A. Nandy, C. Duan, and H. J. Kulik, “Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery,” Curr Opin Chem Eng, vol. 36, p. 100778, Jun. 2022, doi: 10.1016/j.coche.2021.100778.

S. Karamizadeh, S. M. Abdullah, A. A. Manaf, M. Zamani, and A. Hooman, “An Overview of Principal Component Analysis,” Journal of Signal and Information Processing, vol. 04, no. 03, pp. 173–175, 2013, doi: 10.4236/jsip.2013.43b031.

B. M. Salih Hasan and A. M. Abdulazeez, “A Review of Principal Component Analysis Algorithm for Dimensionality Reduction,” Journal of Soft Computing and Data Mining, vol. 02, no. 01, Apr. 2021, doi: 10.30880/jscdm.2021.02.01.003.

V. Pratama and J. Tjen, “Entropy-based subset selection principal component analysis for diabetes risk factor identification,” J Emerg Investig, 2023, doi: 10.59720/23-015.

M. Tripathi and S. K. Singal, “Use of Principal Component Analysis for parameter selection for development of a novel Water Quality Index: A case study of river Ganga India,” Ecol Indic, vol. 96, pp. 430–436, Jan. 2019, doi: 10.1016/j.ecolind.2018.09.025.

W. Yang, Y. Zhao, D. Wang, H. Wu, A. Lin, and L. He, “Using Principal Components Analysis and IDW Interpolation to Determine Spatial and Temporal Changes of Surface Water Quality of Xin’anjiang River in Huangshan, China,” Int J Environ Res Public Health, vol. 17, no. 8, p. 2942, Apr. 2020, doi: 10.3390/ijerph17082942.

S. Abdelaziz, M. I. Gad, and A. H. M. H. El Tahan, “Groundwater quality index based on PCA: Wadi El-Natrun, Egypt,” Journal of African Earth Sciences, vol. 172, p. 103964, Dec. 2020, doi: 10.1016/j.jafrearsci.2020.103964.

G. Thanh Nguyen, “Evaluating Current Water Quality Monitoring System on Hau River, Mekong Delta, Vietnam Using Multivariate Statistical Techniques,” Applied Environmental Research, pp. 14–25, Jan. 2020, doi: 10.35762/aer.2020.42.1.2.

J. Tjen, F. Smarra, and A. D’Innocenzo, “An entropy-based sensor selection algorithm for structural damage detection,” in 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), IEEE, Aug. 2020, pp. 1566–1571. doi: 10.1109/case48305.2020.9216828.

I. M. Johnstone and D. Paul, “PCA in High Dimensions: An Orientation,” Proceedings of the IEEE, vol. 106, no. 8, pp. 1277–1292, Aug. 2018, doi: 10.1109/jproc.2018.2846730.

J. B. Schreiber, “Issues and recommendations for exploratory factor analysis and principal component analysis,” Research in Social and Administrative Pharmacy, vol. 17, no. 5, pp. 1004–1011, May 2021, doi: 10.1016/j.sapharm.2020.07.027.

J. Tjen and V. Pratama, “Penentuan Jalur Diagnostik Penyakit Berbasis Konsep Pembelajaran Mesin: Studi kasus Penyakit Hepatitis C,” Journal of Applied Computer Science and Technology, vol. 4, no. 2, pp. 124–130, Nov. 2023, doi: 10.52158/jacost.v4i2.556.

F. Smarra, J. Tjen, and A. D’Innocenzo, “Learning methods for structural damage detection via entropy‐based sensors selection,” International Journal of Robust and Nonlinear Control, vol. 32, no. 10, pp. 6035–6067, Mar. 2022, doi: 10.1002/rnc.6124.

L. Wang, “Enhanced fault detection for nonlinear processes using modified kernel partial least squares and the statistical local approach,” Can J Chem Eng, vol. 96, no. 5, pp. 1116–1126, Nov. 2017, doi: 10.1002/cjce.23058.

A. Kadiwal, “Water Quality,” 2023. Accessed: Jan. 06, 2023. [Online]. Available: https://www.kaggle.com/datasets/adityakadiwal/water potability.

M. Grandini, E. Bagli, and G. Visani, “Metrics for Multi-Class Classification: an Overview,” 2020.

Published
2024-12-31
How to Cite
[1]
T. willay, J. Tjen, P. Kartini, and R. Jimmy Iskandar, “Identifikasi Kelayakan Air Minum Dengan Metode Analisis Komponen Utama Berbasis Entropi ”, J. Appl. Comput. Sci. Technol., vol. 5, no. 2, pp. 136 - 143, Dec. 2024.
Bookmark and Share