Identifikasi Kelayakan Air Minum Dengan Metode Analisis Komponen Utama Berbasis Entropi
I will put the dimension here
Abstract
The need for clean water is a fundamental requirement that must be met by humans, as water constitutes 60 to 70% of the total human body weight. Therefore, it is important to be able to determine the quality of the water entering the body, as consuming unsafe water will bring various diseases, such as diarrhea, and in severe cases might lead to death. This study aimed to investigate the factors which determine the potability of drinking water. Specifically, this research aims to produce a fault detection algorithm that can detect the potability of water samples based on Principal Component Analysis (PCA) and entropy-based subset selection methods. This paper addresses the linearity problem that commonly occurred in PCA by finding a subset of data that has a good entropy relation among the parameters contained in the subset, thus maintaining linearity in the data. There were 8 parameters considered in this reseach: pH, hardness, total dissolved solids, chloramines, sulfate, conductivity, organics carbon, trihalomethanes and turbidity. The experiment was conducted with 811 water samples, where 645 samples were used to train the model and the rest for validating the model predictive accuracy. Based on experiments conducted, it is confirmed that the proposed algorithm can determine the potability of drinking water samples from synthetic data sourced from India with an accuracy of over 98% for potable water data and 100% for non-potable water data.
Downloads
References
Daftar Pustaka
F. A. Padder and A. Bashir, “Scarcity of water in the twenty-first century: Problems and potential remedies,” MEDALION JOURNAL: Medical Research, Nursing, Health and Midwife Participation, vol. 4, no. 1, pp. 1–5, 2023.
K. de Mello et al., “Multiscale land use impacts on water quality: Assessment, planning, and future perspectives in Brazil,” J Environ Manage, vol. 270, p. 110879, Sep. 2020, doi: 10.1016/j.jenvman.2020.110879.
M. Salehi, “Global water shortage and potable water safety; Today’s concern and tomorrow’s crisis,” Environ Int, vol. 158, p. 106936, Jan. 2022, doi: 10.1016/j.envint.2021.106936.
A. C. Johnson et al., “Identification and Quantification of Microplastics in Potable Water and Their Sources within Water Treatment Works in England and Wales,” Environmental Science & Technology, vol. 54, no. 19, pp. 12326–12334, Aug. 2020, doi: 10.1021/acs.est.0c03211.
Biro Pusat Statistik (BPS), “persentase rumah tangga menurut provinsi tipe daerah dan sumber air minum layak.” Accessed: Jan. 06, 2023. [Online]. Available: https://www.bps.go.id/indicator/29/854/1/persentase-rumah-tangga-menurut-provinsi-tipe-daerah-dan-sumber-air-minum-layak.html.
H. M. Gomes, J. Read, A. Bifet, J. P. Barddal, and J. Gama, “Machine learning for streaming data: state of the art, challenges, and opportunities,” ACM SIGKDD Explorations Newsletter, vol. 21, no. 2, pp. 6–22, Nov. 2019, doi: 10.1145/3373464.3373470.
A. Nandy, C. Duan, and H. J. Kulik, “Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery,” Curr Opin Chem Eng, vol. 36, p. 100778, Jun. 2022, doi: 10.1016/j.coche.2021.100778.
S. Karamizadeh, S. M. Abdullah, A. A. Manaf, M. Zamani, and A. Hooman, “An Overview of Principal Component Analysis,” Journal of Signal and Information Processing, vol. 04, no. 03, pp. 173–175, 2013, doi: 10.4236/jsip.2013.43b031.
B. M. Salih Hasan and A. M. Abdulazeez, “A Review of Principal Component Analysis Algorithm for Dimensionality Reduction,” Journal of Soft Computing and Data Mining, vol. 02, no. 01, Apr. 2021, doi: 10.30880/jscdm.2021.02.01.003.
V. Pratama and J. Tjen, “Entropy-based subset selection principal component analysis for diabetes risk factor identification,” J Emerg Investig, 2023, doi: 10.59720/23-015.
M. Tripathi and S. K. Singal, “Use of Principal Component Analysis for parameter selection for development of a novel Water Quality Index: A case study of river Ganga India,” Ecol Indic, vol. 96, pp. 430–436, Jan. 2019, doi: 10.1016/j.ecolind.2018.09.025.
W. Yang, Y. Zhao, D. Wang, H. Wu, A. Lin, and L. He, “Using Principal Components Analysis and IDW Interpolation to Determine Spatial and Temporal Changes of Surface Water Quality of Xin’anjiang River in Huangshan, China,” Int J Environ Res Public Health, vol. 17, no. 8, p. 2942, Apr. 2020, doi: 10.3390/ijerph17082942.
S. Abdelaziz, M. I. Gad, and A. H. M. H. El Tahan, “Groundwater quality index based on PCA: Wadi El-Natrun, Egypt,” Journal of African Earth Sciences, vol. 172, p. 103964, Dec. 2020, doi: 10.1016/j.jafrearsci.2020.103964.
G. Thanh Nguyen, “Evaluating Current Water Quality Monitoring System on Hau River, Mekong Delta, Vietnam Using Multivariate Statistical Techniques,” Applied Environmental Research, pp. 14–25, Jan. 2020, doi: 10.35762/aer.2020.42.1.2.
J. Tjen, F. Smarra, and A. D’Innocenzo, “An entropy-based sensor selection algorithm for structural damage detection,” in 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), IEEE, Aug. 2020, pp. 1566–1571. doi: 10.1109/case48305.2020.9216828.
I. M. Johnstone and D. Paul, “PCA in High Dimensions: An Orientation,” Proceedings of the IEEE, vol. 106, no. 8, pp. 1277–1292, Aug. 2018, doi: 10.1109/jproc.2018.2846730.
J. B. Schreiber, “Issues and recommendations for exploratory factor analysis and principal component analysis,” Research in Social and Administrative Pharmacy, vol. 17, no. 5, pp. 1004–1011, May 2021, doi: 10.1016/j.sapharm.2020.07.027.
J. Tjen and V. Pratama, “Penentuan Jalur Diagnostik Penyakit Berbasis Konsep Pembelajaran Mesin: Studi kasus Penyakit Hepatitis C,” Journal of Applied Computer Science and Technology, vol. 4, no. 2, pp. 124–130, Nov. 2023, doi: 10.52158/jacost.v4i2.556.
F. Smarra, J. Tjen, and A. D’Innocenzo, “Learning methods for structural damage detection via entropy‐based sensors selection,” International Journal of Robust and Nonlinear Control, vol. 32, no. 10, pp. 6035–6067, Mar. 2022, doi: 10.1002/rnc.6124.
L. Wang, “Enhanced fault detection for nonlinear processes using modified kernel partial least squares and the statistical local approach,” Can J Chem Eng, vol. 96, no. 5, pp. 1116–1126, Nov. 2017, doi: 10.1002/cjce.23058.
A. Kadiwal, “Water Quality,” 2023. Accessed: Jan. 06, 2023. [Online]. Available: https://www.kaggle.com/datasets/adityakadiwal/water potability.
M. Grandini, E. Bagli, and G. Visani, “Metrics for Multi-Class Classification: an Overview,” 2020.
Copyright (c) 2024 Thommy willay, Jimmy Tjen, Paskalia Kartini, Riyadi Jimmy Iskandar
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Pernyataan Hak Cipta dan Lisensi
Dengan mengirimkan manuskrip ke Journal of Applied Computer Science and Technology (JACOST), penulis setuju dengan kebijakan ini. Tidak diperlukan persetujuan dokumen khusus.
- Hak cipta pada setiap artikel adalah milik penulis.
- Penulis mempertahankan semua hak mereka atas karya yang diterbitkan, tak terbatas pada hak-hak yang diatur dalam laman ini.
- Penulis mengakui bahwa Journal of Applied Computer Science and Technology (JACOST) sebagai yang pertama kali mempublikasikan dengan lisensi Creative Commons Atribusi 4.0 Internasional (CC BY-SA).
- Penulis dapat memasukan tulisan secara terpisah, mengatur distribusi non-ekskulif dari naskah yang telah terbit di jurnal ini kedalam versi yang lain (misal: dikirim ke respository institusi penulis, publikasi kedalam buku, dll), dengan mengakui bahwa naskah telah terbit pertama kali pada Journal of Applied Computer Science and Technology (JACOST);
- Penulis menjamin bahwa artikel asli, ditulis oleh penulis yang disebutkan, belum pernah dipublikasikan sebelumnya, tidak mengandung pernyataan yang melanggar hukum, tidak melanggar hak orang lain, tunduk pada hak cipta yang secara eksklusif dipegang oleh penulis.
- Jika artikel dipersiapkan bersama oleh lebih dari satu penulis, setiap penulis yang mengirimkan naskah menjamin bahwa dia telah diberi wewenang oleh semua penulis bersama untuk menyetujui hak cipta dan pemberitahuan lisensi (perjanjian) atas nama mereka, dan setuju untuk memberi tahu rekan penulis persyaratan kebijakan ini. Journal of Applied Computer Science and Technology (JACOST) tidak akan dimintai pertanggungjawaban atas apa pun yang mungkin timbul karena perselisihan internal penulis.
Lisensi :
Journal of Applied Computer Science and Technology (JACOST) diterbitkan berdasarkan ketentuan Lisensi Creative Commons Atribusi 4.0 Internasional (CC BY-SA). Lisensi ini mengizinkan setiap orang untuk :.
- Berbagi — menyalin dan menyebarluaskan kembali materi ini dalam bentuk atau format apapun;
- Adaptasi — menggubah, mengubah, dan membuat turunan dari materi ini untuk kepentingan apapun.
Lisensi :
-
Atribusi — Anda harus mencantumkan nama yang sesuai, mencantumkan tautan terhadap lisensi, dan menyatakan bahwa telah ada perubahan yang dilakukan. Anda dapat melakukan hal ini dengan cara yang sesuai, namun tidak mengisyaratkan bahwa pemberi lisensi mendukung Anda atau penggunaan Anda.
-
BerbagiSerupa — Apabila Anda menggubah, mengubah, atau membuat turunan dari materi ini, Anda harus menyebarluaskan kontribusi Anda di bawah lisensi yang sama dengan materi asli.