Penerapan Algoritma XGBoost dalam Klasifikasi Jumlah Korban Kecelakaan Kereta Api di Indonesia
Main Article Content
Abstract
This study aims to classify the number of vehicle accident casualties caused by railway accidents in Indonesia into low, medium, and high-risk categories using the XGBoost algorithm, as well as to evaluate the model performance based on accuracy, precision, and recall metrics. The employed methodology is CRISP-DM, consisting of stages such as business understanding, data understanding, data preparation, modeling, evaluation, and deployment stages. The dataset was obtained from official reports of the National Transportation Safety Committee (KNKT) and online news articles from 1991 to early 2025, resulting in 112 valid records after preprocessing, including data labeling, transformation of nominal attributes, and conversion of date data into numerical form. The classification process was carried out using RapidMiner. The results show that the XGBoost model achieved an accuracy of 88.39%, with the highest precision and recall values in the low-risk class (0.91 and 0.94) and high-risk class (0.88 and 0.87), while the performance for the medium-risk class remains relatively low (precision 0.75 and recall 0.68), indicating potential data imbalance or insufficient discriminative features. Based on these findings, it can be concluded that the XGBoost algorithm is effective in classifying railway accident risk levels; however, improvements in data quality and feature selection are still needed to achieve more optimal performance.
Article Details
Section
How to Cite
References
Anisah, S., & Irwansyah, I. (2025). Analisis Data Mining untuk Klasifikasi Kafe Populer di Jakarta Menggunakan Decision Tree dan Visualisasi dengan Tableau. Jutisi : Jurnal Ilmiah Teknik Informatika Dan Sistem Informasi, 14(2), 890. https://doi.org/10.35889/jutisi.v14i2.2660
Ari Rama Novryadi, Irwansyah, & Moh Shidqon. (2026). Analisis Sentimen Masyarakat Terhadap Kinerja Presiden Indonesia Joko Widodo Periode Kedua Menggunakan Metode Naïve Bayes dan SVM. DIGINTEL-AI : DIGital INnovation and inTELligence – AI, 1(1), 11–24. https://doi.org/10.66217/digintel-ai.v1i1.2
Bagaskara, W., Pusparini, N. N., & Irwansyah, I. (2024). Klasifikasi Penjadwalan Kerja Perawatan Air Conditioner (Ac) Menggunakan Algoritma Decision Tree (C4.5) Pada Pt Xyz. Infotech: Journal of Technology Information, 10(1), 11–20. https://doi.org/10.37365/jti.v10i1.240
Bridgelall, R., & Tolliver, D. D. (2021). Railroad accident analysis using extreme gradient boosting. Accident Analysis & Prevention, 156, 106126. https://doi.org/10.1016/j.aap.2021.106126
Budiyantara, A., Irwansyah, I., Prengki, E., Pratama, P. A., & Wiliani, N. (2020). Komparasi Algoritma Decision Tree, Naive Bayes Dan K-Nearest Neighbor Untuk Memprediksi Mahasiswa Lulus Tepat Waktu. JITK (Jurnal Ilmu Pengetahuan dan Teknologi Komputer), 5(2), 265–270. https://doi.org/10.33480/jitk.v5i2.1214
Ihza Kurniawan, D., Irwansyah, & Taufik, A. (2026). Analisis Sentimen Terhadap Komentar Video IShowSpeed Tour Indonesia Pada YouTube Menggunakan Metode SVM. DIGINTEL-AI : DIGital INnovation and inTELligence – AI, 1(1), 50–62. https://doi.org/10.66217/digintel-ai.v1i1.5
Irwansyah, I., Dittyata, R., Rizal, R., & Wiyono, W. (2024). Optimalisasi Klasifikasi Uji Emisi Sepeda Motor Menggunakan Algoritma Naïve Bayes. Infotech: Journal of Technology Information, 10(2), 337–242. https://doi.org/10.37365/jti.v10i2.327
Liu, J., Wang, Y., Deng, C., Hou, F., Jin, Z., Qiao, L., & Wang, G. (2025). A new procedure for assessing and predicting the severity of accidents: A case study on freight-train derailments. Journal of Loss Prevention in the Process Industries, 94, 105511. https://doi.org/10.1016/j.jlp.2024.105511
Liu, X., Turla, T., & Zhang, Z. (2018). Accident-Cause-Specific Risk Analysis of Rail Transport of Hazardous Materials. Transportation Research Record: Journal of the Transportation Research Board, 2672(10), 176–187. https://doi.org/10.1177/0361198118794532
Nafisah Nurul Hakim. (2020). Implementasi Machine Learning pada Sistem Prediksi Kejadian dan Lokasi Patah Rel Kereta Api di Indonesia. Jurnal Sistem Cerdas, 3(1), 25–35. https://doi.org/10.37396/jsc.v3i1.58
Permatasari, M. P., Kristanto, D., Ervianty, R. M., Salam, M. D., Dwikesumasari, P. R., & Zulfa, V. R. (2024). Persepsi Kepuasan Layanan Transportasi Kereta Api Pt. Kereta Api Indonesia (Pt.Kai): Survey Pada Mahasiswa Yang Berkuliah Di SurabayA. 4(10).
Salsa Billa Permana Putri, Irwansyah, & Tri M, T. (2026). Implementasi Algoritma K-NN Pada Sosial Media X Untuk Analisis Sentimen Pengalaman Warganet Tinggal Di Luar Negeri. DIGINTEL-AI : DIGital INnovation and inTELligence – AI, 1(1), 37–49. https://doi.org/10.66217/digintel-ai.v1i1.4
Senkondo, E., Chimba, D., Madalo, M., Yeboah, A., & Blue, S. (2025). Comparative Analysis of Machine Learning and Statistical Models for Railroad–Highway Grade Crossing Safety. Vehicles, 7(4), 163. https://doi.org/10.3390/vehicles7040163
Sinai, N. K., Dzulkifli, D., Pertiwi, S., Chentia, R., Halimatussakdiyah, H., Furqan, M., Alfaiz, A. B., Nafyla, A., Arrafif, C. R., Zawira, S., & Windasari, N. (2024). Kematian Akibat Kecelakaan Kereta Api: Laporan Kasus. Jurnal Ilmu Kesehatan Indonesia, 5(3), 268–272. https://doi.org/10.25077/jikesi.v5i3.1246
Wang, D., He, Q., Peng, J., & Li, G. (2025). Enhancing Intelligent Transportation Safety with Explainable AI: A Framework for Uncovering Crash Severity Factors at Highway–Rail Grade Crossings. World Electric Vehicle Journal, 16(11), 637. https://doi.org/10.3390/wevj16110637
Zhou, X., Lu, P., Zheng, Z., Tolliver, D., & Keramati, A. (2020). Accident Prediction Accuracy Assessment for Highway-Rail Grade Crossings Using Random Forest Algorithm Compared with Decision Tree. Reliability Engineering & System Safety, 200, 106931. https://doi.org/10.1016/j.ress.2020.106931