Decision Tree Model for Predicting Work Schedules Using Scikit-Learn
Main Article Content
Abstract
Predicting category and numerical data, such as working schedule data, is difficult since it necessitates a specific process. A decision tree is one of many categorization methods that can handle both category and numerical input. Scikit learn, a python library that may be used for decision trees, is one example. Although Scikit-optimized learn's CART algorithm could only handle numerical data, it did provide certain features to deal with categorical data. To forecast working schedules, this study used scikit-learn to create a decision tree model. There are 54 variables, three of which are category and one of which is numerical. A 6-depth decision tree model was created as a result of the implementation. The evaluation yielded a positive outcome, with accuracy and precision above 0.7 and 0.9, respectively. The optimal division of data is 30% validation and 70% training. In comparison to KNN, the decision tree model has higher accuracy, with decision tree accuracy exceeding 0.8 while KNN accuracy is below.
Downloads
Article Details
Aguilar-chinea, R. M., Castilla, I., Expósito, C., Aguilar-chinea, R. M., Castilla, I., Moreno-vega, J. M., & Moreno-vega, J. M. (2019). Using a decision tree algorithm to predict the robustness of a transshipment schedule. Procedia Computer Science, 149, 529–536. https://doi.org/10.1016/j.procs.2019.01.172
Ceballos, F. (2019). Scikit-Learn Decision Trees Explained - Training, Visualizing, and Making Predictions with Decision Trees. Retrieved from https://towardsdatascience.com/scikit-learn-decision-trees-explained-803f3812290d
Ellson, J., Gansner, E., Koutsofios, L., North, S., & Woodhull, G. (2001). Graphviz - Open Source Graph Drawing Tools. In Lecture Notes in Computer Science (pp. 483–484). Springer-Verlag. Retrieved from https://graphviz.gitlab.io/
Floares, A. G., Calin, G. A., & Manolache, F. B. (2016). Bigger Data is Better for Molecular Diagnosis Tests Based on Decision Trees. In In: Tan Y., Shi Y. (eds) Data Mining and Big Data. DMBD 2016. Lecture Notes in Computer Science, vol 9714 (pp. 288–295). Springer, Cham. https://doi.org/https://doi.org/10.1007/978-3-319-40973-3_29
Loh, W. (2011). Classification and Regression Trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, (January 2011). https://doi.org/10.1002/widm.8
Navlani, A. (2018). Decision Tree Classification in Python. Retrieved from https://www.datacamp.com/community/tutorials/decision-tree-classification-python
Ochiai, Y., Masuma, Y., & Tomii, N. (2019). Improvement of timetable robustness by analysis of drivers’ operation based on decision trees. Journal of Rail Transport Planning & Management, 9(March), 57–65. https://doi.org/10.1016/j.jrtpm.2019.03.001
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., … Duchesnay, E. (2011). Scikit-learn: Machine Learning in {P}ython. Journal of Machine Learning Research, 12, 2825--2830.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., … Duchesnay, E. (2019). Decision Tree. Retrieved from https://scikit-learn.org/stable/modules/tree.html
Ridok, A., & Latifah, R. (2015). Klasifikasi Teks Bahasa Indonesia Pada Corpus Tak Seimbang Menggunakan NWKNN. In Konferensi Nasional Sistem & Informatika 2015 (pp. 9–10).
Topîrceanu, A., & Grosseck, G. (2017). Decision tree learning used for the classification of student archetypes in online courses. Procedia Computer Science, 112, 51–60. https://doi.org/10.1016/j.procs.2017.08.021
Vaish, P. (2017). Decision Trees in scikit-learn. Retrieved August 14, 2019, from https://adataanalyst.com/scikit-learn/decision-trees-scikit-learn/

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.