Acquiring Independent Components through Hybrid PCA and ICA to Enhance the Classification Performance of Decision Tree

  • Achmad Efendi Statistics Department, Universitas Brawijaya, Malang, East Java, Indonesia
  • Zuraidah Zuraidah Islamic Banking Study Program, State Islamic Institute of Kediri, Kediri, East Java, Indonesia
  • Dewi Sri Susanti Statistics Study Program, Lambung Mangkurat University, Banjarbaru, South Kalimantan, Indonesia
  • Naomi N. Debataraja Statistics Study Program, Tanjungpura University, Pontianak, West Kalimantan, Indonesia
  • Ratno B.E. Wibowo Mathematics Department, Universitas Brawijaya, Malang, East Java, Indonesia
  • Samingun Handoyo Statistics Department, Universitas Brawijaya, Malang, East Java, Indonesia; EECS-IGP Department, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
Keywords: Confusion matrix; independent component; Matthews Correlation; Variable extraction; Empirical data; Simulation

Abstract

Principal Component Analysis (PCA) is widely used for modeling in both statistics and machine learning. However, the orthogonal components that PCA produces are uncorrelated but not necessarily independent. This research compares PCA and Independent Component Analysis (ICA) on simulated and empirical data and evaluates a Decision Tree (DT) model. Two simulation scenarios, one with linear and one with nonlinear relationships, and two empirical datasets were analyzed. PCA was used to project the data, while ICA was applied to the 6th to 10th and the 5th to 9th principal components. Both PCA and ICA produced projected data with zero correlation values. Scatter plots of the PCA projection of the nonlinear simulation data indicated consistent underlying patterns, whereas the ICA projection revealed sparse patterns in both simulation datasets. The DT model using 7 independent components emerged as the best model, with superior performance in accuracy, precision, recall, F1 score, Matthews Correlation Coefficient, and Area Under Curve.
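As a rough illustration of how most of the metrics listed in the abstract follow from a confusion matrix, the sketch below computes accuracy, precision, recall, F1 score, and the Matthews Correlation Coefficient from four cell counts. It assumes a binary classification task, which the abstract does not state; the function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper. Area Under Curve is omitted because it requires predicted scores, not just counts.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Compute common metrics from binary confusion-matrix counts.

    tp, fp, fn, tn are the true-positive, false-positive,
    false-negative, and true-negative counts. Ratios with a zero
    denominator are reported as 0.0 (a convention assumed here).
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    pr_sum = precision + recall
    f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
    # MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (not results from the paper).
m = binary_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in m.items():
    print(f"{name}: {value:.4f}")
```

Unlike accuracy, the Matthews Correlation Coefficient uses all four cells of the confusion matrix symmetrically, which is why it is often reported alongside the other metrics when classes are imbalanced.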
Published
2025-02-14
How to Cite
Efendi, A., Zuraidah Zuraidah, Susanti, D. S., Debataraja, N. N., Wibowo, R. B., & Handoyo, S. (2025). Acquiring Independent Components through Hybrid PCA and ICA to Enhance the Classification Performance of Decision Tree. Statistics, Optimization & Information Computing, 13(5), 1832-1846. https://doi.org/10.19139/soic-2310-5070-2175
Section
Research Articles