Density-Adaptive Clustering of Multivariate Angular Data Using Dirichlet Process Mixture Models with Circular Normal Distribution for Artificial Intelligence Applications

  • Said Benlakhdar SmartiLab Laboratory, Moroccan School of Engineering Sciences (EMSI), Rabat, Morocco; LRIT URAC 29, Faculty of Sciences, Mohammed V University, Rabat, Morocco
  • Saralees Nadarajah Department of Mathematics, University of Manchester, UK
  • Mohammed Rziza LRIT URAC 29, Faculty of Sciences, Mohammed V University, Morocco
  • Rachid Oulad Haj Thami RIITM, ENSIAS, Mohammed V University, Morocco
Keywords: Dirichlet process, Mixture distributions, Nonparametric Bayesian model, Clustering, Ward's algorithm, Modeling multivariate angular data

Abstract

Data clustering is an essential technique for organizing unlabeled data, automatically extracting subjects, and quickly retrieving or filtering information. In this study, we address the task of clustering multivariate angular data using nonparametric Bayesian mixture models with von Mises (circular normal) component distributions. Our approach operates within a nonparametric Bayesian framework built on the Dirichlet process. Unlike finite mixture models, it does not fix the number of clusters in advance: the model allows an infinite number of clusters a priori and infers the appropriate number automatically from the data. Moreover, we introduce a unified approach that combines Ward's algorithm, the Dirichlet process, and von Mises mixture distributions (DPM-MvM) to capture both the structure and the variability inherent in the data. We develop a variational inference algorithm for DPM-MvM that determines the number of clusters automatically. Our experimental results demonstrate the efficiency and accuracy of our method for analyzing multivariate angular data in comparison with state-of-the-art approaches.
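As an illustration of the kind of model the abstract describes, the following is a minimal generative sketch of a Dirichlet process mixture of von Mises components, using a truncated stick-breaking construction. It is not the authors' variational inference algorithm and does not include the Ward's-algorithm step. The function names, the truncation level, the priors on the component parameters, and the assumption that the angular coordinates are independent within a cluster are all hypothetical choices made for this example.

```python
import math
import random


def stick_breaking_weights(alpha, truncation, rng):
    """Truncated stick-breaking weights for a Dirichlet process prior."""
    weights, remaining = [], 1.0
    for k in range(truncation):
        # The last stick takes whatever is left so the weights sum to 1.
        frac = 1.0 if k == truncation - 1 else rng.betavariate(1.0, alpha)
        weights.append(frac * remaining)
        remaining *= 1.0 - frac
    return weights


def sample_dp_von_mises(n, dim, alpha=1.0, truncation=20, seed=0):
    """Draw n points of dim angles each from a truncated DP mixture.

    Assumption: coordinates are independent von Mises within a cluster,
    with illustrative uniform/gamma priors on the mean and concentration.
    """
    rng = random.Random(seed)
    weights = stick_breaking_weights(alpha, truncation, rng)
    means = [[rng.uniform(0.0, 2.0 * math.pi) for _ in range(dim)]
             for _ in range(truncation)]
    kappas = [[rng.gammavariate(2.0, 5.0) for _ in range(dim)]
              for _ in range(truncation)]
    # Cluster assignment for each point, drawn from the mixture weights.
    labels = rng.choices(range(truncation), weights=weights, k=n)
    angles = [[rng.vonmisesvariate(means[z][d], kappas[z][d])
               for d in range(dim)]
              for z in labels]
    return angles, labels, weights
```

Calling `sample_dp_von_mises(200, 3)` returns 200 three-dimensional angle vectors in radians, their cluster labels, and the mixture weights. With a small `alpha`, only a few of the 20 available components typically receive data, which mirrors how the number of occupied clusters is driven by the data rather than fixed beforehand.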

Published
2025-01-02
How to Cite
Benlakhdar, S., Nadarajah, S., Rziza, M., & Oulad Haj Thami, R. (2025). Density-Adaptive Clustering of Multivariate Angular Data Using Dirichlet Process Mixture Models with Circular Normal Distribution for Artificial Intelligence Applications. Statistics, Optimization & Information Computing, 13(4), 1404-1412. https://doi.org/10.19139/soic-2310-5070-2146
Section
Research Articles