Enhanced Outlier Detection in Linear-Circular Regression Using Circular Distance and Mean Resultant Length

  • Thunchanok Chaitongdee Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen 40002, Thailand
  • Wuttichai Srisodaphol Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen 40002, Thailand
  • Oktsa Dwika Rahmashari Department of Statistics, Faculty of Science, Khon Kaen University, Khon Kaen 40002, Thailand
  • Benjawan Rattanawong Department of Applied Statistics, Faculty of Engineering, Rajamangala University of Technology Isan Khon Kaen Campus, Khon Kaen 40000, Thailand
  • Khanuengnij Prakhammin Department of Applied Statistics, Faculty of Engineering, Rajamangala University of Technology Isan Khon Kaen Campus, Khon Kaen 40000, Thailand
Keywords: circular distance, linear-circular regression, mean resultant length, masking effect, swamping effect

Abstract

In the study of outlier identification in linear-circular regression, two new methods are proposed. By calculating the circular distance of each erroneous value and using the mean resultant length for outlier identification, these methods aim to enhance the precision and reliability of outlier detection. Their effectiveness will be assessed through comprehensive simulations on datasets with and without outlier contamination, comparing them with the previous method. Additionally, the methods were tested on real-world data, specifically wind speed and wind direction data, to further validate their practical applicability. Three metrics are used to evaluate their performance: the probability of correctly identifying all outliers, the masking effect, and the swamping effect. While occasional misclassification of inliers as outliers is possible, the results indicate that both proposed methods demonstrate strong overall performance.
Published
2025-06-09
How to Cite
Chaitongdee , T., Srisodaphol, W., Rahmashari , O., Rattanawong , B., & Prakhammin , K. (2025). Enhanced Outlier Detection in Linear-Circular Regression Using Circular Distance and Mean Resultant Length. Statistics, Optimization & Information Computing, 14(1), 454-468. https://doi.org/10.19139/soic-2310-5070-2459
Section
Research Articles