A Bayesian method for estimation of the entropy in the presence of outliers based on the contaminated Pareto model

  • Mahsa Salajegheh Department of Statistics, Ferdowsi University of Mashhad, Mashhad, Iran
  • Mehdi Jabbari Nooghabi Department of Statistics, Ferdowsi University of Mashhad, Mashhad, Iran
  • Kheirolah Okhli Department of Statistics, Ferdowsi University of Mashhad, Mashhad, Iran
Keywords: Shannon entropy . Fisher information . Outliers . Insurance claim data . Contaminated Pareto model . Bayesian analysis.

Abstract

Shannon entropy and Fisher information are pivotal in the information theory area. The presence of outliers in data and using an inappropriate model may cause misleading inferential results in the amount of information. Our aim of this paper is to compute the amount of Shannon entropy and Fisher information that exists in the Pareto distribution in the presence of multiple outliers. Unlike the existing methods in the literature, we present a good method for the estimation of Shannon entropy and Fisher information to cope with the allowing for the possibility of outliers. In this regard, we focus on the Bayesian approach proposed by D. P. Scollnik (A Pareto scale-inflated outlier model and its Bayesian analysis, Scandinavian Actuarial Journal, vol. 2015, no. 3, pp.201–220) , based on the contaminated Pareto distribution. We implement the Gibbs sampler which is a simple and rational method for computing Bayesian estimation of Shannon entropy and Fisher information under different loss functions. Some simulation studies are conducted to investigate the performance of the proposed methodology under various sample sizes and the number of outliers. In the end, two examples of real insurance claim data are studied to illustrate the superiority of the proposed model in analyzing datasets and computing the amount of Shannon entropy and Fisher information.
Published
2025-02-22
How to Cite
Salajegheh, M., Jabbari Nooghabi, M., & Okhli, K. (2025). A Bayesian method for estimation of the entropy in the presence of outliers based on the contaminated Pareto model. Statistics, Optimization & Information Computing. https://doi.org/10.19139/soic-2310-5070-1997
Section
Research Articles