Systematic Literature Review on Named Entity Recognition: Approach, Method, and Application

  • Warto Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia; Faculty of Da’wa, Institut Agama Islam Negeri Purwokerto, Purwokerto, 53127, Indonesia.
  • Supriadi Rustad Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
  • Guruh Fajar Shidik Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
  • Edi Noersasongko Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
  • Purwanto Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
  • Muljono Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
  • De Rosal Ignatius Moses Setiadi Faculty of Computer Science, Dian Nuswantoro University, Semarang, 50131, Indonesia.
Keywords: Named entity recognition, entity extraction, entity detection, entity classification, natural language processing

Abstract

Named entity recognition (NER) is one of the preprocessing stages in natural language processing (NLP), which functions to detect and classify entities in the corpus. NER results are used in various NLP applications, including sentiment analysis, text summarization, chatbot, machine translation, and question answering. Several previous reviews partially discussed NER, for instance, NER reviews in specific domains, NER classification, and NER deep learning. This paper provides a comprehensive and systematic review on NER topic studies published from 2011 to 2020. The main contribution of this review is to present a comprehensive systematic literature review on NER from preprocessing techniques, datasets, application domains, feature extraction techniques, approaches, methods, and evaluation techniques. The result concludes that the deep learning approach and the Bi-directional long short-term memory with a conditional random field (Bi-LSTM-CRF) method are the most interesting methods among NER researchers. At the same time, medical and health are NER researchers' most popular domains. These developments have also led to an increasing number of public datasets in the medical and health fields. At the end of this review, we recommend some opportunities and challenges for NER research going forward.
Published
2024-02-28
How to Cite
Warto, Rustad, S., Shidik, G. F., Noersasongko, E., Purwanto, Muljono, & Setiadi, D. R. I. M. (2024). Systematic Literature Review on Named Entity Recognition: Approach, Method, and Application. Statistics, Optimization & Information Computing, 12(4), 907-942. https://doi.org/10.19139/soic-2310-5070-1631
Section
Research Articles