PAMR: Persian Abstract Meaning Representation Corpus

Nasim Tohidi, Chitra Dadkhah, Reza Nouralizadeh Ganji, Ehsan Ghaffari Sadr, Hoda Elmi
DOI: https://doi.org/10.1145/3638288
IF: 1.471
2024-01-19
ACM Transactions on Asian and Low-Resource Language Information Processing
Abstract: Abstract Meaning Representation (AMR) is one of the most widely used and best-known semantic representation models. It has found numerous applications in natural language processing tasks in recent years. Large annotated corpora are currently available for English and Chinese, and smaller corpora have been built for some low-resource languages. To the best of our knowledge, however, no AMR corpus exists yet for the Persian/Farsi language. The aim of this paper is therefore to create a Persian AMR (PAMR) corpus by translating English sentences and adapting the AMR guidelines, and to solve the various challenges that arise in the process. The result of this research is a corpus containing 1020 Persian sentences and their corresponding AMRs, which can be used in various natural language processing tasks. To investigate the feasibility of using the corpus, we have applied it to two natural language processing tasks: Sentiment Analysis and Text Summarization.
computer science, artificial intelligence