Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397
Issue N1 2019 year
The article is devoted to the problem of automatic text sentiment analysis. Under the text sentiment the authors emotional attitude, expressed in the text is meant. There is a wide range of areas where sentiment analysis is applied, including political, sociological and marketing research, the search engines, the human-computer interfaces. The task of sentiment analysis is complicated by the strong dependence of the sentiment expressions on the context and domain, the presence of sarcasm and irony. To solve this problem the following approaches are used: a knowledge-based approach; machine learning and combined approach. Each of these approaches has advantages and disadvantages. In this article a combined approach is used. The article proposes the structure of the software system for text sentiment analysis based on sentiment lexicons. The structure includes four main subsystems, namely text preprocessing, sentiment lexicon generation, sentiment lexicon parameters determination and classification subsystems. The implementation of this system is considered in accordance with the object-oriented approach. Main classes and their relations are described with UML class diagram. The results of developed systems experimental research are presented. The research is performed using the text corpora of movies and books reviews, provided by organizers of ROMIP-2011 and ROMIP-2012 seminars. These results show the superiority of the proposed system over widespread machine learning methods. The developed system can be used as a standalone application for summarizing opinions on the text corpora. It is possible to integrate the main part of the system into search services, focused on the preparation of recommendations based on the text sentiment analysis.