Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397

Issue N12 2017 year

DOI: 10.17587/prin.8.556-562
The Resolution of Ambiguities in the Identification of Authors of the Publication with the Use of Co-Authors' Graphs in Large Collections of Bibliographic Data
A. S. Kozitsin, alexanderkz@mail.ru, S. A. Afonin, serg@msu.ru, Lomonosov Moscow State University, Moscow, 117223, Russian Federation
Corresponding author: Kozitsin Alexander S., Researcher, Lomonosov Moscow State University, Moscow, 117223, Russian Federation, E-mail: alexanderkz@mail.ru
Received on July 18, 2017
Accepted on August 02, 2017

This article addresses problems related to automated processing of bibliographic data in scientometric systems by means of statistical analysis of large collections of such data. Experimental results on authors identification in bibliographic data are based on the data set presented in ISTINA, a scientometric information system developed and deployed at Moscow State University. The new algorithm for authors identification, presented in this paper, shows 95 % accuracy on the considered data set. Utilization of this algorithm improves the quality of data entered into the system, thus leading to a more reliable scientometric characteristics of individual researchers and administrative units. The paper also discusses possible approaches to some related practically important problems, such as thematic search and classification of publications using coauthoring graph and thematic classification of scientific journals. Automatic discovery of researchers topics of interest allows for on-demand generation of various documents suitable for decision making in specific research areas and could be useful for supplying system users with information on relevant journals or upcoming scientific events.

Keywords: succometrics, detection of regularities, bibliographic data, graph, citation, author, thematic analysis
pp. 556–562
For citation:
Kozitsin A. S., Afonin S. A. The Resolution of Ambiguities in the Identification of Authors of the Publication with the Use of Co-Authors' Graphs in Large Collections of Bibliographic Data, Programmnaya Ingeneria, 2017, vol. 8, no. 12, pp. 556—562.