Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397

Issue No. 4, 2023

DOI: 10.17587/prin.14.195-202
Algorithms for Finding Duplicate Conferences and Conference Groups in Scientometric Systems
A. S. Kozitsyn, Researcher, Lomonosov Moscow State University, Moscow, 119192, Russian Federation
Corresponding author: Alexander S. Kozitsyn, Researcher, Lomonosov Moscow State University, Moscow, 119192, Russian Federation. E-mail:
Received on January 19, 2023
Accepted on February 07, 2023

The article presents methods developed by the author for assessing the similarity of conference descriptions in order to detect duplicates and to build groups of conferences. It gives an overview of existing online conference catalogs, analyzes their advantages and disadvantages, and substantiates the need for more thorough verification of input data about conferences. The algorithms, their software implementation, and their testing on big data are described using the scientometric system IAS ISTINA as an example. The algorithms make it possible to search for similar conferences by their primary descriptions when a conference is registered, to find duplicates in the database of the scientometric system, and to combine conferences held in different years into groups. The methods can be used in developing conference catalogs and scientometric systems to improve the quality of initial data verification.

Keywords: duplicate search, scientometrics, information systems, conferences
pp. 195–202
For citation:
Kozitsyn A. S. Algorithms for Finding Duplicate Conferences and Conference Groups in Scientometric Systems, Programmnaya ingeneria, 2023, vol. 14, no. 4, pp. 195–202. DOI: 10.17587/prin.14.195-202 (in Russian).