Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397
Issue N6 2018 year
The article presents the results of research on the direction of search models, algorithms and software to determine the semantic similarity between two keywords. The methods which are used in the work are based on the graph theory algorithms. The document is represented as a set of keywords associated with the document. A measure of contextual similarity of a pair of keywords is developed. A keywords graph is constructed for a given collection of documents. The nodes of the graph correspond to the keywords, and edges represent the fact of the contextual closeness of a pair of words. The method of clustering of the constructed graph is presented below. The keywords included in one cluster have the property of semantic similarity, which is an important result of this work. Software implementation of the developed models has been tested on the collections of scientific publications keywords, as well as on the collection of posts tags in the VKontakte social network.