Journal "Software Engineering"
a journal on theoretical and applied science and technology
ISSN 2220-3397

Issue N11 2013 year

Semantic Analysis as a Base for Duplicate Text Fragment Revealing
N. A. Sergievsky , A. A. Kharlamov , д e-mail: kharlamov@analyst.ru

The article offers methods and means of mild duplicate text fragment search on the base of semantic text network analysis. Extracting of semantic network of text as the text semantic portrait is their base. The network is preparing with technology of automatic semantic text analysis Text-Analyst, and then is using for texts sense comparing. Proposed approach uses several levels of texts sifting for fast and exact text duplicates revealing: for the first time fast semantic networks comparison and then — revealing of mild copies of text fragments.

Keywords: mild text duplicate, semantic network, sense comparing, mild copies revealing
pp. 22–31