• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Article

Near-Duplicate Detection for Online-Shops Owners: An FCA-Based Approach

Lecture Notes in Computer Science. 2013. Vol. 7814. P. 722-725.
Ignatov D. I., Chubis Y., Konstantinov A. V.

We proposed a prototype of near-duplicate detection system for web-shop owners. It’s a typical situation for this online businesses to buy description of their goods from so-called copyrighters. Copyrighter can cheat from time to time and provide the owner with some almost identical descriptions for different items. In this paper we demonstrated how we can use FCA for fast clustering and revealing such duplicates in real online perfume shop’s datasets.