Document plagiarism detection using a new concept similarity in formal concept analysis (Q2039881)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Document plagiarism detection using a new concept similarity in formal concept analysis |
scientific article |
Statements
Document plagiarism detection using a new concept similarity in formal concept analysis (English)
0 references
5 July 2021
0 references
Summary: This paper proposes an algorithm for document plagiarism detection using the provided incremental knowledge construction with formal concept analysis (FCA). The incremental knowledge construction is presented to support document matching between the source document in storage and the suspect document. Thus, a new concept similarity measure is also proposed for retrieving formal concepts in the knowledge construction. The presented concept similarity employs appearance frequencies in the obtained knowledge construction. Our approach can be applied to retrieve relevant information because the obtained structure uses FCA in concept form that is definable by a conjunction of properties. This measure is mathematically proven to be a formal similarity metric. The performance of the proposed similarity measure is demonstrated in document plagiarism detection. Moreover, this paper provides an algorithm to build the information structure for document plagiarism detection. Thai text test collections are used for performance evaluation of the implemented web application.
0 references