Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms

Leonardo, Brinardi and Hansun, Seng (2017) Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms. Indonesian Journal of Electrical Engineering and Computer Science, 5 (2). ISSN 2502-4752

Full text not available from this repository.

Abstract

Plagiarism is an act that is considered by the university as a fraud by taking someone ideas or writings without mentioning the references and claimed as his own. Plagiarism detection system is generally implement string matching algorithm in a text document to search for common words between documents. There are some algorithms used for string matching, two of them are Rabin-Karp and Jaro-Winkler Distance algorithms. Rabin-Karp algorithm is one of compatible algorithms to solve the problem of multiple string patterns, while, Jaro-Winkler Distance algorithm has advantages in terms of time. A plagiarism detection application is developed and tested on different types of documents, i.e. doc, docx, pdf and txt. From the experimental results, we obtained that both of these algorithms can be used to perform plagiarism detection of those documents, but in terms of their effectiveness, Rabin-Karp algorithm is much more effective and faster in the process of detecting the document with the size more than 1000 KB.

Item Type: Article
Subjects: 000 Computer Science, Information and General Works > 000 Computer Science, Knowledge and Systems > 005 Computer Programming
800 Literature > 800 Literature, Rhetoric and Criticism
Divisions: Faculty of Engineering & Informatics > Informatics
Depositing User: Administrator UMN Library
Date Deposited: 19 Oct 2021 02:24
Last Modified: 12 Jun 2024 06:11
URI: https://kc.umn.ac.id/id/eprint/18848

Actions (login required)

View Item View Item