Penerapan Metode Rabin-Karp untuk Mengukur Kemiripan Kata Dua Dokumen Berbasis Web (Web-Based Application of the Rabin-Karp Method to Measure Word Similarity Between Two Documents)

  • Ramadhana Saputra
  • Ari Cahyono
  • M. Abu Amar Al Badawi, Universitas Jenderal Achmad Yani Yogyakarta

Abstract

Scientific writing plays a significant role in the academic requirements of higher education and takes many forms, such as papers, reports, journal articles, and scripts. However, plagiarism, the copying of others' work, remains prevalent in the preparation of scientific papers. Plagiarism of digital content in particular often involves copying and pasting or quoting from original documents. Measuring the word similarity between documents is therefore essential. Dhamayanti's research recommends improving the Rabin-Karp algorithm by using a different method [1]. This study builds on previous research that employed a string-matching method. Instead of the conventional cosine method, it substitutes a string-matching technique within the Rabin-Karp algorithm, which improved the similarity percentages. The application is built with the string-matching method using the Rabin-Karp algorithm. The algorithm converts 5-gram word sequences into hash values and matches them, and the similarity percentage is computed from the matching hash values. Identical words in both documents indicate similarity. The application was tested on six scientific documents from different sources with related titles. Across 15 test runs, the accuracy reached 90%.
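
The matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the 5-grams are taken as windows of five consecutive characters after lowercasing and stripping non-alphanumeric characters, the hash is a polynomial rolling hash with the hypothetical constants `BASE` and `MOD`, and the similarity percentage uses the Dice coefficient over the two sets of hash values. The abstract does not specify any of these choices.

```python
import re

BASE = 256          # assumed radix of the polynomial hash (not from the paper)
MOD = 1_000_003     # assumed prime modulus (not from the paper)


def normalize(text):
    """Lowercase and keep only a-z and 0-9 (an assumed preprocessing step)."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def kgram_hashes(text, k=5):
    """Return the set of rolling-hash values of every k-character window."""
    s = normalize(text)
    if len(s) < k:
        return set()
    high = pow(BASE, k - 1, MOD)  # weight of the character leaving the window
    h = 0
    for ch in s[:k]:              # hash of the first window
        h = (h * BASE + ord(ch)) % MOD
    hashes = {h}
    for i in range(k, len(s)):    # slide the window one character at a time
        h = ((h - ord(s[i - k]) * high) * BASE + ord(s[i])) % MOD
        hashes.add(h)
    return hashes


def similarity(doc_a, doc_b, k=5):
    """Similarity percentage from matching hashes (Dice coefficient, assumed)."""
    a, b = kgram_hashes(doc_a, k), kgram_hashes(doc_b, k)
    if not a or not b:
        return 0.0
    return 100.0 * 2 * len(a & b) / (len(a) + len(b))
```

Calling `similarity(text_a, text_b)` returns a value between 0 and 100. Because this sketch compares hash values only, two different 5-grams that happen to collide would count as a match; a fuller Rabin-Karp implementation would confirm each hash match by comparing the underlying strings.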

References

[1] D. Dhamayanti and L. P. Sari, “Aplikasi Pendeteksi Plagiasi pada Universitas Indo Global Mandiri Berbasis Web,” Jurnal Informatika Global, vol. 10, no. 2, 2019.

[2] T. P. K. P. Pembinaan, “Kamus Besar Bahasa Indonesia,” Jakarta: Balai Pustaka, 1989.

[3] M. Mozgovoy, Enhancing computer-aided plagiarism detection. Joensuun yliopisto, 2007.

[4] P. Novantara, “Implementasi Algoritma Jaro-Winkler Distance Untuk Sistem Pendeteksi Plagiarisme Pada Dokumen Skripsi,” Buffer Informatika, vol. 3, no. 2, 2017.

[5] N. Alamsyah, “Perbandingan Algoritma Winnowing Dengan Algoritma Rabin Karp Untuk Mendeteksi Plagiarisme Pada Kemiripan Teks Judul Skripsi,” Technologia: Jurnal Ilmiah, vol. 8, no. 3, pp. 124–134, 2017.

[6] A. Prastyanti and S. N. Endah, “Sistem deteksi kemiripan kata pada dua dokumen menggunakan algoritma Rabin-Karp,” Universitas Diponegoro, 2014.

[7] I. Sommerville, Software engineering. Pearson, 2007.

[8] S. Buttcher, C. L. A. Clarke, and G. V. Cormack, Information retrieval: Implementing and evaluating search engines. MIT Press, 2016.

[9] H. Schütze, C. D. Manning, and P. Raghavan, Introduction to information retrieval, vol. 39. Cambridge University Press, 2008.

[10] D. I. Lesmana, “Sistem penilaian otomatis pada jawaban ujian berbentuk esai menggunakan metode Rabin Karp,” Universitas Islam Negeri Maulana Malik Ibrahim, 2012.

Published

2023-11-22

How to Cite

Saputra, R., Cahyono, A., & Al Badawi, M. A. A. (2023). Penerapan Metode Rabin-Karp untuk Mengukur Kemiripan Kata Dua Dokumen Berbasis Web. Teknomatika: Jurnal Informatika Dan Komputer, 14(1), 42-48. https://doi.org/10.30989/teknomatika.v14i1.1128