Plagiarism Detection Using Natural Language  Processing Techniques

Muhammad Ilyas; Nasreen Malik; Ahmad Bilal; Saad Razzaq; Fahad Maqbool; Qaisar Abbas

PDF

Published Apr 10, 2021

Muhammad Ilyas

Department of Computer Science and IT, University of Sargodha, Sargodha

Nasreen Malik

Department of Computer Science, Virtual University of Pakistan, Jinnah Campus, Lahore, Pakistan.

Ahmad Bilal

School of Electrical Engineering and Computer Science, NUST, Islamabad, Pakistan.

Saad Razzaq

Department of Computer Science and IT, University of Sargodha, Sargodha

Fahad Maqbool

Department of Computer Science and IT, University of Sargodha, Sargodha

Qaisar Abbas

Department of Computer Science and IT, University of Sargodha, Sargodha

Abstract

Now a day’s plagiarism became very common in many fields of life such as research and educational fields. Due to the advancement in plagiarism techniques adopted by plagiarists, it is very difficult to detect plagiarism accurately by the existing technique. Different features are observed while checking plagiarism such as syntactic, lexical, semantic, and structural features. This research explores new and modern plagiarism detection tasks especially text-based plagiarism detection including monolingual plagiarism detection. We proposed a four-stage novel framework for plagiarism detection. Natural Language Processing (NLP) is used by this framework instead of focusing on traditional string-matching approaches. The objective of this framework is to explore text similarity by the combination of two metrics as Skip-Gram and Dice Coefficient on the corpus-based approach. Furthermore, the deep meaning of the text is explored by the use of the Deep and sallow NLP technique. Our results conclude that Heavy revision is identifying easily through Deep NLP. Shallow NLP prepares text very well that is processed further easily. Word2vec results are close to simple Deep NLP methods but word2vec also highlight those document that may not be highlighted by other technique. Synonym and phrase changes are also captured through Deep NLP.

How to Cite

Ilyas, M., Malik, N., Bilal, A., Razzaq, S., Maqbool, F., & Abbas, Q. (2021). Plagiarism Detection Using Natural Language Processing Techniques. Technical Journal, 26(01), 90-101. Retrieved from https://tj.uettaxila.edu.pk/index.php/technical-journal/article/view/1484

Issue

Vol 26 No 01 (2021): Technical Journal

Section

COMPUTER SCIENCE

The author transfers all copyright ownership of the manuscript entitled (title of article) to the Technical Journal in the event the work is published.

Article Sidebar

Main Article Content

Abstract

Article Details