Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over its lifetime, 1,790 publications have been published within this topic, receiving 24,740 citations.

Papers

Journal Article

Measuring Sentences Similarity: A Survey

Mamdouh Farouk¹

Assiut University¹

06 Oct 2019 - arXiv: Computation and Language

TL;DR: Word-to-word based, structure-based, and vector-based approaches are the most widely used for measuring sentence similarity; structure-based similarity, which compares the structures of sentences, needs more investigation.

Abstract: This study reviews the approaches used for measuring sentence similarity. Measuring similarity between natural language sentences is a crucial task for many Natural Language Processing applications such as text classification, information retrieval, question answering, and plagiarism detection. This survey classifies approaches to calculating sentence similarity into three categories based on the adopted methodology. Word-to-word based, structure-based, and vector-based approaches are the most widely used for finding sentence similarity. Each approach measures relatedness between short texts from a specific perspective. In addition, the datasets most commonly used as benchmarks for evaluating techniques in this field are introduced to provide a complete view of the issue. Approaches that combine more than one perspective give better results. Moreover, structure-based similarity, which measures similarity between sentence structures, needs more investigation.

26 citations
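
Two of the perspectives named in the survey above (word-level matching and vector-based comparison) can be illustrated with a minimal Python sketch. This is a toy under stated assumptions, not the survey's method: exact word matching stands in for word-to-word measures, and raw word-count vectors stand in for learned sentence vectors. The function names `tokenize`, `jaccard_similarity`, and `cosine_similarity` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def jaccard_similarity(a, b):
    """Word-overlap score: shared distinct words / all distinct words."""
    set_a, set_b = set(tokenize(a)), set(tokenize(b))
    if not set_a and not set_b:
        return 1.0  # assumption: two empty sentences count as identical
    return len(set_a & set_b) / len(set_a | set_b)


def cosine_similarity(a, b):
    """Vector-based score: cosine of the angle between word-count vectors."""
    vec_a, vec_b = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(vec_a[w] * vec_b[w] for w in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    s1 = "The cat sat on the mat"
    s2 = "A cat sat on a mat"
    print(jaccard_similarity(s1, s2))
    print(cosine_similarity(s1, s2))
```

For the two sample sentences, the overlap score is 4 shared words out of 6 distinct words, and the cosine score is about 0.5. The two scores differ because the count vectors weight repeated words, which is one reason a combination of perspectives can be more informative than either alone.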

Proceedings Article

Plagiarism detection for Java: a tool comparison

Jurriaan Hage¹, Peter Rademaker¹, Nikè van Vugt²

Utrecht University¹, Open University in the Netherlands²

07 Apr 2011

TL;DR: Five tools for detecting plagiarism in Java source code (JPlag, Marble, moss, Plaggie, and sim) are compared with respect to their features and performance.

Abstract: In this paper we compare five tools for detecting plagiarism in Java source code texts: JPlag, Marble, moss, Plaggie, and sim. The tools are compared with respect to their features and performance. For the performance comparison we carried out two experiments. To compare the sensitivity of the tools to different plagiarism techniques, we applied the tools to a set of intentionally plagiarised programs. To get a picture of the precision of the tools, we ran them on several incarnations of a student assignment and compared the top 10s of the results.

26 citations

Journal Article

PlagDetect: a Java programming plagiarism detection tool

Zuhoor Al-Khanjari¹, Jinan Fiaidhi², R. A. Al-Hinai¹, Narayana Swamy Kutti¹

Sultan Qaboos University¹, Lakehead University²

01 Dec 2010 - ACM Inroads

TL;DR: The research first examines various metrics used for plagiarism detection in program code and then selects an appropriate statistical measure, based on attribute counting metrics (ATMs), for detecting plagiarism in Java programming assignments.

Abstract: Practical computing courses that involve a significant number of programming assessment tasks suffer from e-Plagiarism. A pragmatic solution to this problem is to discourage plagiarism, particularly among beginners in programming. One way to do this is to automate the detection of plagiarized work during the marking phase. Our research in this context involves first examining various metrics used in plagiarism detection in program code and then selecting an appropriate statistical measure using attribute counting metrics (ATMs) for detecting plagiarism in Java programming assignments. The goal of this investigation is to study the effectiveness of ATMs for detecting plagiarism among assignment submissions in introductory programming courses.

26 citations
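
The general idea of attribute counting, as mentioned in the entry above, can be sketched in a few lines of Python. This is only an illustration under loud assumptions: the abstract does not list the attributes or the statistical measure used by PlagDetect, so the four counts below, the min/max ratio, and the names `attribute_counts` and `attribute_similarity` are all hypothetical.

```python
import re

# Hypothetical choice of control-flow keywords to count in Java source.
CONTROL_KEYWORDS = {"if", "else", "for", "while", "switch", "return"}


def attribute_counts(source):
    """Return a small, hypothetical attribute vector for a Java source string."""
    words = re.findall(r"[A-Za-z_]\w*", source)
    return {
        "non_blank_lines": sum(1 for line in source.splitlines() if line.strip()),
        "control_keywords": sum(1 for w in words if w in CONTROL_KEYWORDS),
        "distinct_words": len(set(words)),
        "total_words": len(words),
    }


def attribute_similarity(src_a, src_b):
    """Average per-attribute min/max ratio in [0, 1]; 1.0 means identical counts."""
    a, b = attribute_counts(src_a), attribute_counts(src_b)
    ratios = []
    for key in a:
        larger = max(a[key], b[key])
        ratios.append(1.0 if larger == 0 else min(a[key], b[key]) / larger)
    return sum(ratios) / len(ratios)


if __name__ == "__main__":
    original = (
        "int sum(int n) {\n"
        "  int total = 0;\n"
        "  for (int i = 0; i < n; i++) {\n"
        "    total += i;\n"
        "  }\n"
        "  return total;\n"
        "}\n"
    )
    # Same program with every identifier renamed.
    renamed = (
        "int add(int m) {\n"
        "  int acc = 0;\n"
        "  for (int k = 0; k < m; k++) {\n"
        "    acc += k;\n"
        "  }\n"
        "  return acc;\n"
        "}\n"
    )
    print(attribute_similarity(original, renamed))
```

Because the counts ignore the actual names, the renamed copy scores 1.0 against the original. The same property means unrelated programs with similar counts can also score highly, so a score like this is a signal for manual review rather than proof of copying.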

Proceedings Article

Overview of the AraPlagDet PAN@FIRE2015 shared task on Arabic plagiarism detection

Imene Bensalem, Imene Boukhalfa, Paolo Rosso¹, Lahsen Abouenour, Kareem Darwish², Salim Chikhi

Polytechnic University of Valencia¹, Qatar Computing Research Institute²

01 Jan 2015

TL;DR: This overview paper describes the evaluation corpora for plagiarism detection methods for Arabic texts, discusses the participants' methods, and highlights the building blocks that could be language dependent.

Abstract: AraPlagDet is the first shared task that addresses the evaluation of plagiarism detection methods for Arabic texts. It has two sub-tasks, namely external plagiarism detection and intrinsic plagiarism detection. A total of 8 runs were submitted and tested on the standardized corpora developed for the track. This overview paper describes these evaluation corpora, discusses the participants' methods, and highlights their building blocks that could be language dependent.

26 citations

Proceedings Article

Experiments with Convolutional Neural Network Models for Answer Selection

Jinfeng Rao¹, Hua He¹, Jimmy Lin²

University of Maryland, College Park¹, University of Waterloo²

07 Aug 2017

TL;DR: This paper attempts to replicate the results of Severyn and Moschitti using their open-source code and to reproduce them via a de novo implementation using a completely different deep learning toolkit.

Abstract: In recent years, neural networks have been applied to many text processing problems. One example is learning a similarity function between pairs of texts, which has applications to paraphrase extraction, plagiarism detection, question answering, and ad hoc retrieval. Within the information retrieval community, the convolutional neural network model proposed by Severyn and Moschitti in a SIGIR 2015 paper has gained prominence. This paper focuses on the problem of answer selection for question answering: we attempt to replicate the results of Severyn and Moschitti using their open-source code as well as to reproduce their results via a de novo (i.e., from scratch) implementation using a completely different deep learning toolkit. Our de novo implementation is instructive in ascertaining whether reported results generalize across toolkits, each of which has its idiosyncrasies. We were able to successfully replicate and reproduce the reported results of Severyn and Moschitti, albeit with minor differences in effectiveness, but affirming the overall design of their model. Additional ablation experiments break down the components of the model to show their contributions to overall effectiveness. Interestingly, we find that removing one component actually increases effectiveness and that a simplified model with only four word overlap features performs surprisingly well, even better than convolution feature maps alone.

26 citations

Network Information

Performance

Metrics

Papers: 1,976
Citations: 29,005

No. of papers in the topic in previous years
Year	Papers
2023	59
2022	126
2021	83
2020	118
2019	130
2018	125
