Home
/
Topics
/
Plagiarism detection

Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over the lifetime, 1790 publications have been published within this topic receiving 24740 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1998
1997
1996
1994
1990
1989
1988
1987
1985
1981

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Practical issues for academics using the Turnitin plagiarism detection software

[...]

Karl O. Jones¹•Institutions (1)

Liverpool John Moores University¹

12 Jun 2008

TL;DR: Some issues that might be raised in employing Turnitin are highlighted and some approaches that academics might utilise to allow efficient use of the system are suggested.

...read moreread less

Abstract: The Turnitin plagiarism detection system allows individual student assignments to be uploaded and matched for similarity with content on the web, all other assignments uploaded by institutions using the system and certain journals. An online report is produced for each submission identifying the sources of those similarities and the percentage match. There is a significant benefit in using Turnitin to identify possible cases of plagiarism. This paper highlights some issues that might be raised in employing Turnitin and suggests some approaches that academics might utilise to allow efficient use of Turnitin.

...read moreread less

29 citations

Posted Content•

Plagiarism: Taxonomy, Tools and Detection Techniques.

[...]

Hussain Ahmed Chowdhury, Dhruba K. Bhattacharyya

19 Jan 2018-arXiv: Information Retrieval

TL;DR: A taxonomy of various plagiarism forms is presented and include discussion on each of these forms to highlight a list of issues and research challenges related to this evolving research problem.

...read moreread less

Abstract: To detect plagiarism of any form, it is essential to have broad knowledge of its possible forms and classes, and existence of various tools and systems for its detection. Based on impact or severity of damages, plagiarism may occur in an article or in any production in a number of ways. This survey presents a taxonomy of various plagiarism forms and include discussion on each of these forms. Over the years, a good number tools and techniques have been introduced to detect plagiarism. This paper highlights few promising methods for plagiarism detection based on machine learning techniques. We analyse the pros and cons of these methods and finally we highlight a list of issues and research challenges related to this evolving research problem.

...read moreread less

29 citations

Proceedings Article•DOI•

Analyzing Mathematical Content to Detect Academic Plagiarism

[...]

Norman Meuschke¹, Moritz Schubotz¹, Felix Hamborg¹, Tomáš Skopal², Bela Gipp¹ - Show less +1 more•Institutions (2)

University of Konstanz¹, Charles University in Prague²

06 Nov 2017

TL;DR: The results show that mathematical expressions are promising text-independent features to identify academic plagiarism in large collections and an open source parallel data processing pipeline built using the Apache Flink framework is developed.

...read moreread less

Abstract: This paper presents, to our knowledge, the first study on analyzing mathematical expressions to detect academic plagiarism. We make the following contributions. First, we investigate confirmed cases of plagiarism to categorize the similarities of mathematical content commonly found in plagiarized publications. From this investigation, we derive possible feature selection and feature comparison strategies for developing math-based detection approaches and a ground truth for our experiments. Second, we create a test collection by embedding confirmed cases of plagiarism into the NTCIR-11 MathIR Task dataset, which contains approx. 60 million mathematical expressions in 105,120 documents from arXiv.org. Third, we develop a first math-based detection approach by implementing and evaluating different feature comparison approaches using an open source parallel data processing pipeline built using the Apache Flink framework. The best performing approach identifies all but two of our real-world test cases at the top rank and achieves a mean reciprocal rank of 0.86. The results show that mathematical expressions are promising text-independent features to identify academic plagiarism in large collections. To facilitate future research on math-based plagiarism detection, we make our source code and data available.

...read moreread less

28 citations

Journal Article•DOI•

Uncovering source code reuse in large-scale academic environments

[...]

Enrique Flores¹, Alberto Barrón-Cedeño², Lidia Moreno¹, Paolo Rosso¹•Institutions (2)

Polytechnic University of Valencia¹, Polytechnic University of Catalonia²

01 May 2015-Computer Applications in Engineering Education

TL;DR: The purpose of this research is to uncover potential cases of source code reuse in large‐scale environments by using an automatic system based on the comparison of programs at character level to find similarities among multiple sets of source codes.

...read moreread less

Abstract: The advent of the Internet has caused an increase in content reuse, including source code. The purpose of this research is to uncover potential cases of source code reuse in large-scale environments. A good example is academia, where massive courses are taught to students who must demonstrate that they have acquired the knowledge. The need of detecting content reuse in quasi real-time encourages the development of automatic systems such as the one described in this paper for source code reuse detection. Our approach is based on the comparison of programs at character level. It is able to find potential cases of reuse across a huge number of assignments. It achieved better results than JPlag, the most used online system to find similarities among multiple sets of source codes. The most common obfuscation operations we found were changes in identifier names, comments and indentation. © 2014 Wiley Periodicals, Inc. Comput Appl Eng Educ 23:383–390, 2015; View this article online at wileyonlinelibrary.com/journal/cae; DOI 10.1002/cae.21608

...read moreread less

28 citations

Journal Article•DOI•

[...]

Ming Liu¹, Bo Lang¹, Zepeng Gu¹, Ahmed Zeeshan¹•Institutions (1)

Beihang University¹

21 Dec 2017-Tsinghua Science & Technology

TL;DR: In this article, a joint word-embedding model for long documents in the academic domain is proposed to improve the semantic representation quality of word vectors by incorporating a domain-specific semantic relation constraint into the traditional context constraint.

...read moreread less

28 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
…
45
46
47
48
49
50
51
…
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

1,976

Papers

29,005

Citations

No. of papers in the topic in previous years
Year	Papers
2023	59
2022	126
2021	83
2020	118
2019	130
2018	125

Plagiarism detection

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics