Home
/
Topics
/
Plagiarism detection

Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over the lifetime, 1790 publications have been published within this topic receiving 24740 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1998
1997
1996
1994
1990
1989
1988
1987
1985
1981

Papers

PDF

Open Access

More filters

Proceedings Article•

Detection of Simple Plagiarism in Computer Science Papers

[...]

Yaakov HaCohen-Kerner¹, Aharon Tayeb¹, Natan Ben-Dror¹•Institutions (1)

Jerusalem College of Technology¹

23 Aug 2010

TL;DR: This research developed software capable of simple plagiarism detection that has built a corpus containing 10,100 academic papers in computer science written in English and two test sets including papers that were randomly chosen from C.

...read moreread less

Abstract: Plagiarism is the use of the language and thoughts of another work and the representation of them as one's own original work. Various levels of plagiarism exist in many domains in general and in academic papers in particular. Therefore, diverse efforts are taken to automatically identify plagiarism. In this research, we developed software capable of simple plagiarism detection. We have built a corpus (C) containing 10,100 academic papers in computer science written in English and two test sets including papers that were randomly chosen from C. A widespread variety of baseline methods has been developed to identify identical or similar papers. Several methods are novel. The experimental results and their analysis show interesting findings. Some of the novel methods are among the best predictive methods.

...read moreread less

24 citations

Journal Article•DOI•

Source code author identification with unsupervised feature learning

[...]

Upul Bandara¹, Gamini Wijayarathna¹•Institutions (1)

University of Kelaniya¹

01 Feb 2013-Pattern Recognition Letters

TL;DR: This paper investigates an unsupervised feature learning technique called sparse auto-encoder as a method of extracting features from source code files and shows that performance is very close to the state of art techniques in the source code identification field.

...read moreread less

24 citations

Dissertation•

A study on plagiarism detection and plagiarism direction identification using natural language processing techniques

[...]

Man Yan Miranda Chong

01 Jan 2013

TL;DR: Man Yan Miranda Chong A thesis submitted in partial fulfilment of the requirements of the University of Wolverhampton for the degree of Doctor of Philosophy in 2013.

...read moreread less

Abstract: Man Yan Miranda Chong A thesis submitted in partial fulfilment of the requirements of the University of Wolverhampton for the degree of Doctor of Philosophy 2013

...read moreread less

24 citations

Journal Article•DOI•

To cheat or not to cheat? A trial of the JISC Plagiarism Detection Service with biological sciences students

[...]

Joanne Louise Badge¹, Alan J. Cann¹, Jon Scott¹•Institutions (1)

University of Leicester¹

19 Jun 2007-Assessment & Evaluation in Higher Education

TL;DR: In this article, the authors present the results of a 2-year trial of the JISC plagiarism detection service (PDS) involving hundreds of students and discuss the effectiveness of the service in detecting plagiarized material and in acting as a deterrent.

...read moreread less

Abstract: In the UK, there is great concern about the perceived increase in plagiarized work being submitted by students in higher educations. Although there is much debate, the reasons for the perceived change are not completely clear. Here we present the results of a 2‐year trial of the JISC Plagiarism Detection Service (PDS) involving hundreds of students. The effectiveness of the service in detecting plagiarized material and in acting as a deterrent are discussed. Although an increased number of cases of plagiarism were detected during the trial, the relative contributions of the electronic detection system and increased staff awareness remain unknown.

...read moreread less

24 citations

Journal Article•DOI•

[...]

Richard S. Forsyth¹, Serge Sharoff¹•Institutions (1)

University of Leeds¹

01 Apr 2014-Literary and Linguistic Computing

TL;DR: A multi-register corpus gathered for this purpose is introduced, in which each text has been located in a similarity space based on ratings by human readers, which provides a resource for testing similarity measures derived from computational text-processing against reference levels derived from human judgement.

...read moreread less

Abstract: Quantifying the similarity or dissimilarity between documents is an important task in authorship attribution, information retrieval, plagiarism detection, text mining, and many other areas of linguistic computing. Numerous similarity indices have been devised and used, but relatively little attention has been paid to calibrating such indices against externally imposed standards, mainly because of the difficulty of establishing agreed reference levels of inter-text similarity. The present article introduces a multi-register corpus gathered for this purpose, in which each text has been located in a similarity space based on ratings by human readers. This provides a resource for testing similarity measures derived from computational text-processing against reference levels derived from human judgement, i.e. external to the texts themselves. We describe the results of a benchmarking study in five different languages in which some widely used measures perform comparatively poorly. In particular, several alternative correlational measures (Pearson r, Spearman rho, tetrachoric correlation) consistently outperform cosine similarity on our data. A method of using what we call ‘anchor texts’ to extend this method from monolingual inter-text similarity-scoring to inter-text similarity-scoring across languages is also proposed and tested.

...read moreread less

23 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
…
57
58
59
60
61
62
63
…
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

1,976

Papers

29,005

Citations

No. of papers in the topic in previous years
Year	Papers
2023	59
2022	126
2021	83
2020	118
2019	130
2018	125

Plagiarism detection

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics