Home
/
Topics
/
Plagiarism detection

Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over the lifetime, 1790 publications have been published within this topic receiving 24740 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1998
1997
1996
1994
1990
1989
1988
1987
1985
1981

Papers

PDF

Open Access

More filters

Journal Article•DOI•

DOCODE 3.0 (DOcument COpy DEtector)

[...]

Juan D. Velásquez¹, Yerko Covacevich¹, Francisco Molina¹, Edison Marrese-Taylor¹, Cristian Rodriguez¹, Felipe Bravo-Marquez² - Show less +2 more•Institutions (2)

University of Chile¹, University of Waikato²

01 Jan 2016-Information Fusion

TL;DR: DOCODE 3.0 is presented, a Web system for educational institutions that performs automatic analysis of large quantities of digital documents in relation to their degree of originality, and produces a number of visualizations and reports to let teachers and professors gain insights on the originality of the documents they review.

...read moreread less

31 citations

Journal Article•DOI•

The Mind of a Plagiarist

[...]

Jon Baggaley¹, Bob Spencer¹•Institutions (1)

Athabasca University¹

01 Mar 2005-Learning, Media and Technology

TL;DR: In this article, a case of serial plagiarism in the work of a graduate student in an online distance education program is discussed, and the complexity of the student's thinking and the manner in which the case was handled by the teacher and the university.

...read moreread less

Abstract: The ease with which material may be ‘copied and pasted’ from the Internet into written work is raising concern in educational institutions, and particularly in those disciplines that use online sources and methods in their curriculum. A case of ‘serial plagiarism’ is discussed, in the work of a graduate student in an online distance education program. The complexity of the student’s thinking is emphasized, and the manner in which the case was handled by the teacher and the university. The use of an online plagiarism‐checking technology (Turnitin.com) and the value of such services are discussed. The case illustrates the importance of explaining the precise nature of plagiarism to students, of providing clear warnings about its consequences and of developing a careful institutional approach to plagiarism detection and prevention.

...read moreread less

31 citations

Proceedings Article•DOI•

Similarity patterns in language

[...]

Jonathan Helfman¹•Institutions (1)

Bell Labs¹

04 Oct 1994

TL;DR: Dotplot is a technique for visualizing patterns of string matches in millions of lines of text and code that identify subtler relationships in text analysis, software engineering, and information retrieval.

...read moreread less

Abstract: Dotplot is a technique for visualizing patterns of string matches in millions of lines of text and code. Patterns may be explored interactively or detected automatically. Applications include text analysis (author identification, plagiarism detection, translation alignment, etc.), software engineering (module and version identification, subroutine categorization, redundant code identification, etc.), and information retrieval (identification of similar records in results of queries). Patterns are interpreted though a visual language. Squares identify unordered matches (documents with lots of matching words or subroutines with lots of matching symbols), while diagonals identify ordered matches (copies, versions, and translations). Patterns of squares and diagonals have more complex interpretations that identify subtler relationships. >

...read moreread less

31 citations

Proceedings Article•DOI•

Plagiarism detection in text using Vector Space Model

[...]

Asif Ekbal¹, Sriparna Saha¹, Gaurav Choudhary¹•Institutions (1)

Indian Institute of Technology Patna¹

01 Dec 2012

TL;DR: This paper proposes a technique based on textual similarity for external plagiarism detection that uses an approach based on the traditional Vector Space Model (VSM) for this candidate selection.

...read moreread less

Abstract: Plagiarism denotes the act of copying someone else's idea (or, works) and claiming it as his/her own. Plagiarism detection is the procedure to detect the texts of a given document which are plagiarized, i.e. copied from from some other documents. Potential challenges are due to the facts that plagiarists often obfuscate the copied texts; might shuffle, remove, insert, or replace words or short phrases; might also restructure the sentences replacing words with synonyms; and changing the order of appearances of words in a sentence. In this paper we propose a technique based on textual similarity for external plagiarism detection. For a given suspicious document we have to identify the set of source documents from which the suspicious document is copied. The method we propose comprises of four phases. In the first phase, we process all the documents to generate tokens, lemmas, finding Part-of-Speech (PoS) classes, character-offsets, sentence numbers and named-entity (NE) classes. In the second phase we select a subset of documents that may possibly be the sources of plagiarism. We use an approach based on the traditional Vector Space Model (VSM) for this candidate selection. In the third phase we use a graph-based approach to find out the similar passages in suspicious document and selected source documents. Finally we filter out the false detections1.

...read moreread less

31 citations

Journal Article•DOI•

Efficient clustering-based source code plagiarism detection using PIY

[...]

Tony Ohmann¹, Imad Rahal²•Institutions (2)

University of Massachusetts Amherst¹, College of Saint Benedict²

01 May 2015-Knowledge and Information Systems

TL;DR: This work presents an approach called program it yourself (PIY) which is empirically shown to outperform MOSS in detection accuracy, and is also capable of maintaining detection accuracy and reasonable runtimes even when using extremely large data repositories.

...read moreread less

Abstract: Vast amounts of information available online make plagiarism increasingly easy to commit, and this is particularly true of source code. The traditional approach of detecting copied work in a course setting is manual inspection. This is not only tedious but also typically misses code plagiarized from outside sources or even from an earlier offering of the course. Systems to automatically detect source code plagiarism exist but tend to focus on small submission sets. One such system that has become the standard in automated source code plagiarism detection is measure of software similarity (MOSS) Schleimer et al. in proceedings of the 2003 ACM SIGMOD international conference on management of data, ACM, San Diego, 2003. In this work, we present an approach called program it yourself (PIY) which is empirically shown to outperform MOSS in detection accuracy. By utilizing parallel processing and data clustering, PIY is also capable of maintaining detection accuracy and reasonable runtimes even when using extremely large data repositories.

...read moreread less

31 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
…
41
42
43
44
45
46
47
…
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

1,976

Papers

29,005

Citations

No. of papers in the topic in previous years
Year	Papers
2023	59
2022	126
2021	83
2020	118
2019	130
2018	125

Plagiarism detection

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics