Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over its lifetime, 1,790 publications have been published within this topic, receiving 24,740 citations.

Papers published on a yearly basis (chart covering 1981 to 2023)

Papers

Proceedings Article•DOI•

Kyle Williams¹, Jian Wu¹, C. Lee Giles¹•Institutions (1)

Pennsylvania State University¹

16 Sep 2014

TL;DR: SimSeerX is introduced, a search engine for similar document retrieval that receives whole documents as queries and returns a ranked list of similar documents.

Abstract: The need to find similar documents occurs in many settings, such as in plagiarism detection or research paper recommendation. Manually constructing queries to find similar documents may be overly complex, thus motivating the use of whole documents as queries. This paper introduces SimSeerX, a search engine for similar document retrieval that receives whole documents as queries and returns a ranked list of similar documents. Key to the design of SimSeerX is that it is able to work with multiple similarity functions and document collections. We present the architecture and interface of SimSeerX, show its applicability with 3 different similarity functions and demonstrate its scalability on a collection of 3.5 million academic documents.
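The idea of using a whole document as the query, with interchangeable similarity functions, can be illustrated with a minimal sketch. This is not SimSeerX's implementation: the function names (`rank_similar`, `jaccard_shingles`, `cosine_tf`), the two similarity measures, and the in-memory dictionary standing in for a document collection are all assumptions made for illustration.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def jaccard_shingles(a: str, b: str, k: int = 3) -> float:
    """Jaccard overlap of word k-shingles (assumed measure, k is illustrative)."""
    def shingles(text: str) -> set:
        toks = tokenize(text)
        return {tuple(toks[i:i + k]) for i in range(len(toks) - k + 1)}

    sa, sb = shingles(a), shingles(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def cosine_tf(a: str, b: str) -> float:
    """Cosine similarity of raw term-frequency vectors (assumed measure)."""
    ca, cb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm = (math.sqrt(sum(v * v for v in ca.values()))
            * math.sqrt(sum(v * v for v in cb.values())))
    return dot / norm if norm else 0.0


def rank_similar(
    query_doc: str,
    collection: Dict[str, str],
    similarity: Callable[[str, str], float],
    top_n: int = 10,
) -> List[Tuple[str, float]]:
    """Score every document against the whole query document, best first."""
    scored = [(doc_id, similarity(query_doc, text))
              for doc_id, text in collection.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]
```

Passing the similarity function as a parameter mirrors the abstract's point that the engine works with multiple similarity functions. A linear scan like this would not scale to millions of documents; a real system would need an index.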

10 citations

Book Chapter•DOI•

Plagiarism Detection in Texts Obfuscated with Homoglyphs

Faisal Alvi¹,², Mark Stevenson¹, Paul Clough¹•Institutions (2)

University of Sheffield¹, King Fahd University of Petroleum and Minerals²

08 Apr 2017

TL;DR: Two alternative approaches for detecting plagiarism in homoglyph obfuscated texts are presented: the first approach utilizes the Unicode list of confusables to replace homoglyphs with visually identical letters, while the second approach uses a similarity score computed using normalized Hamming distance to match homoglyph obfuscated words with source words.

Abstract: Homoglyphs can be used for disguising plagiarized text by replacing letters in source texts with visually identical letters from other scripts. Most current plagiarism detection systems are not able to detect plagiarism when text has been obfuscated using homoglyphs. In this work, we present two alternative approaches for detecting plagiarism in homoglyph obfuscated texts. The first approach utilizes the Unicode list of confusables to replace homoglyphs with visually identical letters, while the second approach uses a similarity score computed using normalized hamming distance to match homoglyph obfuscated words with source words. Empirical testing on datasets from PAN-2015 shows that both approaches perform equally well for plagiarism detection in homoglyph obfuscated texts.
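Both approaches named in the abstract can be sketched in a few lines. This is an illustrative sketch, not the authors' code: the `CONFUSABLES` table is a small hand-picked subset (the full Unicode confusables list is far larger), and the function names, the equal-length restriction, and the `max_distance` threshold are assumptions.

```python
from typing import Iterable, Optional

# Hypothetical, hand-picked subset of confusable characters mapped to
# Latin letters; the real Unicode confusables list is much larger.
CONFUSABLES = {
    "\u0430": "a",  # Cyrillic small a
    "\u0435": "e",  # Cyrillic small ie
    "\u043e": "o",  # Cyrillic small o
    "\u0440": "p",  # Cyrillic small er
    "\u0441": "c",  # Cyrillic small es
    "\u0445": "x",  # Cyrillic small ha
    "\u0456": "i",  # Cyrillic small Byelorussian-Ukrainian i
    "\u03bf": "o",  # Greek small omicron
}


def replace_homoglyphs(text: str) -> str:
    """Approach 1: map each known homoglyph back to its Latin look-alike."""
    return "".join(CONFUSABLES.get(ch, ch) for ch in text)


def normalized_hamming(a: str, b: str) -> float:
    """Fraction of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    if not a:
        return 0.0
    return sum(x != y for x, y in zip(a, b)) / len(a)


def match_word(word: str, source_words: Iterable[str],
               max_distance: float = 0.3) -> Optional[str]:
    """Approach 2: pick the closest same-length source word, if close enough.

    The threshold value is an assumption chosen for illustration.
    """
    candidates = [(normalized_hamming(word, s), s)
                  for s in source_words if len(s) == len(word)]
    if not candidates:
        return None
    distance, best = min(candidates)
    return best if distance <= max_distance else None


# "plagiarism" with both Latin letters "a" swapped for Cyrillic "а".
obfuscated = "pl\u0430gi\u0430rism"
print(replace_homoglyphs(obfuscated))
print(match_word(obfuscated, ["detection", "plagiarism", "paraphrase"]))
```

The second approach needs no character table, since a homoglyph-obfuscated word still agrees with its source word at every position that was not substituted.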

10 citations

Pen-Based Electronic Grading of Online Student Submissions

Jeffrey L. Popyack, Nira Herrmann, Bruce W. Char, Paul Zoski, Chris Cera, Robert N. Lass¹•Institutions (1)

Drexel University¹

01 Jan 2003

TL;DR: Electronic submission of student assignments certainly provides many advantages for the faculty member and graders, and paperless transactions are especially useful when the number of submissions is large and the assignments must be distributed to multiple locations.

Abstract: And so it is with grading assignments that have been submitted electronically. Electronic submission of student assignments certainly provides many advantages for the faculty member and graders. For instance, electronic submissions are easier to manage and keep track of than their paper counterparts, particularly as the number of submissions gets large. Submissions can be time-stamped automatically and archived, thus minimizing the potential for disputes over lateness and lost assignments and/or grades. Furthermore, archives can help resolve issues involving academic dishonesty and/or plagiarism. Finally, paperless transactions are especially useful when the number of submissions is large and the assignments must be distributed to multiple locations (such as to teaching assistants, graders, and plagiarism detection software).

10 citations

Proceedings Article•DOI•

Deep Investigation of Cross-Language Plagiarism Detection Methods.

Jérémy Ferrero, Laurent Besacier¹, Didier Schwab¹, Frédéric Agnès•Institutions (1)

University of Grenoble¹

03 Aug 2017

TL;DR: This paper investigates cross-language plagiarism detection methods for 6 language pairs on 2 granularities of text units in order to draw robust conclusions on the best methods while deeply analyzing correlations across document styles and languages.

Abstract: This paper is a deep investigation of cross-language plagiarism detection methods on a new recently introduced open dataset, which contains parallel and comparable collections of documents with multiple characteristics (different genres, languages and sizes of texts). We investigate cross-language plagiarism detection methods for 6 language pairs on 2 granularities of text units in order to draw robust conclusions on the best methods while deeply analyzing correlations across document styles and languages.

10 citations

Book Chapter•DOI•

AST-based plagiarism detection method

Liping Zhang¹, Dongsheng Liu¹, Yanchen Li¹, Mei Zhong¹•Institutions (1)

Inner Mongolia Normal University¹

01 Jan 2012 - Computer Engineering and Design

TL;DR: A code plagiarism detection method based on the abstract syntax tree (AST) is studied: it pre-formats the code, performs lexical and syntactic analysis to obtain the corresponding AST, calculates the similarity of the resulting code sequences, and produces a code plagiarism detection report.

Abstract: In this paper, a code plagiarism detection method based on the abstract syntax tree (AST) is studied. It pre-formats the code and performs lexical and syntactic analysis to obtain the corresponding AST. It then traverses the AST to generate code sequences, calculates the similarity of the code sequences, and produces the code plagiarism detection report. Test results verify the effectiveness of the method.
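The pipeline described in the abstract (parse to an AST, traverse it into a sequence, compare sequences) can be sketched as follows. The abstract does not name a target language or a similarity measure, so this sketch assumes Python source parsed with the standard `ast` module and uses `difflib.SequenceMatcher` as a stand-in similarity; `node_sequence` and `ast_similarity` are hypothetical names.

```python
import ast
from difflib import SequenceMatcher
from typing import List


def node_sequence(source: str) -> List[str]:
    """Parse the source and list AST node type names in pre-order."""
    sequence: List[str] = []

    def visit(node: ast.AST) -> None:
        sequence.append(type(node).__name__)
        for child in ast.iter_child_nodes(node):
            visit(child)

    visit(ast.parse(source))
    return sequence


def ast_similarity(source_a: str, source_b: str) -> float:
    """Similarity in [0, 1] between the two node-type sequences.

    SequenceMatcher.ratio() is an assumed stand-in for the paper's
    unspecified sequence similarity.
    """
    matcher = SequenceMatcher(None, node_sequence(source_a),
                              node_sequence(source_b), autojunk=False)
    return matcher.ratio()


original = (
    "def total(xs):\n"
    "    s = 0\n"
    "    for x in xs:\n"
    "        s += x\n"
    "    return s\n"
)
# Same structure with every identifier renamed.
renamed = (
    "def acc(values):\n"
    "    result = 0\n"
    "    for v in values:\n"
    "        result += v\n"
    "    return result\n"
)
print(ast_similarity(original, renamed))
```

Because the sequence records only node types, renaming identifiers or reformatting whitespace leaves it unchanged, which is the main reason to compare ASTs rather than raw text.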

9 citations

Network Information

Performance

Metrics

Papers: 1,976
Citations: 29,005

No. of papers in the topic in previous years
Year	Papers
2023	59
2022	126
2021	83
2020	118
2019	130
2018	125
