Topic

Plagiarism detection

About: Plagiarism detection is a research topic. Over its lifetime, 1,790 publications have been published within this topic, receiving 24,740 citations.


Papers
Journal Article
TL;DR: In this article, the authors examined errors in a corpus of 150 academic essays written by Turkish EFL students studying at the Department of English Language and Literature at a public university in Turkey.
Abstract: The present study aims to explore Turkish EFL students’ major writing difficulties by analyzing the frequent writing errors in academic essays. Accordingly, the study examined errors in a corpus of 150 academic essays written by Turkish EFL students studying at the Department of English Language and Literature at a public university in Turkey. The essays were written on assigned topics as take-home exam papers or assignments in the context of a first-year academic writing course. The essays ranged in length from 500 to 1500 words. They were compiled into a corpus and analyzed using a concordance program. The essays were also checked for plagiarism using online plagiarism detection software, and plagiarized essays were excluded from the analysis. Errors were classified using an error classification system organized according to lexico-grammatical categories. The resulting categories were mostly syntactic and lexical, but academic style errors were considered as well. In terms of error categories, the most frequent errors were observed in the verb-related categories. When considered individually, the most frequent errors were observed in noun modification and were mostly interference-related.

8 citations

Posted Content
TL;DR: This paper investigated cross-language plagiarism detection methods for 6 language pairs on 2 granularities of text units in order to draw robust conclusions on the best methods while deeply analyzing correlations across document styles and languages.
Abstract: This paper is an in-depth investigation of cross-language plagiarism detection methods on a recently introduced open dataset, which contains parallel and comparable collections of documents with multiple characteristics (different genres, languages and sizes of texts). We investigate cross-language plagiarism detection methods for 6 language pairs on 2 granularities of text units in order to draw robust conclusions on the best methods while deeply analyzing correlations across document styles and languages.

8 citations

Proceedings Article
01 Dec 2010
TL;DR: A new method is proposed by combining structure metrics with semantic computing techniques; it is capable of identifying not only the basic means of cheating in code copying, but also the more advanced ones, such as replacing control structures with equivalent structures.
Abstract: Source code documents are vulnerable to being plagiarized. As the central component of Code Plagiarism Detection (CPD), Code Similarity Detection (CSD) attracts more and more attention. In this paper, we propose a new method for CSD by combining structure metrics with semantic computing techniques. It is capable of identifying not only the basic means of cheating in code copying, but also the more advanced ones, such as replacing control structures with equivalent structures. We describe the design and implementation of the method, and conduct comparative experiments against MOSS and the structure-only method. Experiments show that the proposed method obtains more effective similarity values for code pairs.
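The abstract does not spell out how equivalent control structures are recognized. As a rough illustration only, and not the authors' method, one simple way to make a structural comparison robust to such swaps is to map interchangeable constructs to a shared label before comparing. The sketch below uses Python's standard `ast` module; the `EQUIVALENT` table and the `control_signature` helper are hypothetical names introduced for this example.

```python
import ast

# Hypothetical equivalence classes: constructs that a plagiarist could swap
# for one another are mapped to the same label.
EQUIVALENT = {"For": "Loop", "While": "Loop", "If": "Branch", "IfExp": "Branch"}


def control_signature(source):
    """Return the normalized control-structure labels found in `source`."""
    labels = []
    for node in ast.walk(ast.parse(source)):
        name = type(node).__name__
        if name in EQUIVALENT:
            labels.append(EQUIVALENT[name])
    return labels


for_version = """
def total(values):
    result = 0
    for v in values:
        result += v
    return result
"""

# Same computation with the for loop rewritten as a while loop.
while_version = """
def total(values):
    result = 0
    i = 0
    while i < len(values):
        result += values[i]
        i += 1
    return result
"""

print(control_signature(for_version))
print(control_signature(while_version))
```

Both programs produce the same signature after normalization, even though one uses `for` and the other `while`. A real detector would combine such a signal with other structural and semantic measures.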

8 citations

Proceedings Article
23 May 2013
TL;DR: To detect plagiarism in programming courses, a plagiarism detection method based on the Abstract Syntax Tree (AST) is proposed, which can also find the “copy cluster” accurately.
Abstract: To detect plagiarism in programming courses, a plagiarism detection method based on the Abstract Syntax Tree (AST) is proposed. First, we parse the source code into the corresponding AST with a syntax analyzer, and a biological sequence matching algorithm is used to calculate the similarities of programs. Second, the AST features of the similar parts of the programs are extracted, and feature vectors are obtained from them. Finally, we find the “copy cluster” by clustering the vectors. Experimental results show that this method is effective at detecting plagiarism and can also find the “copy cluster” accurately.
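The first step described above (parse to an AST, then score similarity by sequence matching) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the paper's implementation: it uses Python's `ast` module as the syntax analyzer and swaps in `difflib.SequenceMatcher` for the biological sequence matching algorithm, which the abstract does not name. The helper names `node_sequence` and `ast_similarity` are hypothetical, and the feature extraction and clustering steps are omitted.

```python
import ast
from difflib import SequenceMatcher


def node_sequence(source):
    """Flatten a program into a list of AST node type names.

    Identifier names and literal values are dropped, so renaming
    variables or functions does not change the sequence.
    """
    return [type(node).__name__ for node in ast.walk(ast.parse(source))]


def ast_similarity(source_a, source_b):
    """Return a similarity ratio in [0, 1] between two programs."""
    seq_a = node_sequence(source_a)
    seq_b = node_sequence(source_b)
    # SequenceMatcher stands in for the sequence alignment step (assumption).
    return SequenceMatcher(None, seq_a, seq_b, autojunk=False).ratio()


original = """
def total(values):
    result = 0
    for v in values:
        result += v
    return result
"""

# Same structure with every identifier renamed.
renamed = """
def add_all(numbers):
    acc = 0
    for n in numbers:
        acc += n
    return acc
"""

different = """
def greet(name):
    print("hello " + name)
"""

print(ast_similarity(original, renamed))
print(ast_similarity(original, different))
```

Because only node types are compared, the renamed copy scores as identical to the original, while the unrelated program scores lower. Pairwise scores of this kind could then feed the feature extraction and clustering steps that the abstract describes.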

8 citations

Journal Article
TL;DR: This research implements an Online Detection Plagiarism System (ODPS) with a web-based user interface, and shows that a combined approach performs better than a single approach for source code of various styles.

8 citations


Network Information
Related Topics (5)
Active learning: 42.3K papers, 1.1M citations, 78% related
The Internet: 213.2K papers, 3.8M citations, 77% related
Software development: 73.8K papers, 1.4M citations, 77% related
Graph (abstract data type): 69.9K papers, 1.2M citations, 76% related
Deep learning: 79.8K papers, 2.1M citations, 76% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    59
2022    126
2021    83
2020    118
2019    130
2018    125