scispace - formally typeset
Search or ask a question
Institution

University of the Aegean

EducationMytilene, Greece
About: University of the Aegean is a education organization based out in Mytilene, Greece. It is known for research contribution in the topics: Population & Context (language use). The organization has 2818 authors who have published 8100 publications receiving 179275 citations. The organization is also known as: UAEG.


Papers
More filters
01 Jan 2009
TL;DR: A new method is presented that attempts to quantify the style variation within a document using character n-gram profiles and a style change function based on an appropriate dissimilarity measure originally proposed for author identification.
Abstract: The task of intrinsic plagiarism detection deals with cases where no reference corpus is available and it is exclusively based on stylistic changes or inconsistencies within a given document. In this paper a new method is presented that attempts to quantify the style variation within a document using character n-gram profiles and a style change function based on an appropriate dissimilarity measure originally proposed for author identification. In addition, we propose a set of heuristic rules that attempt to detect plagiarism-free documents and plagiarized passages, as well as to reduce the effect of irrelevant style changes within a document. The proposed approach is evaluated on the recently-available corpus of the 1 st Int.

163 citations

Journal ArticleDOI
Lawrence N. Hudson1, Tim Newbold2, Tim Newbold3, Sara Contu1  +570 moreInstitutions (291)
TL;DR: The PREDICTS project as discussed by the authors provides a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use.
Abstract: The PREDICTS project—Projecting Responses of Ecological Diversity In Changing Terrestrial Systems (www.predicts.org.uk)—has collated from published studies a large, reasonably representative database of comparable samples of biodiversity from multiple sites that differ in the nature or intensity of human impacts relating to land use. We have used this evidence base to develop global and regional statistical models of how local biodiversity responds to these measures. We describe and make freely available this 2016 release of the database, containing more than 3.2 million records sampled at over 26,000 locations and representing over 47,000 species. We outline how the database can help in answering a range of questions in ecology and conservation biology. To our knowledge, this is the largest and most geographically and taxonomically representative database of spatial comparisons of biodiversity that has been collated to date; it will be useful to researchers and international efforts wishing to model and understand the global status of biodiversity.

162 citations

Journal ArticleDOI
TL;DR: The RUBICODE project as discussed by the authors draws on expertise from a range of disciplines to develop and integrate frameworks for assessing the impacts of environmental change on ecosystem service provision, and for rationalising biodiversity conservation in that light.
Abstract: The RUBICODE project draws on expertise from a range of disciplines to develop and integrate frameworks for assessing the impacts of environmental change on ecosystem service provision, and for rationalising biodiversity conservation in that light. With such diverse expertise and concepts involved, interested parties will not be familiar with all the key terminology. This paper defines the terms as used within the project and, where useful, discusses some reasoning behind the definitions. Terms are grouped by concept rather than being listed alphabetically.

161 citations

Journal ArticleDOI
TL;DR: A theoretical framework based on the theory of contextualism is proposed and applied in the analysis of the processes of formulating, implementing and adopting a security policy in two different organisations.

161 citations

Journal ArticleDOI
TL;DR: This paper presents methods to handle imbalanced multi-class textual datasets based on two text corpora of two languages, namely, newswire stories in English and newspaper reportage in Arabic and explores text sampling methods in order to construct a training set according to a desirable distribution over the classes.
Abstract: Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorization problem. Very often, there are extremely few training texts at least for some of the candidate authors or there is a significant variation in the text-length among the available training texts of the candidate authors. Moreover, in this task usually there is no similarity between the distribution of training and test texts over the classes, that is, a basic assumption of inductive learning does not apply. In this paper, we present methods to handle imbalanced multi-class textual datasets. The main idea is to segment the training texts into text samples according to the size of the class, thus producing a fairer classification model. Hence, minority classes can be segmented into many short samples and majority classes into less and longer samples. We explore text sampling methods in order to construct a training set according to a desirable distribution over the classes. Essentially, by text sampling we provide new synthetic data that artificially increase the training size of a class. Based on two text corpora of two languages, namely, newswire stories in English and newspaper reportage in Arabic, we present a series of authorship identification experiments on various multi-class imbalanced cases that reveal the properties of the presented methods.

161 citations


Authors

Showing all 2889 results

Network Information
Related Institutions (5)
Polytechnic University of Catalonia
45.3K papers, 949.3K citations

89% related

Aristotle University of Thessaloniki
58.2K papers, 1.4M citations

88% related

Lancaster University
44.5K papers, 1.6M citations

88% related

City University of Hong Kong
60.1K papers, 1.7M citations

87% related

Aalto University
32.6K papers, 829.6K citations

87% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
202345
202292
2021479
2020493
2019543
2018447