Lukasz Bolikowski
Researcher at University of Warsaw
Publications - 18
Citations - 346
Lukasz Bolikowski is an academic researcher at the University of Warsaw. The author has contributed to research on topics including metadata and digital libraries. The author has an h-index of 8 and has co-authored 18 publications receiving 282 citations.
Papers
Journal Article
CERMINE: automatic extraction of structured metadata from scientific literature
TL;DR: The overall workflow architecture of CERMINE is outlined, details about the implementation of individual steps are provided, and an evaluation of the extraction workflow on a large dataset showed good performance for most metadata types.
Journal Article
OpenAIREplus: the European Scholarly Communication Data Infrastructure
TL;DR: The high-level architecture and functionalities of that infrastructure, including services designed to collect, interlink and provide access to peer-reviewed and non-peer-reviewed publications, datasets, and projects of the European Commission and national funding schemes, are described.
Proceedings Article
CERMINE -- Automatic Extraction of Metadata and References from Scientific Literature
TL;DR: The paper describes the overall workflow architecture of CERMINE, provides details about individual implementations and reports evaluation methodology and results.
Journal Article
GROTOAP2 -- The Methodology of Creating a Large Ground Truth Dataset of Scientific Articles
TL;DR: GROTOAP2 is a large dataset of ground truth files containing labelled fragments of scientific articles in PDF format. It is useful for training and evaluating document content analysis solutions and was successfully used to train CERMINE, the authors' system for extracting metadata and content from scientific articles.
Proceedings Article
A Modular Metadata Extraction System for Born-Digital Articles
TL;DR: A comprehensive system for extracting metadata from scholarly articles is presented. It is based on a modular workflow that allows for evaluation, unit testing and replacement of individual components, and it is optimized for processing born-digital documents but may accept scanned document images as well.