scispace - formally typeset
Search or ask a question
Institution

Ontotext

CompanySofia, Bulgaria
About: Ontotext is a company organization based out in Sofia, Bulgaria. It is known for research contribution in the topics: Semantic Web & Ontology (information science). The organization has 54 authors who have published 97 publications receiving 3701 citations. The organization is also known as: Sirma AI.


Papers
More filters
Journal ArticleDOI
Atanas Kiryakov1, Borislav Popov1, Ivan Terziev1, Dimitar Manov1, Damyan Ognyanoff1 
TL;DR: This paper presents a semantically enhanced information extraction system, which provides automatic semantic annotation with references to classes in the ontology and to instances and argues that such large-scale, fully automatic methods are essential for the transformation of the current largely textual web into a Semantic Web.

651 citations

Book ChapterDOI
20 Oct 2003
TL;DR: A simplistic upper-level ontology is introduced which starts with some basic philosophic distinctions and goes down to the most popular entity types, thus providing many of the inter-domain common sense concepts and allowing easy domain-specific extensions.
Abstract: The Semantic Web realization depends on the availability of critical mass of metadata for the web content, linked to formal knowledge about the world. This paper presents our vision about a holistic system allowing annotation, indexing, and retrieval of documents with respect to real-world entities. A system (called KIM), partially implementing this concept is shortly presented and used for evaluation and demonstration. Our understanding is that a system for semantic annotation should be based upon specific knowledge about the world, rather than indifferent to any ontological commitments and general knowledge. To assure efficiency and reusability of the metadata we introduce a simplistic upper-level ontology which starts with some basic philosophic distinctions and goes down to the most popular entity types (people, companies, cities, etc.), thus providing many of the inter-domain common sense concepts and allowing easy domain-specific extensions. Based on the ontology, an extensive knowledge base of entities descriptions is maintained. Semantically enhanced information extraction system providing automatic annotation with references to classes in the ontology and instances in the knowledge base is presented. Based on these annotations, we perform IR-like indexing and retrieval, further extended using the ontology and knowledge about the specific entities.

366 citations

Book ChapterDOI
20 Nov 2005
TL;DR: The experiment demonstrates that OWLIM can scale to millions of statements even on commodity desktop hardware, and shows that such reasoners can be efficient for very big knowledge bases, in scenarios when delete operations should not be handled in real-time.
Abstract: OWLIM is a high-performance Storage and Inference Layer (SAIL) for Sesame, which performs OWL DLP reasoning, based on forward-chaining of entilement rules. The reasoning and query evaluation are performed in-memory, while in the same time OWLIM provides a reliable persistence, based on N-Triples files. This paper presents OWLIM, together with an evaluation of its scalability over synthetic, but realistic, dataset encoded with respect to PROTON ontology. The experiment demonstrates that OWLIM can scale to millions of statements even on commodity desktop hardware. On an almost-entry-level server, OWLIM can manage a knowledge base of 10 million explicit statements, which are extended to about 19 millions after forward chaining. The upload and storage speed is about 3,000 statement/sec. at the maximal size of the repository, but it starts at more than 18,000 (for a small repository) and slows down smoothly. As it can be expected for such an inference strategy, delete operations are expensive, taking as much as few minutes. In the same time, a variety of queries can be evaluated within milliseconds. The experiment shows that such reasoners can be efficient for very big knowledge bases, in scenarios when delete operations should not be handled in real-time.

308 citations

Journal ArticleDOI
Borislav Popov1, Atanas Kiryakov1, Damyan Ognyanoff1, Dimitar Manov1, Angel Kirilov1 
TL;DR: The KIM platform as mentioned in this paper provides a knowledge and information management framework and services for automatic semantic annotation, indexing, and retrieval of documents, based on a simple model of real-world entity concepts and quasi-exhaustive instance knowledge.
Abstract: The KIM platform provides a novel Knowledge and Information Management framework and services for automatic semantic annotation, indexing, and retrieval of documents. It provides a mature and semantically enabled infrastructure for scalable and customizable information extraction (IE) as Our understanding is that a system for semantic annotation should be based upon a simple model of real-world entity concepts, complemented with quasi-exhaustive instance knowledge. To ensure efficiency, easy sharing, and reusability of the metadata we introduce an upper-level ontology. Based on the ontology, a large-scale instance base of entity descriptions is maintained. The knowledge resources involved are handled by use of state-of-the-art Semantic Web technology and standards, including RDF(S) repositories, ontology middleware and reasoning. From a technical point of view, the platform allows KIM-based applications to use it for automatic semantic annotation, for content retrieval based on semantic queries, and for semantic repository access. As a framework, KIM also allows various IE modules, semantic repositories and information retrieval engines to be plugged into it. This paper presents the KIM platform, with an emphasis on its architecture, interfaces, front-ends, and other technical issues.

297 citations

Book ChapterDOI
20 Oct 2003
TL;DR: The KIM platform allows KIM-based applications to use it for automatic semantic annotation, content retrieval based on semantic restrictions, and querying and modifying the underlying ontologies and knowledge bases.
Abstract: The KIM platform provides a novel Knowledge and Information Management infrastructure and services for automatic semantic annotation, indexing, and retrieval of documents. It provides mature infrastructure for scaleable and customizable information extraction (IE) as well as annotation and document management, based on GATE. In order to provide basic level of performance and allow easy bootstrapping of applications, KIM is equipped with an upper-level ontology and a knowledge base providing extensive coverage of entities of general importance. The ontologies and knowledge bases involved are handled using cutting edge Semantic Web technology and standards, including RDF(S) repositories, ontology middleware and reasoning. From technical point of view, the platform allows KIM-based applications to use it for automatic semantic annotation, content retrieval based on semantic restrictions, and querying and modifying the underlying ontologies and knowledge bases. This paper presents the KIM platform, with emphasize on its architecture, interfaces, tools, and other technical issues.

291 citations


Authors

Showing all 54 results

Network Information
Related Institutions (5)
Centrum Wiskunde & Informatica
4.8K papers, 128.2K citations

84% related

fondazione bruno kessler
4.7K papers, 94.4K citations

83% related

National Institute of Informatics
7.4K papers, 129.3K citations

81% related

Information Sciences Institute
2K papers, 114.6K citations

81% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20213
20203
20194
20184
20172
20166