scispace - formally typeset
Search or ask a question
Institution

Turku Centre for Computer Science

FacilityTurku, Finland
About: Turku Centre for Computer Science is a facility organization based out in Turku, Finland. It is known for research contribution in the topics: Decidability & Word (group theory). The organization has 382 authors who have published 1027 publications receiving 19560 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A corpus targeted at protein, gene, and RNA relationships which serves as a resource for the development of information extraction systems and their components such as parsers and domain analyzers is introduced.
Abstract: Lately, there has been a great interest in the application of information extraction methods to the biomedical domain, in particular, to the extraction of relationships of genes, proteins, and RNA from scientific publications. The development and evaluation of such methods requires annotated domain corpora. We present BioInfer (Bio Information Extraction Resource), a new public resource providing an annotated corpus of biomedical English. We describe an annotation scheme capturing named entities and their relationships along with a dependency analysis of sentence syntax. We further present ontologies defining the types of entities and relationships annotated in the corpus. Currently, the corpus contains 1100 sentences from abstracts of biomedical research articles annotated for relationships, named entities, as well as syntactic dependencies. Supporting software is provided with the corpus. The corpus is unique in the domain in combining these annotation types for a single set of sentences, and in the level of detail of the relationship annotation. We introduce a corpus targeted at protein, gene, and RNA relationships which serves as a resource for the development of information extraction systems and their components such as parsers and domain analyzers. The corpus will be maintained and further developed with a current version being available at http://www.it.utu.fi/BioInfer .

479 citations

Proceedings Article
01 May 2014
TL;DR: This work proposes a two-layered taxonomy: a set of broadly attested universal grammatical relations, to which language-specific relations can be added, and a lexicalist stance of the Stanford Dependencies, which leads to a particular, partially new treatment of compounding, prepositions, and morphology.
Abstract: Revisiting the now de facto standard Stanford dependency representation, we propose an improved taxonomy to capture grammatical relations across languages, including morphologically rich ones. We suggest a two-layered taxonomy: a set of broadly attested universal grammatical relations, to which language-specific relations can be added. We emphasize the lexicalist stance of the Stanford Dependencies, which leads to a particular, partially new treatment of compounding, prepositions, and morphology. We show how existing dependency schemes for several languages map onto the universal taxonomy proposed here and close with consideration of practical implications of dependency representation choices for NLP applications, in particular parsing.

457 citations

MonographDOI
01 Oct 1996
TL;DR: Introduction and philosophy Chinese remainder algorithm in modular computations in algorithmics in bridging computation in coding theory in cryptography tutorial in information theory tutorial in algebra list of mathematical symbols.
Abstract: Introduction and philosophy Chinese remainder algorithm in modular computations in algorithmics in bridging computations in coding theory in cryptography tutorial in information theory tutorial in algebra list of mathematical symbols.

383 citations

Journal ArticleDOI
TL;DR: A detailed evaluation of the effects of training and testing on different resources is performed, providing insight into the challenges involved in applying a system beyond the data it was trained on, and several pitfalls that can make evaluations of PPI-extraction systems incomparable, or even invalid are identified.
Abstract: Automated extraction of protein-protein interactions (PPI) is an important and widely studied task in biomedical text mining. We propose a graph kernel based approach for this task. In contrast to earlier approaches to PPI extraction, the introduced all-paths graph kernel has the capability to make use of full, general dependency graphs representing the sentence structure. We evaluate the proposed method on five publicly available PPI corpora, providing the most comprehensive evaluation done for a machine learning based PPI-extraction system. We additionally perform a detailed evaluation of the effects of training and testing on different resources, providing insight into the challenges involved in applying a system beyond the data it was trained on. Our method is shown to achieve state-of-the-art performance with respect to comparable evaluations, with 56.4 F-score and 84.8 AUC on the AImed corpus. We show that the graph kernel approach performs on state-of-the-art level in PPI extraction, and note the possible extension to the task of extracting complex interactions. Cross-corpus results provide further insight into how the learning generalizes beyond individual corpora. Further, we identify several pitfalls that can make evaluations of PPI-extraction systems incomparable, or even invalid. These include incorrect cross-validation strategies and problems related to comparing F-score results achieved on different evaluation resources. Recommendations for avoiding these pitfalls are provided.

294 citations

Journal ArticleDOI
TL;DR: The notations of weighted interval-valued possibilistic mean value, variance and covariance of fuzzy numbers are introduced, which are consistent with the extension principle and it is shown that the weighted variance of linear combination of fuzzyNumbers can be computed in a similar manner as in probability theory.

251 citations


Authors

Showing all 383 results

NameH-indexPapersCitations
José A. Teixeira101141447329
Cunsheng Ding6125411116
Jun'ichi Tsujii5938915985
Arto Salomaa5637417706
Tero Aittokallio522718689
Risto Lahdelma481496637
Hannu Tenhunen4581911661
Mats Gyllenberg442048029
Sampo Pyysalo421538839
Olli Polo421405303
Pasi Liljeberg403066959
Tapio Salakoski382317271
Filip Ginter371567294
Robert Fullér371525848
Juha Plosila353424917
Network Information
Related Institutions (5)
Vienna University of Technology
49.3K papers, 1.3M citations

84% related

Eindhoven University of Technology
52.9K papers, 1.5M citations

83% related

Aalto University
32.6K papers, 829.6K citations

83% related

Carnegie Mellon University
104.3K papers, 5.9M citations

82% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20231
20223
20213
20209
20198
201816