Journal ArticleDOI
Comparison of vector space model methodologies to reconcile cross-species neuroanatomical concepts.
TLDR
Results indicate that the NTAR system could assist neuroscientists with thesauri creation for closely related, highly detailed neuroanatomical domains.Abstract:
Generating informational thesauri that classify, cross-reference, and retrieve diverse and highly detailed neuroscientific information requires identifying related neuroanatomical terms and acronyms within and between species (Gorin et al., 2001) Manual construction of such informational thesauri is laborious, and we describe implementing and evaluating a neuroanatomical term and acronym reconciliation (NTAR) system to assist domain experts with this task. NTAR is composed of two modules. The neuroanatomical term extraction (NTE) module employs a hidden Markov model (HMM) in conjunction with lexical rules to extract neuroanatomical terms (NT) and acronyms (NA) from textual material. The output of the NTE is formatted into collections of term- or acronym-indexed documents composed of sentences and word phrases extracted from textual material. The second information retrieval (IR) module utilizes a vector space model (VSM) and includes a novel, automated relevance feedback algorithm. The IR module retrieves statistically related neuroanatomical terms and acronyms in response to queried neuroanatomical terms and acronyms. Neuroanatomical terms and acronyms retrieval obtained from term-based inquiries were compared with (1) term retrieval obtained by including automated relevance feedback and with (2) term retrieval using “document-to-document” comparisons (context-based VSM). The retrieval of synonymous and similar primate and macaque thalamic terms and acronyms in response to a query list of human thalamic terminology by these three IR approaches was compared against a previously published, manually constructed concordance table of homologous cross-species terms and acronyms. Term-based VSM with automated relevance feedback retrieved 70% and 80% of these primate and macaque terms and acronyms, respectively, listed in the concordance table. Automated feedback algorithm correctly identified 87% of the macaque terms and acronyms that were independently selected by a domain expert as being appropriate for manual relevance feedback. Context-based VSM correctly retrieved 97% and 98% of the primate and macaque terms and acronyms listed in the term homology table. These results indicate that the NTAR system could assist neuroscientists with thesauri creation for closely related, highly detailed neuroanatomical domains.read more
Citations
More filters
Journal Article
fMRI neuroinformatics
Finn Årup Nielsen,Mark Schram Christensen,Kristoffer Hougaard Madsen,Torben Ellegaard Lund,Lars Kai Hansen +4 more
TL;DR: The handling, processing, and analysis of fMRI data would be inconceivable without computer-based methods, and fMRI neuroinformatics is concerned with research, development, and operation of these methods.
Journal ArticleDOI
Using text mining to link journal articles to neuroanatomical databases
Leon French,Paul Pavlidis +1 more
TL;DR: An approach for automatically mapping formal identifiers of neuroanatomical regions to text found in journal abstracts is applied, applying it to a large body of abstracts from the Journal of Comparative Neurology (JCN).
Book ChapterDOI
Text-mining and neuroscience.
Kyle H. Ambert,Aaron Cohen +1 more
TL;DR: Some of the recent developments for better using the vast amount of textual information generated in neuroscience research and publication are reviewed and several use cases are suggested that will demonstrate how bench neuroscientists can take advantage of the resources that are available.
Journal ArticleDOI
Text-mining tools for optimizing community database curation workflows in neuroscience
TL;DR: Three sets of studies designed to construct automated tools for alleviating three bottlenecks in the workflow of a community-curated knowledge base of neuroscience-related information are presented.
References
More filters
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Book
Introduction to Modern Information Retrieval
Gerard Salton,Michael J. McGill +1 more
TL;DR: Reading is a need and a hobby at once and this condition is the on that will make you feel that you must read.
Book
Modern Information Retrieval
TL;DR: In this article, the authors present a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective, which provides an up-to-date student oriented treatment of the subject.
Journal ArticleDOI
An algorithm for suffix stripping
TL;DR: An algorithm for suffix stripping is described, which has been implemented as a short, fast program in BCPL, and performs slightly better than a much more elaborate system with which it has been compared.
Journal ArticleDOI
A vector space model for automatic indexing
Gerard Salton,A. Wong,C. S. Yang +2 more
TL;DR: An approach based on space density computations is used to choose an optimum indexing vocabulary for a collection of documents, demonstating the usefulness of the model.