scispace - formally typeset
W

W. John Wilbur

Researcher at National Institutes of Health

Publications -  145
Citations -  6881

W. John Wilbur is an academic researcher from National Institutes of Health. The author has contributed to research in topics: Sentence & Document retrieval. The author has an hindex of 39, co-authored 143 publications receiving 6366 citations. Previous affiliations of W. John Wilbur include University of Maryland, College Park.

Papers
More filters
Journal ArticleDOI

Tagging gene and protein names in biomedical text.

TL;DR: This work proposes to approach the detection of gene and protein names in scientific abstracts as part-of-speech tagging, the most basic form of linguistic corpus annotation, and demonstrates that this method can be applied to large sets of MEDLINE abstracts, without the need for special conditions or human experts to predetermine relevant subsets.
Journal ArticleDOI

The automatic identification of stop words

TL;DR: It is shown how the concept of relevance may be replaced by the condition of being highly rated by a similarity measure and it becomes possible to identify the stop words in a cullectmn by automated statistical testing.
Journal ArticleDOI

GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data

TL;DR: GeneWays analyzes interactions between molecular substances, drawing on multiple sources of information to infer a consensus view of molecular networks, and is designed as an open platform, allowing researchers to query, review, and critique stored information.
Journal ArticleDOI

GENETAG: a tagged corpus for gene/protein named entity recognition.

TL;DR: The annotation of GENETAG required intricate manual judgments by annotators which hindered tagging consistency, and the data were pre-segmented into words, to provide indices supporting comparison of system responses to the "gold standard", however, character- based indices would have been more robust than word-based indices.