scispace - formally typeset
Patent

Devices and methods for generating and managing a database

TLDR
In this article, an automated method of creating or updating a database of resumes and related documents is proposed. But, the method is limited to the retrieval of documents from a network of documents, where the document is the most relevant document to the subject taxonomy stored in the retrieval priority list.
Abstract
An automated method of creating or updating a database of resumes and related documents, the method comprising, a) entering at least one example document that is relevant to a subject taxonomy in a retrieval priority list, if there is a plurality of example documents stored in the retrieval priority list, ranking the example documents according to the relevancy of the example documents to the subject taxonomy; b) retrieving a document from a network of documents, where the document is the most relevant document to the subject taxonomy stored in the retrieval priority list; c) harvesting information from specified fields of the document; d) classifying the information into one or more classes according to specified categories of the subject taxonomy; e) storing the information into a database; f) determining whether the information are links to other documents; g) ranking the link's according to relevancy to the subject taxonomy, and storing the links in the retrieval priority list according to the relevancy; h) terminating the method, provided the method's stop criteria have been met; and i) repeating steps b) through h), provided the method's stop criteria has not been met.

read more

Citations
More filters
Patent

Common common object

TL;DR: In this paper, an enterprise management information in a first format (500) for use by a first computerized system is transformed into an intermediate format (514) to readily make the stored enterprise information available for use in a second computerised system that utilizes a second format (520).
Patent

Product common object

TL;DR: In this paper, the stored product management information in a first format for use by a first computerized system is transformed to readily make the stored information available for use in a second computerised system that utilizes a second format in a cost-efficient and time-efficient manner.
Patent

Method and apparatus for minimizing storage of common attachment files in an e-mail communications server

TL;DR: In this paper, the authors proposed an e-mail communications system that minimizes the number of duplicate copies of common attachment files to email communications that are stored in the mail store of an E-mail server.
Patent

Semantic network methods to disambiguate natural language meaning

Lawrence Au
TL;DR: In this article, a data processor system automatically disambiguates a contextual meaning of natural language symbols to enable precise meanings to be stored for later retrieval from a natural language database, so that NLP database design is automatic, to enable flexible and efficient natural language interfaces to computers, household appliances and hand-held devices.
Patent

Method and apparatus for focused crawling

TL;DR: In this paper, the present invention relates to dynamic discovery of documents or information through a focused crawler or search engine, and it pertains to the field of computer software development.
References
More filters
Journal ArticleDOI

Focused crawling: a new approach to topic-specific Web resource discovery

TL;DR: A new hypertext resource discovery system called a Focused Crawler that is robust against large perturbations in the starting set of URLs, and capable of exploring out and discovering valuable resources that are dozens of links away from the start set, while carefully pruning the millions of pages that may lie within this same radius.
Journal ArticleDOI

Efficient crawling through URL ordering

TL;DR: In this paper, the authors study in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first, and they show that a good ordering scheme can obtain important pages significantly faster than one without.
Proceedings Article

Learning to extract symbolic knowledge from the World Wide Web

TL;DR: The goal of the research described here is to automatically create a computer understandable world wide knowledge base whose content mirrors that of the World Wide Web, and several machine learning algorithms for this task are described.
Patent

Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision

TL;DR: In this article, a set of logical forms for a query is compared against a set for each of the retrieved documents in order to ascertain a match between any such logical forms in both sets, and the retained documents are ranked in order of descending score and then presented to a user in that order.
Patent

System and method for the management of candidate recruiting information

TL;DR: In this paper, a system for automated candidate recruiting using a network includes a candidate web engine operable to communicate with the network and to present a candidate survey form to a client of the network.
Related Papers (5)