Patent

Method and apparatus for preventing topic drift in queries in hyperlinked environments

Abstract
A method and apparatus for preventing topic drift in queries in hyperlinked environments uses equivalence components for ranking pages containing information that is relevant to the topic of a user query input to a search engine. The method includes the steps of providing a query to a search engine, where the query represents a predetermined topic; retrieving at least one page associated with the query; constructing a graph representing the pages in memory; creating at least one equivalence component representing a subset of the graph; processing each equivalence component; eliminating the equivalence component in accordance with whether it matches the predetermined topic; and ranking the remaining pages.
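
The steps listed in the abstract can be illustrated with a minimal Python sketch. The abstract does not define how an equivalence component is built, how it is matched against the topic, or how the remaining pages are ranked, so everything concrete here is an assumption made for illustration: an equivalence component is taken to be a connected component of the undirected link graph, a component "matches" when at least half of its pages contain a query term, and the surviving pages are ranked by in-degree. The function names (`equivalence_components`, `matches_topic`, `rank_pages`) and the `threshold` parameter are hypothetical and do not come from the patent.

```python
from collections import defaultdict


def equivalence_components(pages, links):
    """Group pages into connected components of the undirected link graph.

    Assumption: this stands in for the patent's "equivalence component",
    which the abstract does not define.
    """
    parent = {p: p for p in pages}

    def find(p):
        # Union-find lookup with path halving.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    for src, dst in links:
        if src in parent and dst in parent:
            parent[find(src)] = find(dst)

    groups = defaultdict(list)
    for p in pages:
        groups[find(p)].append(p)
    return list(groups.values())


def matches_topic(component, pages, query_terms, threshold=0.5):
    """Hypothetical topic test: enough pages mention at least one query term."""
    hits = sum(
        1 for p in component if query_terms & set(pages[p].lower().split())
    )
    return hits / len(component) >= threshold


def rank_pages(pages, links, query):
    """Drop off-topic components, then rank the rest by in-degree (assumed)."""
    query_terms = set(query.lower().split())
    kept = set()
    for component in equivalence_components(pages, links):
        if matches_topic(component, pages, query_terms):
            kept.update(component)

    indegree = {p: 0 for p in kept}
    for src, dst in links:
        if src in kept and dst in kept and src != dst:
            indegree[dst] += 1
    # Highest in-degree first; ties broken by page id for determinism.
    return sorted(kept, key=lambda p: (-indegree[p], p))
```

With `pages` given as a mapping from page id to page text and `links` as a list of `(source, destination)` pairs, `rank_pages(pages, links, "jaguar car")` would discard any component whose pages are mostly about the animal and rank only the car-related pages. A real system would likely use a stronger relevance test and a connectivity-based ranking; the in-degree ordering is only a placeholder.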

Citations
Patent

Method and apparatus for digital media management, retrieval, and collaboration

TL;DR: The glossary manager lets each client customize terminology to the needs of a particular business, using glossaries to provide more familiar context for their users.
Patent

Method and apparatus for ranking web page search results

TL;DR: A method and apparatus for ranking a plurality of pages identified during a search of a linked database forms a linear combination of two or more matrices and uses the coefficients of the eigenvector of the resulting matrix to rank the quality of the pages.
Patent

Communications network with converged services

TL;DR: A shared service VPN architecture is proposed in which trust and security are established at the edge of the network, as the information enters from the customer's site.
Patent

Method and apparatus for measuring similarity among electronic documents

TL;DR: A method and apparatus are provided for determining when electronic documents stored in a large collection of documents are similar to one another.
Patent

Web page connectivity server construction

TL;DR: A process is presented for constructing a server that collects, arranges, and stores data defining the connectivity of pages on the World Wide Web (Web). The process input is a set of compressed ASCII links files, each of which is a series of source URLs and corresponding destination URLs.
References
Journal ArticleDOI

The anatomy of a large-scale hypertextual Web search engine

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
Patent

Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis

TL;DR: A set of documents is ranked according to content and connectivity using topic distillation; a relevance weight is assigned to each node, and nodes in the second subset having a relevance weight less than a predetermined threshold are pruned from the graph.
Patent

Method for clustering closely resembling data objects

TL;DR: A computer-implemented method determines the resemblance of data objects such as Web pages: each data object is partitioned into a sequence of tokens, and the tokens are grouped into overlapping sets to form shingles.
Patent

Method for identifying near duplicate pages in a hyperlinked database

TL;DR: A method is described for identifying pages that are near duplicates in a linked database: two pages are selected, a first page and a second page, and the number of outgoing links for each selected page is determined.
Patent

Document filtering via directed acyclic graphs

TL;DR: A plurality of user queries including terms connected by logical operators is received, and terms and subexpressions are combined into distinct subexpressions and embedded into a directed acyclic graph (DAG) having a plurality of nodes.