scispace - formally typeset
Search or ask a question

Showing papers on "Metadata repository published in 2007"


Patent
20 Aug 2007
TL;DR: In this paper, a system for utilizing metadata created either at a central location for shared use by connected users, or at each individual user's location, to enhance user's enjoyment of available broadcast programming content is presented.
Abstract: A system for utilizing metadata created either at a central location for shared use by connected users, or at each individual user's location, to enhance user's enjoyment of available broadcast programming content. A variety of mechanisms are employed for automatically and manually identifying and designating programming segments, associating descriptive metadata which the identified segments, distributing the metadata for use at client locations, and using the supplied metadata to selectively record and playback desired programming.

2,229 citations


Patent
22 Mar 2007
TL;DR: In this paper, an information dispersal sytem in which original data to be stored is separated into a number of data "slices" in such a manner that the data in each subset is less usable or less recognizable or completely unusable or completely unrecognizable by itself except when combined with some or all of the other data subsets.
Abstract: Briefly, the present invention relates to an information dispersal sytem in which original data to be stored is separated into a number of data 'slices' in such a manner that the data in each subset is less usable or less recognizable or completely unusable or completely unrecognizable by itself except when combined with some or all of the other data subsets. These data subsets are stored on separate storage devices as a way of increasing privacy and security. In accordance with an important aspect of the invention, a metadata management system stores and indexes user files across all of the storage nodes. A number of applications run on the servers supporting these storage nodes and are responsible for controlling the metadata. Metadata is the information about the data, the data slices or data subsets and the way in which these data subsets are dispersed among different storage nodes running over the network. As used herein, metadata includes data source names, their size, last modification date, authentication information etc. This information is required to keep track of dispersed data subsets among all the nodes in the system. Every time new data subsets are stored and old ones are removed from the storage nodes, the metadata is updated. In accordance with an important aspect of the invention, the metadata management system stores metadata for dispersed data where: the dispersed data is in several pieces; the metadata is in a separate dataspace from the dispersed data. Accordingly, the metadata management system is able to manage the metadata in a manner that is computationally efficient relative to known systems in order to enable broad use of the invention using the types of computers generally used by businesses, consumers and other organizations currently.

947 citations


Patent
31 Aug 2007
TL;DR: In this paper, a set of metadata is searched for metadata corresponding to the action, where the search is limited by the action context, and selected metadata for the action is inserted into a collection, including a reference to the set of executable instructions that implements the action and a description of the action.
Abstract: A computer readable storage medium includes executable instructions to receive a request for an action. An action context is received where the action context includes an application requesting the action. A set of metadata is searched for metadata corresponding to the action, where the search is limited by the action context. Selected metadata for the action is inserted into a collection. The selected metadata is a result of searching the set of metadata. The selected metadata includes a reference to the set of executable instructions that implements the action and a description of the action. The collection is then returned.

643 citations


01 Aug 2007
TL;DR: This document defines fifteen metadata elements for resource description in a cross-disciplinary information environment and provides information for the Internet community.
Abstract: This document defines fifteen metadata elements for resource description in a cross-disciplinary information environment. This memo provides information for the Internet community.

178 citations


Patent
06 Feb 2007
TL;DR: In this article, content identifiers are associated with respective metadata and user's experience with the content can be enhanced through use of the metadata, through which a user can get a better experience with content.
Abstract: Content identifiers are associated with respective metadata. Through use of the metadata, a user's experience with the content can be enhanced. A variety of other arrangements are also detailed.

175 citations


Patent
09 Aug 2007
TL;DR: In this paper, the authors present systems and methods for automating the EII, using a smart integration engine based on metadata, which is used for seamless integration of a fully-distributed organization with many data sources and technologies.
Abstract: The present invention discloses systems and methods for automating the EII, using a smart integration engine based on metadata. On-line execution (i.e. data access, retrieval, or update) is automated by integrating heterogeneous data sources via a centralized smart engine based on metadata of all data sources managed in a metadata repository. The data-source assets are mapped to business metadata (terminology) giving programmers the ability to use business terms, and overcome technical terms. IT departments can use the business-level terms for easy and fast programming of all services “at the business level”. The integration is performed by the engine (via pre-configuration) automatically, dynamically, and on-line, regardless of topology or technology changes, without user or administrator intervention. MDOA is a high-level concept in which the metadata maps the technical low-level terms to business high-level terms. MDOA is used for seamless integration of a fully-distributed organization with many data sources and technologies.

152 citations


Patent
03 May 2007
TL;DR: In this paper, a graphical user interface (GUI) control based on the metadata associated with the search hits is constructed and displayed with search results in a standard view, and the metadata in the search results is arranged in a tabular view which is embedded in the display of search results and rendered invisible until selected by the user.
Abstract: Records in databases or unstructured files are enriched with metadata and are indexed for retrieval by a search engine. In response to a search request, a graphical user interface (GUI) control based on the metadata associated with the search hits is constructed and displayed with the search results in a standard view. Selection of a metadata value via the GUI control filters the previously matched records down to those matching the value selected via the GUI control. The metadata in the search results is arranged in a tabular view which is embedded in the display of search results and rendered invisible until selected by the user. Reports can be constructed from an identifier each returned record set for presenting, analyzing and modifying the data, and for generating further reports.

145 citations


Patent
21 Dec 2007
TL;DR: In this article, the authors proposed a method for targeting advertisements that selects a first content item that has an associated set of metadata, such as a tag or keyword used by the first user in relation to the content item.
Abstract: A method for targeting advertisements selects a first content item that has an associated set of metadata. The associated metadata is for providing information regarding the first content item. The method identifies a first user having a relationship to the first content item. The first user has a set of profile information. The method determines a first metadata element such as, for example, a tag or keyword used by the first user in relation to the first content item. The first metadata element is generated by one or more users of the first content item such as, for example, the first user or a second user. The method selects a first advertisement for presentation to the first user. The selection process uses data associated with one or more of the first content item, the first user, and the first metadata element. Additional embodiments of the invention include a system and a computer readable medium for implementation of the foregoing.

130 citations


Patent
01 May 2007
TL;DR: In this paper, the first metadata corresponding to a first video asset is generated, which includes text describing contents displayed when the first video assets is played and a pointer to a location within a video file that corresponds to the video asset.
Abstract: The invention relates to methods and apparatus for providing media assets over a network. First metadata corresponding to a first video asset is generated. The first metadata includes text describing contents displayed when the first video asset is played and a pointer to a location within a video file that corresponds to the first video asset. The pointer includes at least two of a start location, an end location, and a duration. The first metadata is transmitted for receipt by a client system capable of playing the first video asset. The client system displays portions of the text of the first metadata to a user of the client system, and uses the pointer of the first metadata to facilitate requesting the first video asset from a video server for transmitting video assets over the network.

119 citations


Patent
30 Mar 2007
TL;DR: In this article, a method, system, and computer-readable storage medium are disclosed for exporting data from a deduplication data store to a non-deduplicated data store.
Abstract: A method, system, and computer-readable storage medium are disclosed for exporting data from a deduplication data store to a non-deduplication data store. A set of data may be stored in the deduplication data store in a format eliminating one or more duplicates of data objects in the set of data. The set of data in the deduplication data store may be stored separately from metadata describing the set of data. The set of data stored in the deduplication data store may be read. The set of data read from the deduplication data store and the metadata may be stored in a non-deduplication data store. In the non-deduplication data store, the set of data is stored in a format preserving the one or more duplicates of data objects in the set of data.

114 citations


Patent
26 Jun 2007
TL;DR: In this paper, a method and system for extracting relevant information from content metadata is provided, where user access to content is monitored and a set of extraction rules for information extraction is selected.
Abstract: A method and system for extracting relevant information from content metadata is provided User access to content is monitored A set of extraction rules for information extraction is selected Key information is extracted from metadata for the content based on the selected extraction rules Additionally, a type for the content can be determined, and a set of extraction rules is selected based on the content type The key information is used in queries for searching information of potential interest to the user, related to the content accessed

Patent
16 Apr 2007
TL;DR: In this article, a method of acquiring data associated with a television program consistent with certain embodiments involves acquiring information that identifies a currently playing television program; receiving a command from a user interface that selects an image forming a portion of a video displayed on the television, wherein said frame of video is a part of the television program.
Abstract: A method of acquiring data associated with a television program consistent with certain embodiments involves acquiring information that identifies a currently playing television program; receiving a command from a user interface that selects an image forming a portion of a frame of video displayed on the television, wherein said frame of video is a portion of the television program; accessing a specified web site that contains a database of metadata associated with television programs via the Internet; querying the specified web site for metadata associated with the image by providing the image along with the information that identifies the currently playing television program; receiving a response from the specified web site that provides metadata associated with the image; and displaying at least a portion of the metadata. This abstract is not to be considered limiting, since other embodiments may deviate from the features described in this abstract.

Patent
09 Apr 2007
TL;DR: In this article, a system for generating a composite video having a plurality of video assets is described. But the system is limited to a single video asset and does not support the generation of video data from multiple video assets.
Abstract: Systems and methods for generating a composite video having a plurality of video assets are provided. Systems may include storage having a plurality of video assets, a metadata generator, and a composite video generator. The metadata generator processes respective ones of the video assets to generate a metadata track representative of information that is descriptive of the content of the video asset. The composite video generator receives a plurality of metadata tracks and, in response, processes the associated video assets and the metadata tracks to generate a composite video asset having video data from the video assets. The composite video asset may have a metadata list panel that presents information representative of the metadata information as video data appearing within the composite video asset. The metadata list panel visually presents the sequence of video assets in the composite video asset.

Patent
02 Jan 2007

Patent
29 Oct 2007
TL;DR: In this article, the authors present a method for obtaining metadata associated with images, audio and video from a handheld device by computing attributes of the data using a processor, which utilizes the processor to operate on the data.
Abstract: The present invention relates generally to obtaining metadata associated with images, audio and video. Once claim recites a method including: obtaining data corresponding to media content from a handheld device, the data representing picture elements of an image or video or representing audible portions of an audio signal; computing attributes of the data using a processor, said act of computing utilizes the processor to operate on the data; using computed attributes of the data to identify the media content or to identify metadata associated with the media content; obtaining metadata associated with the media content; and providing the metadata to the handheld device from a network resource. Other combinations and claims are provided as well.

Journal ArticleDOI
01 Feb 2007
TL;DR: The experimental results show that, by using INFOMAP, this paper can extract author, title, journal, volume, number (issue), year, and page information from different kinds of reference styles with a high degree of precision.
Abstract: The integration of bibliographical information on scholarly publications available on the Internet is an important task in the academic community. Accurate reference metadata extraction from such publications is essential for the integration of metadata from heterogeneous reference sources. In this paper, we propose a hierarchical template-based reference metadata extraction method for scholarly publications. We adopt a hierarchical knowledge representation framework called INFOMAP, which automatically extracts metadata. The experimental results show that, by using INFOMAP, we can extract author, title, journal, volume, number (issue), year, and page information from different kinds of reference styles with a high degree of precision. The overall average accuracy is 92.39% for the six major reference styles compared in this study.

Patent
20 Jul 2007
TL;DR: In this article, an information processing apparatus and method, a program, and a recording medium, in which a content is recommended to each user on the basis of even the metadata that is assigned with no classification.
Abstract: Disclosed herein are an information processing apparatus and method, a program, and a recording medium, in which a content is recommended to each user on the basis of even the metadata that is assigned with no classification. A metadata analysis block resolves metadata acquired by a metadata acquisition block into components. A dictionary data generation block generates dictionary data in which genre is correlated with keyword and each component. An associated-information database generation block references the dictionary data to assign genre to the metadata which are assigned with no genre, thereby generating an associated-information database of content. An associated-information search block references the dictionary data to identify a genre from a keyword of interest data to search for associated information, thereby recommending content to the user. The present invention is applicable to personal computers or HDD recorders.

Proceedings ArticleDOI
11 Jun 2007
TL;DR: It is argued that the relational model augmented with queries as data values is a natural way to uniformly model data, arbitrary metadata and their associations, and relational queries with a join mechanism augmented to permit matching of query result relations, instead of only atomic values, is an elegant way of uniformly query across data and metadata.
Abstract: There is a growing need to associate a variety of metadata with the underlying data, but a simple, elegant approach to uniformly model and query both the data and the metadata has been elusive. In this paper, we argue that (1) the relational model augmented with queries as data values is a natural way to uniformly model data, arbitrary metadata and their associations, and (2) relational queries with a join mechanism augmented to permit matching of query result relations, instead of only atomic values, is an elegant way to uniformly query across data and metadata. We describe the architecture of a system we have prototyped for this purpose, demonstrate the generality of our approach and evaluate the performance of the system, in comparison with previous proposals for metadata management.

Patent
27 Nov 2007
TL;DR: In this paper, a user search request is made and a subset of requested objects is defined that correspond to the user search, and a relevancy value is computed for each of the subsets of the requested objects using the fixed parameter metadata and/or the dynamic metadata.
Abstract: A method of displaying a plurality of digital objects includes storing the plurality of objects in a database, associating fixed parameter metadata and dynamic metadata with each of the digital objects, and classifying each of the digital objects in the database based on at least one of the fixed parameter metadata and the dynamic metadata. A user search request is then received and a subset of requested objects is defined that correspond to the user search request. A relevancy value is computed for each of the subset of requested objects using the fixed parameter metadata and/or the dynamic metadata. The objects are then displayed on a user display such that the most relevant objects are presented to the user and less relevant objects are spaced from the most relevant object. The display may be two- or three-dimensional and includes all relevant images in a single display.

Patent
22 Jan 2007
TL;DR: In this paper, a method for automatically presenting digital content to a user of a computer computes task-related metadata from data which may include i) a most recent event record, ii) an automatic prediction of the current task being executed by the user, iii) past event records and associated task identifiers stored in a database and/or iv) content in resources associated with a given task.
Abstract: A method for automatically presenting digital content to a user of a computer computes task-related metadata from data which may include i) a most recent event record, ii) a most recent specification received from the user of a task being performed by the user, iii) an automatic prediction of the current task being executed by the user, iii) past event records and associated task identifiers stored in a database and/or iv) content in resources associated with a given task. The task-related metadata is communicated to a digital content service provider. Digital content relevant to the user based on the task-related metadata is then selected, sent to the computer, and presented to the user. Rules and filters control the metadata going out and the content coming in, allowing automatic adaptation based on the current task, characteristics of the content, and other factors.

Patent
Gregory J. Wolff1, Kurt Piersol1
18 May 2007
TL;DR: In this article, a method and apparatus for synchronizing distributed work is described, which comprises receiving first and second metadata entries, adding the first andsecond metadata entries to a set corresponding to a digital object, and providing access to first and Second unique identifiers used for referencing the first metadata entries respectively.
Abstract: A method and apparatus is disclosed herein for synchronizing distributed work. In one embodiment, the method comprises receiving first and second metadata entries, adding the first and second metadata entries to a set corresponding to a digital object, and providing access to first and second unique identifiers used for referencing the first and second metadata entries respectively, where the first and second unique identifiers are based on contents of the first and second metadata entries respectively.

Patent
14 Feb 2007
TL;DR: An automated method of attaching metadata to a segment of content using a programmed processor consistent with certain embodiments involves retrieving metadata relating to the segment and rendering the metadata as visually perceptible text in one or more frames of video.
Abstract: An automated method of attaching metadata to a segment of content using a programmed processor consistent with certain embodiments involves retrieving metadata relating to the segment of content; rendering the metadata as visually perceptible text in one or more frames of video; appending the one or more frames of video to the video as the first one or more frames of the video to produce metadata enhanced video; and storing or transmitting the metadata enhanced video. This abstract is not to be considered limiting, since other embodiments may deviate from the features described in this abstract.

Proceedings Article
27 Aug 2007
TL;DR: An ontology-based metadata interoperability approach, which exploits, in an optimal way, the semantics of metadata schemas, and the use of CIDOC/CRM ontology as a mediating schema is proposed.
Abstract: Metadata interoperability is an active research area, especially for cultural heritage collections, which consist of heterogeneous objects described by a variety of metadata schemas. In this paper we propose an ontology-based metadata interoperability approach, which exploits, in an optimal way, the semantics of metadata schemas. In particular, we propose the use of CIDOC/CRM ontology as a mediating schema and present a methodology for mapping DC Type Vocabulary to CIDOC/CRM, demonstrating a real-world effort for ontology-based metadata integration.

Patent
28 Dec 2007
TL;DR: In this article, a logging system includes an event receiver and a storage manager, where the receiver receives log data, processes it, and outputs a data chunk and stores them so that they can be queried.
Abstract: A logging system includes an event receiver and a storage manager. The receiver receives log data, processes it, and outputs a data 'chunk.' The manager receives data chunks and stores them so that they can be queried. The receiver includes buffers that store events and a metadata structure that stores metadata about the contents of the buffers. The metadata includes a unique identifier associated with the receiver, the number of events in the buffers, and, for each 'field of interest,' a minimum value and a maximum value that reflect the range of values of that field over all of the events in the buffers. A chunk includes the metadata structure and a compressed version of the contents of the buffers. The metadata structure acts as a search index when querying event data. The logging system can be used in conjunction with a security information/event management (SIEM) system.

Journal Article
TL;DR: A complete and automatic mapping of the whole MPEG-7 standard to OWL has been set up based on XML2OWL mapping and XML2RDF mapping to move multimedia metadata to semantic web.

Patent
18 Jan 2007
TL;DR: In this article, file transitions are identified that are to be tracked and at least one element of metadata is generated that characterizes each identified file transition, and the action metadata may follow that history of each tracked file as well as the histories of copies of each traced file over time.
Abstract: File transitions are identified that are to be tracked and at least one element of metadata is generated that characterizes each identified file transition to be tracked. Upon receiving a request for historical transition information, elements of metadata corresponding to at least two instances of a tracked file are aggregated and information is provided responsive to the request that is derived from the aggregated elements of metadata. The action metadata may follow that history of each tracked file as well as the histories of copies of each tracked file over time. Thus, an operator can manage the infrastructure of a corresponding computing environment with knowledge of the current and historical activities of files.

Patent
27 Feb 2007
TL;DR: In this article, a client submitting metadata content can validate the metadata content prior to submission of metadata content and/or associated media content, such as audio, video, image, or podcast data.
Abstract: The disclosed embodiments relate generally to the submission of metadata content and media content to a media distribution system. The media content can include, for example, audio, video, image, or podcast data. In accordance with one embodiment, a client submitting metadata content can validate the metadata content prior to submission of the metadata content and/or associated media content. A media distribution system receiving metadata content can also validate the metadata content.

Journal ArticleDOI
TL;DR: Two photo clustering algorithms for generating meaningful photo groups are introduced: Hierarchical event clustering; and Clothing based person recognition, which assumes that people who wear similar clothing and appear in photos taken in one day are very likely to be the same person.

Patent
26 Dec 2007
TL;DR: In this paper, a metadata management system is described for a business intelligence architecture having a metadata repository for content that defines a user environment of the BIA architecture, which includes a user interface generator to display information regarding a plurality of objects in the metadata repository and to facilitate selection of a group of the plurality.
Abstract: A metadata management system is described for a business intelligence architecture having a metadata repository for content that defines a user environment of the business intelligence architecture. The metadata management system includes a user interface generator to display information regarding a plurality of objects in the metadata repository and to facilitate selection of a group of the plurality of objects, a content editor to evaluate the content stored in the metadata repository for each object of the selected group and to modify in a batch job the content for each object of the selected group for storage of the modified content in the metadata repository, and a communication manager to issue instructions for the storage of the modified content in the metadata repository, the instructions being configured in accordance with a communication protocol of the business intelligence architecture utilized to control the metadata repository.

Patent
24 Jul 2007
TL;DR: In this article, the authors present methods and apparatus for utilizing information (e.g., metadata) relating to content in a multimedia distribution network, where the information comprises metadata relating to the bitrate profile of deterministic content such as stored video.
Abstract: Methods and apparatus for utilizing information (e.g., metadata) relating to content in a multimedia distribution network. In one embodiment, the network comprises a hybrid fiber coax (HFC) cable network, and the information comprises metadata relating to the bitrate profile of deterministic content such as stored video. Content sources, or the network operator themselves, generate the metadata which may then be used by the operator to adjust or optimize the operation of the network; e.g., more efficiently allocate the program to a multiplex. Network apparatus adapted to implement the metadata functionality and related business methods are also disclosed.