Topic

Metadata repository

About: Metadata repository is a research topic. Over the lifetime, 5841 publications have been published within this topic receiving 121778 citations.


Papers
Proceedings ArticleDOI
09 Dec 2008
TL;DR: Introduces CABS120k08, a large research data set built from sources such as AOL500k, the Open Directory Project, del.icio.us/Yahoo!, Google and the WWW in general, and uses it to investigate characteristics of web document metadata including length, novelty, diversity, and similarity.
Abstract: In this paper, we study and compare three different but related types of metadata about web documents: social annotations provided by readers of web documents, hyperlink anchor text provided by authors of web documents, and search queries of users trying to find web documents. We introduce a large research data set called CABS120k08 which we have created for this study from a variety of information sources such as AOL500k, the Open Directory Project, del.icio.us/Yahoo!, Google and the WWW in general. We use this data set to investigate several characteristics of said metadata including length, novelty, diversity, and similarity and discuss theoretical and practical implications.

57 citations

Patent
11 Jun 2002
TL;DR: A descriptor stream is formed from pairs of metadata entities and references to the content and is stored separately from the files comprising the multimedia content, so the metadata can be accessed without reparsing the entire stream.
Abstract: Available storage media capacity for personal video recording increases continuously. Metadata can be used to organize the recordings, search for content and access specific recordings. If metadata are embedded within the multimedia content itself, like DVB-specific Service Information, which is multiplexed with the audio and video streams to form an MPEG-2 transport stream, a search based on this metadata would require an inefficient and time-consuming search through all stored multimedia content. According to the invention, metadata information is gathered, analyzed and processed to form metadata entities, which are amended by a reference to the content itself. A descriptor stream is formed from the resulting pairs of metadata entities and references to the content and is stored separately from the files comprising the multimedia content. In this way, for data of an MPEG-2 transport stream, the metadata can be accessed without a need to reparse the entire stream.

57 citations
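
The descriptor-stream idea in the patent abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Descriptor` fields, the file names and the byte offsets are all hypothetical, chosen only to show metadata entities paired with content references and searched without touching the content files.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptor:
    """One metadata entity paired with a reference into the content."""
    title: str         # hypothetical metadata field
    content_file: str  # hypothetical file holding the recording
    byte_offset: int   # hypothetical position of the described segment


def build_descriptor_stream(events):
    """Turn (title, file, offset) tuples into descriptor records."""
    return [Descriptor(t, f, o) for t, f, o in events]


def search(stream, keyword):
    """Search only the descriptor stream; content files are never opened."""
    kw = keyword.lower()
    return [d for d in stream if kw in d.title.lower()]


# Illustrative data, stored apart from the (imaginary) transport stream files.
stream = build_descriptor_stream([
    ("Evening News", "rec_001.ts", 0),
    ("Nature Documentary", "rec_001.ts", 1_880_000),
    ("Late News", "rec_002.ts", 0),
])
hits = search(stream, "news")
print([(d.content_file, d.byte_offset) for d in hits])
```

Each hit carries the reference needed to seek directly into the recording, which is the point of keeping the descriptors outside the multiplexed stream.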

Patent
15 Jun 2001
TL;DR: A JPEG2000 file includes a plurality of boxes containing data suitable to render an image, including a metadata box whose contents describe the image; this descriptive information may be MPEG-7 data compliant with the MPEG-7 specification.
Abstract: A JPEG2000 file includes a plurality of boxes containing data suitable to render an image including a metadata box that includes information within the box describing the content of the image. The information within the metadata box describing content may be MPEG-7 data, which is compliant with the MPEG-7 specification.

57 citations
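
To make the box structure in the abstract above concrete, here is a small Python sketch that walks top-level boxes in a JPEG 2000 family byte stream. It relies on the standard box header layout (a 4-byte big-endian length, a 4-byte type, and an 8-byte extended length when the length field is 1). The patent does not say which box type carries the MPEG-7 data, so using an `xml ` box in the demo, along with the helper names `iter_boxes` and `make_box`, is an assumption made for illustration.

```python
import struct


def iter_boxes(data):
    """Yield (type, payload) for each top-level box in a byte string."""
    pos = 0
    while pos + 8 <= len(data):
        length, box_type = struct.unpack_from(">I4s", data, pos)
        header = 8
        if length == 1:
            # Extended 64-bit length follows the type field.
            if pos + 16 > len(data):
                break
            (length,) = struct.unpack_from(">Q", data, pos + 8)
            header = 16
        elif length == 0:
            # A length of 0 means the box runs to the end of the data.
            length = len(data) - pos
        if length < header or pos + length > len(data):
            break  # malformed or truncated box; stop rather than guess
        yield box_type.decode("latin-1"), data[pos + header:pos + length]
        pos += length


def make_box(box_type, payload):
    """Build a synthetic box for the demo (hypothetical helper)."""
    return struct.pack(">I4s", 8 + len(payload), box_type) + payload


# Synthetic stream: a signature box followed by an XML metadata box.
sample = make_box(b"jP  ", b"\r\n\x87\n") + make_box(b"xml ", b"<Mpeg7/>")
boxes = list(iter_boxes(sample))
metadata = [payload for kind, payload in boxes if kind == "xml "]
print(metadata)
```

Because every box announces its own length, a reader can skip the image data entirely and pull out only the descriptive metadata.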

Proceedings ArticleDOI
16 Jun 2008
TL;DR: Proposes a hybrid approach that merges authoritative metadata generated by traditional cataloguing methods with community annotations and tags, enabling libraries, archives and repositories to leverage community enthusiasm for tagging and annotation, augment their metadata and enhance their discovery services.
Abstract: Collaborative, social tagging and annotation systems have exploded on the Internet as part of the Web 2.0 phenomenon. Systems such as Flickr, Del.icio.us, Technorati, Connotea and LibraryThing provide a community-driven approach to classifying information and resources on the Web, so that they can be browsed, discovered and re-used. Although social tagging sites provide simple, user-relevant tags, there are issues associated with the quality of the metadata and the scalability compared with conventional indexing systems. In this paper we propose a hybrid approach that enables authoritative metadata generated by traditional cataloguing methods to be merged with community annotations and tags. The HarvANA (Harvesting and Aggregating Networked Annotations) system uses a standardized but extensible RDF model for representing the annotations/tags and OAI-PMH to harvest the annotations/tags from distributed community servers. The harvested annotations are aggregated with the authoritative metadata in a centralized metadata store. This streamlined, interoperable, scalable approach enables libraries, archives and repositories to leverage community enthusiasm for tagging and annotation, augment their metadata and enhance their discovery services. This paper describes the HarvANA system and its evaluation through a collaborative testbed with the National Library of Australia using architectural images from PictureAustralia.

56 citations
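
The aggregation step described in the abstract above (harvested annotations merged with authoritative records in one store) might look roughly like the following Python sketch. It is only an illustration: HarvANA itself uses an RDF model and OAI-PMH harvesting, whereas this sketch substitutes plain dictionaries, and the record identifiers, field names and `merge_records` function are hypothetical.

```python
def merge_records(authoritative, annotations):
    """Attach community tags to authoritative records without altering them.

    authoritative: dict mapping a resource id to its catalogue record.
    annotations:   list of dicts with 'target' (resource id) and 'tag'.
    """
    # Copy each record and add a separate field for community input.
    merged = {rid: {**rec, "community_tags": []}
              for rid, rec in authoritative.items()}
    for ann in annotations:
        rec = merged.get(ann["target"])
        if rec is None:
            continue  # annotation points at a resource the store does not hold
        tag = ann["tag"].strip().lower()  # light normalisation (an assumption)
        if tag and tag not in rec["community_tags"]:
            rec["community_tags"].append(tag)
    return merged


# Hypothetical catalogue record and harvested annotations.
catalogue = {"img-1": {"title": "Sydney Opera House", "subject": "Architecture"}}
harvested = [
    {"target": "img-1", "tag": "Harbour"},
    {"target": "img-1", "tag": "harbour "},   # duplicate after normalisation
    {"target": "img-1", "tag": "sails"},
    {"target": "img-9", "tag": "unknown"},    # no matching record
]
store = merge_records(catalogue, harvested)
print(store["img-1"])
```

Keeping community tags in their own field leaves the catalogued values untouched, so a discovery service can weight the two sources differently.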

Patent
21 Mar 2002
TL;DR: A registry service architecture provides dynamic query resolution using a language and semantics common to those used to define a scoped hierarchical structure of relationships between data entities.
Abstract: A registry service architecture provides for dynamic query resolution. The architecture utilises a language and semantics common to those used to define a scoped hierarchical structure of relationships between data entities. The relationships are defined by metadata associated with the entities. The metadata may be held in a separate archive.

56 citations


Network Information
Related Topics (5)
Information system
107.5K papers, 1.8M citations
85% related
User interface
85.4K papers, 1.7M citations
81% related
Software
130.5K papers, 2M citations
80% related
Mobile computing
51.3K papers, 1M citations
80% related
Support vector machine
73.6K papers, 1.7M citations
80% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  32
2022  79
2021  13
2020  11
2019  21
2018  24