scispace - formally typeset
Search or ask a question

Showing papers on "Metadata repository published in 2003"


Proceedings ArticleDOI
05 Apr 2003
TL;DR: An alternative based on enabling users to navigate along conceptual dimensions that describe the images is presented, which makes use of hierarchical faceted metadata and dynamically generated query previews.
Abstract: There are currently two dominant interface types for searching and browsing large image collections: keyword-based search, and searching by overall similarity to sample images. We present an alternative based on enabling users to navigate along conceptual dimensions that describe the images. The interface makes use of hierarchical faceted metadata and dynamically generated query previews. A usability study, in which 32 art history students explored a collection of 35,000 fine arts images, compares this approach to a standard image search interface. Despite the unfamiliarity and power of the interface (attributes that often lead to rejection of new search interfaces), the study results show that 90% of the participants preferred the metadata approach overall, 97% said that it helped them learn more about the collection, 75% found it more flexible, and 72% found it easier to use than a standard baseline system. These results indicate that a category-based approach is a successful way to provide access to image collections.

1,074 citations


Patent
10 Dec 2003
TL;DR: In this paper, a distributed data storage system for sharing data among client computers running different types of operating systems by separating metadata from data is presented. But the client computers communicate with the metadata servers using a Storage Tank protocol and over a control network.
Abstract: A distributed data storage system for sharing data among client computers running different types of operating systems by separating metadata from data. Data is stored in storage pools that are accessed by the client computers through a storage network. Metadata is stored in a metadata store and provided to the client computers by a cluster of metadata servers. The client computers communicate with the metadata servers using a Storage Tank protocol and over a control network. Each client computer runs an operating system-specific client program that provides the client side functions of the Storage Tank protocol. The client program preferably includes a file system interface for communicating with the file system in the storage system and user applications, a client state manager for providing data consistency, and a plurality of operating system services for communicating with the metadata servers.

976 citations


Patent
04 Feb 2003
TL;DR: In this paper, a processing system for retrieving interrelated documents is described, which comprises a document repository (130) for storing a plurality of documents, a metadata repository (140, 145), and a sociological analysis engine (150) to identify relationships between the documents using the metadata elements from the metadata repository.
Abstract: A processing system for retrieving interrelated documents is described. The system comprises a document repository (130) for storing a plurality of documents, a metadata repository (140, 145) for storing a plurality of metadata elements to represent relations between the documents, and a sociological analysis engine (150) to identify relationships between the documents using the metadata elements from the metadata repository (140, 145).

752 citations


Patent
16 Jul 2003
TL;DR: A computer data processing system including a central processing unit configured with a novel integrated computer control software system for the management of data objects including dynamic and automatic organization, linking, finding, cross-referencing, viewing and retrieval of multiple objects regardless of nature or source.
Abstract: A computer data processing system including a central processing unit configured with a novel integrated computer control software system for the management of data objects including dynamic and automatic organization, linking, finding, cross-referencing, viewing and retrieval of multiple objects regardless of nature or source. The inventive system provides underlying component architecture having an object-oriented database structure and a metadata database structure which is unique in storing only one instance of each object while linking the object to multiple collections and domains by unique metadata links for the grouping into and retrieval from any of the collections. The system employs configurable, extensible attribute/properties of data objects in metadata format, and a truly user-friendly configurable interface that facilitates faster, more unified, comprehensive, useful and meaningful information management. Additional features include a sticky path object hierarchy viewing system, key phrase linking, viewing by reference, and drag-and-drop relationship link creation.

667 citations


Proceedings ArticleDOI
15 Nov 2003
TL;DR: The Metadata Catalog Service (MCS) as mentioned in this paper provides a mechanism for storing and accessing descriptive metadata and allows users to query for data items based on desired attributes, such as attributes.
Abstract: Advances in computational, storage and network technologies as well as middle ware such as the Globus Toolkit allow scientists to expand the sophistication and scope of data-intensive applications. These applications produce and analyze terabytes and petabytes of data that are distributed in millions of files or objects. To manage these large data sets efficiently, metadata or descriptive information about the data needs to be managed. There are various types of metadata, and it is likely that a range of metadata services will exist in Grid environments that are specialized for particular types of metadata cataloguing and discovery. In this paper, we present the design of a Metadata Catalog Service (MCS) that provides a mechanism for storing and accessing descriptive metadata and allows users to query for data items based on desired attributes. We describe our experience in using the MCS with several applications and present a scalability study of the service.

258 citations


Patent
29 Aug 2003
TL;DR: In this paper, the authors propose a data mapping architecture for mapping between two or more data sources without modifying the metadata or structure of the data sources themselves, and also support updates.
Abstract: A data mapping architecture for mapping between two or more data sources without modifying the metadata or structure of the data sources themselves. Data mapping also supports updates. The architecture also supports at least the case where data sources that are being mapped, are given, their schemas predefined, and cannot be changed. The architecture includes a mapping component that receives respective metadata from at least two arbitrary data models, and maps expressions between the data models.

234 citations


Patent
09 Apr 2003
TL;DR: In this article, a system, method, and program for improving the performance for SQL queries is presented, where multidimensional metadata associated with a cube model metadata object is obtained.
Abstract: Disclosed is a system, method, and program for improving the performance for SQL queries. Multidimensional metadata associated with a cube model metadata object is obtained. One or more summary tables to be built are automatically identified based on the obtained multidimensional metadata. One or more indexes to create are automatically identified based on the obtained multidimensional metadata.

210 citations


Patent
Kevin G. Currans1
17 Nov 2003
TL;DR: In this paper, a system and method for correlating an image with information associated with the image comprising identifying image metadata for the image, wherein the image metadata includes information associating with conditions at the time of image capture, searching one or more information sources using parameters in the imagemetadata to collect inference information from the information sources, and displaying the inference information to a user.
Abstract: The present invention is directed to a system and method for correlating an image with information associated with the image comprising identifying image metadata for the image, wherein the image metadata includes information associated with conditions at the time of image capture, searching one or more information sources using parameters in the image metadata to collect inference information from the information sources, and displaying the inference information to a user.

208 citations


Patent
19 Dec 2003
TL;DR: In this article, a query of search criteria formulated via a graphical user interface is applied to a search filter to create a search folder, which stores links to matching new or changed data items.
Abstract: Providing persisting search folders within a computer that continuously identify data items having metadata matching a query of search criteria. A query of search criteria formulated via a graphical user interface is applied to a search filter to create a search folder. When the search folder is made live, the search filter is used to search one or more data stores for data items having metadata matching the query of search criteria. Upon finding these data items, the search folder is populated with a link to each data item having metadata matching the query of search criteria. The search folder detects when any new data items are added to a data store and when a change occurs to any metadata of data items previously stored in a data store. Upon detection, the search folders store links to matching new or changed data items.

181 citations


Patent
29 Aug 2003
TL;DR: In this paper, the authors propose to assign metadata to each image file and categorize each image according to one or more metadata schemes, such as image date, image subject, and image location.
Abstract: Data for electronic images is stored in a server. Metadata is assigned to each image file and categorizes each image according to one or more schemes. Possible metadata schemes include image date, one or more image subjects, and image location. The image files may then be searched based on the assigned metadata. Images may be stored in a database that includes at least one virtual folder corresponding to each metadata scheme, with each image having at least one entry in each folder. Each folder may further have subfolders that correspond to sub-categories of a categorization scheme. Each image may then have an entry in each subfolder which describes a part of the image metadata. A date search interface allows a user to select a year of interest, then a month, and then a day. A location search interface allows a user to select a subregion of a displayed region.

160 citations


Journal ArticleDOI
TL;DR: A collection of de definitions and observations about data and the metadata that make them useful are adopted: differentiating between "semantic" and "syntactic" metadata, and deining categories such as "translational and "use" metadata.
Abstract: In the process of implementing a protocol for the transport of science data, the Open Source Project for a Network Data Access Protocol (OPeNDAP) group has learned a considerable amount about the internal anatomy of what are commonly considered monolithic concepts. In order to communicate among our group, we have adopted a collection of deinitions and observations about data and the metadata that make them useful: differentiating between "semantic" and "syntactic" metadata, and deining categories such as "translational" and "use" metadata. We share the deinitions and categorizations here in the hope that others will ind them as useful as we do.

Patent
01 Aug 2003
TL;DR: A system and method for electronic file management includes an object-oriented file management database, a volume manager, and a coherency manager as mentioned in this paper, and a user interface facilitates user interaction with the file management system.
Abstract: A system and method for electronic file management includes an object-oriented file management database, a volume manager, and a coherency manager. The volume manager manages electronic files and metadata relating to the files of one or more volumes. Each volume may include folders, files, and/or other digital content. A user interface facilitates user interaction with the file management system. The user interface enables a user to view and manage, within the file management system, metadata associated with the electronic files by graphically displaying information about the files and the metadata and enabling the user to manipulate the files and the metadata.

Patent
13 Jan 2003
TL;DR: In this paper, a system, method, and program for specifying multidimensional calculations is presented, in which a statement is generated for retrieving multi-dimensional information using metadata in the cube model metadata object and the measure metadata objects, wherein each of the metadata objects specifies one or more aggregations.
Abstract: Provided is a system, method, and program for specifying multidimensional calculations. Selection of a subset of a cube model metadata object that is generated from a facts metadata object and one or more dimension metadata objects is received. The facts metadata object references one or more measure metadata objects. A statement is generated for retrieving multidimensional information using metadata in the cube model metadata object and the measure metadata objects, wherein each of the measure metadata objects specifies one or more aggregations.

Patent
26 Jun 2003
TL;DR: In this paper, the authors proposed a method to extract metadata pertaining to real data stored in at least one database (DB) from a single meta DB server, and metadata that match a retrieval request are extracted by search of the metaDB server.
Abstract: Metadata pertaining to real data stored in at least one database (DB) are collected and managed at a single meta DB server, and metadata that match a retrieval request are extracted by search of the meta DB server. Even when a plurality of DBs and DB servers for managing DBs are present on a network, all metadata that match the retrieval request can be extracted independently of the DBs that the metadata pertain to. Hence, all data that match a retrieval request can be obtained from a single server, independently of the actual locations of the distributed DBs and DB servers.

Patent
29 Dec 2003
TL;DR: In this paper, an agent monitors communications between the machines of the computer network for communications relevant to the command object, and modifies the agent's command object by adding network address information of additional machines to maintain coherency of the metadata registry and local copies thereof.
Abstract: A computer network has several machines, including machines having storage systems, and communication resources for communicating between the machines. A metadata registry having information about data stored on the network is distributed on two or more machines of the network, and local copies of part of this metadata registry reside on a compute machine of the network. The metadata registry has a command object that comprises network address information about at least some of the machines of the computer network that participate in a first communication. An agent monitors communications between the machines of the computer network for communications relevant to the command object, the agent modifies the command object by adding network address information of additional machines of the computer network that should participate in the first communication between said machines to maintain coherency of the metadata registry and local copies thereof.

Patent
27 Oct 2003
TL;DR: In this article, the authors describe techniques for the acquisition of physiological data from a medical device, and manipulation and storage of the physiological data in a format that can allow the data to easily be shared between systems and viewed using different applications.
Abstract: Techniques are described for acquisition of physiological data from a medical device, and manipulation and storage of the physiological data in a format that can allows the data to easily be shared between systems and viewed using different applications. Moreover, the techniques provide for the aggregation of the physiological data acquired from a medical device over multiple telemetry sessions. A system, for example, includes a plurality of medical device programmers to collect physiology data via telemetry sessions with medical devices, and a programmer gateway to receive the session data from the medical device programmers and to process the session data for aggregation in one or more data stores. The session data may include metadata that conforms to a data description language, such as XML, and the programmer gateway may execute a translation engine to store portions of the session data within respective data stores based on the metadata.

Patent
24 Oct 2003
TL;DR: In this paper, a method and system of providing easy configuration of asset metadata for each type of asset in a Web-based asset management is disclosed, which comprises of one XML schema file for each asset type defined in the system, one XSL asset creation process stylesheet file that can be used with any XML schemas file, and a web-based application that uses the XML scheme file and XSL SA styleheets file to display a form for the user to enter the metadata for a new asset, and to validate the data entry against the schema.
Abstract: A method and system of providing easy configuration of asset metadata for each type of asset in a Web-based asset management is disclosed. The method and system comprises of one XML schema file for each asset type defined in the system, one XSL asset creation process stylesheet file that can be used with any XML schema file, one XSL asset metadata layout stylesheet for each asset type defined in the system, a Web-based application that uses the XML schema file and XSL asset creation process stylesheet file to display a form for the user to enter the metadata for a new asset, and to validate the data entry against the schema, and a XML database to store XML asset attribute files (one for each unique asset). Additionally, the method and system makes it possible for viewable files be linked to an asset metadata field with a unique XML asset attribute file, such as to facilitate previewing. The method and system further includes how metadata searching can be performed using the provided XML schema and XSL asset metadata layout stylesheets. Accordingly, a system and method in accordance with the present invention helps minimize development effort to handle custom asset metadata for different asset types and to allow different metadata fields to be searchable with various criteria for different asset types. It is emphasized that this abstract is provided to comply with the rules requiring an abstract which will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.

Patent
14 Mar 2003
TL;DR: In this paper, a policy-based data management system, method, and apparatus are configured to operate over a distributed storage system such as a storage area network (SAN), where files to be stored on the network are each assigned a service class and a storage pool based on the application of policies to file attributes such as file name, type, user, etc.
Abstract: A policy-based data management system, method, and apparatus are disclosed. The system, method, and apparatus are configured to operate over a distributed storage system such as a storage area network (SAN). Files to be stored on the network are each assigned a service class and a storage pool based on the application of policies to file attributes such as file name, type, user, etc. The service class and storage pool designations are stored as metadata. Files may be retrieved using the metadata to identify the storage pool where the file is stored, and the service class listed within the metadata may be used to control the manner in which the file is handled. A metadata server may be utilized to provide the appropriate service class of files in response to requests from remote clients that may be of different computing platforms.

Patent
24 Mar 2003
TL;DR: In this article, a system and method for user modification of metadata in a shell browser is presented, where a group of items and associated metadata values are displayed in a window of the shell browser.
Abstract: A system and method for user modification of metadata in a shell browser. A group of items and associated metadata values are displayed in a window of the shell browser. An edit control permits user modification of metadata values displayed in the window. The user can modify metadata associated with a welcome pane, a selected item, or multiple selected items. A data structure stored on one or more computer-readable media contains metadata associated with items displayed in a shell browser, including user modifiable metadata which is also displayed in the shell browser.

Patent
17 Dec 2003
TL;DR: In this article, a system and method for providing metadata interaction and visualization with task-related objects is described, where a plurality of task-oriented items includes a storage component specifying metadata relating to at least one of planning, executing or completing a task.
Abstract: A system and method for providing metadata interaction and visualization with task-related objects is described. A plurality of task-oriented items is defined. Each task-oriented item includes a storage component specifying metadata relating to at least one of planning, executing or completing a task. A visualization is provided to tie in the task-oriented items and associate modeling logic operating on at least one such task-oriented item. The visualization is displayed by highlighting at least one of interdependencies and conflicts between the metadata of the task-oriented items.

Patent
04 Sep 2003
TL;DR: In this article, a multi-pass approach was used to identify the standardized metadata associated with the input metadata, and the extracted tokens represented a portion of the input data set, by creating a token group comprising a plurality of selected tokens.
Abstract: Providing standardized metadata associated with media content responsive to input metadata. The invention extracts one or more tokens from the input metadata. Each of the extracted tokens represents a portion of the input metadata. The invention creates a token group comprising a plurality of selected tokens. The invention searches the database of standardized metadata using a multi-pass approach using the token group and the extracted tokens to identify the standardized metadata associated with the input metadata.

Patent
19 Dec 2003
TL;DR: Disclosed as discussed by the authors is a system for authoring metadata that describe multimedia contents, where a storage device loads information on a currently edited metadata document, and a metadata editor visualizes the loaded metadata document according to a predetermined method, and allows a user to edit the metadata document.
Abstract: Disclosed is a system for authoring metadata that describe multimedia contents. A storage device loads information on a currently edited metadata document so as to describe multimedia contents, and a metadata editor visualizes the loaded metadata document according to a predetermined method, and allows a user to edit the metadata document. A multimedia access reproducer accesses the input multimedia contents to reproduce corresponding multimedia contents, and an inter-media metadata interface links the multimedia access reproducer and the metadata editor to browse contents and effectively edit information relating to a specific interval of multimedia contents. A metadata output device outputs information on the loaded metadata document according to a predefined format.

Patent
15 Dec 2003
TL;DR: In this paper, file-sharing queries need only be performed by the metadata repository receiving the query, and not by all associated metadata repositories, so that all metadata repositories may generate similar search results.
Abstract: An Internet-scale file sharing system includes a client-side file sharing application that allows file-sharing users to identify files to share and transmit metadata corresponding to those files to a metadata repository. A server-side application operating on the metadata repository tracks metadata received from associated file-sharing users, as well as metadata from other affiliated metadata repositories. Each metadata repository acts as a search engine for any querying users and can provide search results based on locally stored metadata alone. Each metadata repository may additionally choose to locally-store popular files from an associated file-sharing user so as to alleviate transmission burdens on that file-sharing user. Associated metadata repositories each periodically synchronize their stored metadata so that all metadata repositories may generate similar search results. In such manner, file-sharing queries need only be performed by the metadata repository receiving the query, and not by all associated metadata repositories.

Journal ArticleDOI
TL;DR: This work provides a framework, CREAM, that allows for creation of metadata, i.e., metadata that instantiate interrelated definitions of classes in a domain ontology rather than a comparatively rigid template-like schema such as Dublin Core.

Journal ArticleDOI
TL;DR: Two key technologies for generating metadata about content - automatic categorization and information extraction are explained, and the applications that metadata makes possible, can transform an organization's reservoir of unstructured content into a well-organized repository of knowledge.
Abstract: There's content everywhere, but not the information you need. Content analysis can organize a pile of text into a richly accessible repository. This article explains two key technologies for generating metadata about content - automatic categorization and information extraction. These technologies, and the applications that metadata makes possible, can transform an organization's reservoir of unstructured content into a well-organized repository of knowledge. With metadata available, a company's search system can move beyond simple dialogs to richer means of access that work in more situations. Information visualization, for example, uses metadata and our innate visual abilities to improve access. Besides better access, metadata enables intelligent switching in the content flows of various organizational processes - for example, making it possible to automatically route the right information to the right person. A third class of metadata applications involves mining text to extract features for analysis using the statistical approaches typically applied to structured data. For example, if you turn the text fields in a survey into data, you can then analyze the text along with other data fields. All these metadata-powered applications can improve your company's use of its information resources.

Patent
Yang-lim Choi1
09 Apr 2003
TL;DR: In this article, the authors present a metadata management system that manages metadata in a metadata transmission server by generating a plurality of metadata fragment data by partitioning metadata to be transmitted based upon predetermined segment units.
Abstract: Managing metadata in a metadata transmission server by generating a plurality of metadata fragment data by partitioning metadata to be transmitted based upon predetermined segment units, selecting predetermined metadata fragment data from among the plurality of the metadata fragment data, generating metadata-related authentication information using the selected metadata fragment data, and transmitting the selected metadata fragment data and the metadata-related authentication information including data format information indicating type of the selected metadata fragment data. A metadata receiving client uses the transmitted metadata fragment data, the metadata-related authentication information and the metadata format type information to authenticate the received metadata.

Proceedings ArticleDOI
16 Jul 2003
TL;DR: A tool for generating visualizations of ontologies and metadata by using a modified spring embedder to achieve an automatic layout and shows that interesting information about the data relationships can be extracted through the visualization of the physical graph structure.
Abstract: Implicit information embedded in semantic Web graphs, such as topography, clusters, and disconnected subgraphs is difficult to extract from text files. Visualizations of the graphs can reveal some of these features, but existing systems for visualizing metadata focus on aspects other than understanding the greater structure. We present a tool for generating visualizations of ontologies and metadata by using a modified spring embedder to achieve an automatic layout. Through a case study using a mid-sized ontology, we show that interesting information about the data relationships can be extracted through our visualization of the physical graph structure.

Patent
07 Aug 2003
TL;DR: In this article, the authors present a portal for interactively viewing enterprise metadata, including a memory for storing a data structure in the form of a graph, with nodes representing asset metadata for enterprise data assets and edges representing relationships between asset metadata.
Abstract: A portal for interactively viewing enterprise metadata, including a memory for storing a data structure in the form of a graph, with nodes representing asset metadata for enterprise data assets and edges representing relationships between asset metadata, a path finder for generating at least one path within the graph satisfying prescribed constraints, and a report generator for generating a report about the graph, based on paths generated by the path finder. A method and computer readable storage medium are also described and claimed.

Patent
18 Jul 2003
TL;DR: In this paper, the authors describe a system that provides the user with integrated, contemporaneous property data directly related to the media file being played, providing automatic, integrated access to data from multiple databases, simply by accessing a related media file through a media player.
Abstract: Methods, computer readable mediums and systems provide media player users with a full contextual metadata experience. Metadata include multiple forms of property data, or information, relating to media accessed by a media player, such as a CD in a CD-ROM drive of a computer. Metadata is transferred from a server to a client. Identification parameters associated with the accessed media file are submitted by the client to a server, and property data is retrieved and forwarded to the client. The metadata provides the user with integrated, contemporaneous property data directly related to the media file being played, providing automatic, integrated access to data from multiple databases, simply by accessing a related media file through a media player, without further direction from the user.

Patent
Hyoseop Shin1
16 Jul 2003
TL;DR: In this paper, an index structure of metadata is provided for searching for information on contents and a method for providing indices of the metadata, and an apparatus for searching the metadata using the index structure.
Abstract: An index structure of metadata provided for searching for information on contents and a method for providing indices of the metadata, and a method and an apparatus for searching for the metadata using the index structure of the metadata are provided, in which the index structure of the metadata includes values of multi-keys and identification information of the metadata corresponding to the value of the multi-key, wherein the multi-keys are structured by combination of predetermined fields of the metadata.