scispace - formally typeset
Search or ask a question
Topic

Meta Data Services

About: Meta Data Services is a research topic. Over the lifetime, 2564 publications have been published within this topic receiving 40102 citations.


Papers
More filters
Proceedings ArticleDOI
01 Nov 2012
TL;DR: The design and implementation of DMooseFS is presented, a system with simple and efficient distributed metadata management based on the well-known distributed file system MooseFS, and experiment results show that the system is effective and efficient, compared with the original version of MooseFS.
Abstract: Distributed file system is one of the key blocks of data centers. With the fast increase of user scale, metadata management in distributed file system should also be distributed to multiple nodes so as to achieve high scalability and reliability. However, existing works usually focus on the design of distributed metadata management approach, and very few efforts have been put on real implementation. In this paper, we present the design and implementation of DMooseFS, a system with simple and efficient distributed metadata management. Our work is based on the well-known distributed file system MooseFS. We extend it by introducing multiple metadata servers (MDS). The metadata is divided into non-overlapping parts using static subtree partitioning, and each MDS is assigned with one of the parts according to a hash algorithm. Such a hybrid design is efficient because it can avoid the drawbacks of static subtree partitioning and/or hash-based approaches. We implement DMooseFS by modifying existing implementation of MooseFS and test it in real deployment. Technical issues in implementation are also found and addressed. Experiment results from real world deployment show that our system is effective and efficient, compared with the original version of MooseFS.

10 citations

Journal ArticleDOI
TL;DR: The authors' work focuses on how to understand and model metadata requirements to support the work of end users of an integrative statistical knowledge network (SKN) and provides a set of strategies by which the results of user studies can be systematically utilized to support that design.
Abstract: Metadata and an appropriate metadata model are non-trivial components of information architecture conceptualization and implementation, particularly when disparate and dispersed systems are integrated. Metadata availability can enhance retrieval processes, improve information organization and navigation, and support management of digital objects. To support these activities efficiently, metadata need to be modeled appropriately for the tasks. The authors' work focuses on how to understand and model metadata requirements to support the work of end users of an integrative statistical knowledge network (SKN). They report on a series of user studies. These studies provide an understanding of metadata elements necessary for a variety of user-oriented tasks, related business rules associated with the use of these elements, and their relationship to other perspectives on metadata model development. This work demonstrates the importance of the user perspective in this type of design activity and provides a set of strategies by which the results of user studies can be systematically utilized to support that design.

10 citations

13 Apr 2003
TL;DR: XFML Core is an open XML format for publishing and sharing hierarchical faceted metadata and indexing efforts that is lightweight and easy to implement, yet uniquely powerful.
Abstract: XFML Core is an open XML format for publishing and sharing hierarchical faceted metadata and indexing efforts. XFML Core is lightweight and easy to implement, yet uniquely powerful.

10 citations

Patent
07 Oct 2010
TL;DR: In this paper, a computer implemented method and system provide for automatic selection and extraction of metadata and media content from projects in a craft tool using including techniques such as pattern recognition for audio and visual content.
Abstract: A computer implemented method and system provide for automatic selection and extraction of metadata and media content from projects in a craft tool. Automated identification, classification and management of such metadata and content is provided using including techniques such as pattern recognition for audio and visual content. The automatic tracking and centralised storage of metadata and content for compliance purposes can be facilitated, and can enable querying of organised metadata stored in a central database. In an example, metadata and media content are extracted automatically from a project in a craft tool at a client system and are forwarded to a host system for the creation of a cue sheet including timings for media files from timing metadata in a project file to create the timings on the cue sheet.

10 citations

Proceedings ArticleDOI
13 Jun 2012
TL;DR: This paper compares style and content features on existing state-of-the-art methods on two newly created real-world data sets for metadata extraction and shows that two-stage SVMs provide reasonable performance in solving the challenge of metadata extraction for crowdsourcing bibliographic metadata management.
Abstract: Social research networks such as Mendeley and CiteULike offer various services for collaboratively managing bibliographic metadata. Compared with traditional libraries, metadata quality is of crucial importance in order to create a crowdsourced bibliographic catalog for search and browsing. Artifacts, in particular PDFs which are managed by the users of the social research networks, become one important metadata source and the starting point for creating a homogeneous, high quality, bibliographic catalog. Natural Language Processing and Information Extraction techniques have been employed to extract structured information from unstructured sources. However, given highly heterogeneous artifacts that cover a range of publication styles, stemming from different publication sources, and imperfect PDF processing tools, how accurate are metadata extraction methods in such real-world settings? This paper focuses on answering that question by investigating the use of Conditional Random Fields and Support Vector Machines on real-world data gathered from Mendeley and Linked-Data repositories. We compare style and content features on existing state-of-the-art methods on two newly created real-world data sets for metadata extraction. Our analysis shows that two-stage SVMs provide reasonable performance in solving the challenge of metadata extraction for crowdsourcing bibliographic metadata management.

10 citations


Network Information
Related Topics (5)
Web page
50.3K papers, 975.1K citations
83% related
Metadata
43.9K papers, 642.7K citations
82% related
Web service
57.6K papers, 989K citations
80% related
Ontology (information science)
57K papers, 869.1K citations
78% related
User interface
85.4K papers, 1.7M citations
76% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202313
202261
20212
20202
20196
20188