scispace - formally typeset
Patent

Declarative external data source importation, exportation, and metadata reflection utilizing http and hdfs protocols

Reads0
Chats0
TLDR
In this article, the authors describe a data enrichment system that enables declarative external data source importation and exportation, where a user can specify via a user interface input for identifying different data sources from which to obtain input data.
Abstract
Techniques are disclosure for a data enrichment system that enables declarative external data source importation and exportation. A user can specify via a user interface input for identifying different data sources from which to obtain input data. The data enrichment system is configured to import and export various types of sources storing resources such as URL-based resources and HDFS-based resources for high-speed bi-directional metadata and data interchange. Connection metadata (e.g., credentials, access paths, etc.) can be managed by the data enrichment system in a declarative format for managing and visualizing the connection metadata.

read more

Citations
More filters
Patent

Pluggable fault detection tests for data pipelines

TL;DR: In this paper, the authors present methods and systems which allow engineers or administrators to create modular plugins which represent the logic for various fault detection tests that can be performed on data pipelines and shared among different software deployments.
Patent

Systems and methods for importing data from electronic data files

TL;DR: In this paper, the authors described a system for importing data from electronic data files, where the data importation system may apply detector/transformer plugins to the received data files to transform the files for importation into one or more data analysis systems and databases.
Patent

Linked data oriented entity classification method and system

Ge Tao, +1 more
TL;DR: In this paper, a linked data oriented entity classification method and system which is aimed at the problem of entity classification of linked data is presented. And the method and the system is easy to implement and debug, is high in efficiency, is good in accuracy, can be used for performing knowledge management on the linked data, and achieve high-precision classification of the entity.
Patent

Secure deployment of a software package

TL;DR: In this paper, the authors describe techniques for easy and secure deployment of a software package from a server to a customer-controlled computing device using a deployment engine running on a server.
Patent

Data pipeline monitoring

TL;DR: In this article, a method and system for data pipeline monitoring receives an event data object and a current status data object from one or more subsystems of a pipeline, and analyzes the event data objects and the status data objects to determine a first and second validation value.
References
More filters
Proceedings ArticleDOI

Yago: a core of semantic knowledge

TL;DR: YAGO as discussed by the authors is a light-weight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE).
Posted Content

Exploiting Similarities among Languages for Machine Translation

TL;DR: This method can translate missing word and phrase entries by learning language structures based on large monolingual data and mapping between languages from small bilingual data and uses distributed representation of words and learns a linear mapping between vector spaces of languages.
Patent

Methods and apparatus for finding semantic information, such as usage logs, similar to a query using a pattern lattice data space

TL;DR: A pattern lattice data space as a framework for analyzing data, in which both schema-based and statistical analysis are accommodated, is defined in this paper, where ways to manage the size of the lattice structures in the pattern lattices data space are described Utilities to classify or cluster, search (find similar data), or relate data using lattice fragments in the Pattern lattice space are also described Superpattern cone or lattice generation function, which may be used by the classification and clustering functions, is also described In addition, a sub-pattern cone and lattice
Proceedings ArticleDOI

Evaluating similarity measures for emergent semantics of social tagging

TL;DR: An evaluation framework to compare various general folksonomy-based similarity measures, which are derived from several established information-theoretic, statistical, and practical measures and provides an external grounding by user-validated semantic proxies based on WordNet and the Open Directory Project.
Related Papers (5)