Patent
Declarative external data source importation, exportation, and metadata reflection utilizing http and hdfs protocols
Reads0
Chats0
TLDR
In this article, the authors describe a data enrichment system that enables declarative external data source importation and exportation, where a user can specify via a user interface input for identifying different data sources from which to obtain input data.Abstract:
Techniques are disclosure for a data enrichment system that enables declarative external data source importation and exportation. A user can specify via a user interface input for identifying different data sources from which to obtain input data. The data enrichment system is configured to import and export various types of sources storing resources such as URL-based resources and HDFS-based resources for high-speed bi-directional metadata and data interchange. Connection metadata (e.g., credentials, access paths, etc.) can be managed by the data enrichment system in a declarative format for managing and visualizing the connection metadata.read more
Citations
More filters
Patent
Pluggable fault detection tests for data pipelines
TL;DR: In this paper, the authors present methods and systems which allow engineers or administrators to create modular plugins which represent the logic for various fault detection tests that can be performed on data pipelines and shared among different software deployments.
Patent
Systems and methods for importing data from electronic data files
Stephen Yazicioglu,Christopher Luck,Giardina Robert,Justin Streufert,Timothy Slatcher,Gregory O'Conner,Brandon Marc-Aurele,Olivia Zhu,Howard Schindel,Henry Tung,Lucas Ray,Christopher Leech,Eric Jeney,Stefan Negrus,Jason Lee,Alessandro Mingione,John Mckinstry Doyle,Hunter Pitelka,Ethan Lozano,Joel Ossher,Matthew Fedderly +20 more
TL;DR: In this paper, the authors described a system for importing data from electronic data files, where the data importation system may apply detector/transformer plugins to the received data files to transform the files for importation into one or more data analysis systems and databases.
Patent
Linked data oriented entity classification method and system
Ge Tao,Sui Zhifang +1 more
TL;DR: In this paper, a linked data oriented entity classification method and system which is aimed at the problem of entity classification of linked data is presented. And the method and the system is easy to implement and debug, is high in efficiency, is good in accuracy, can be used for performing knowledge management on the linked data, and achieve high-precision classification of the entity.
Patent
Secure deployment of a software package
TL;DR: In this paper, the authors describe techniques for easy and secure deployment of a software package from a server to a customer-controlled computing device using a deployment engine running on a server.
Patent
Data pipeline monitoring
Jesse Rickard,Peter Maag,Jared Newman,Giulio Mecocci,Harish Subbanarasimhia,Adrian Marius Dumitran,Andrzej Skrodzki,Jonah Scheinerman,Gregory Slonim,Alexandru Antihi +9 more
TL;DR: In this article, a method and system for data pipeline monitoring receives an event data object and a current status data object from one or more subsystems of a pipeline, and analyzes the event data objects and the status data objects to determine a first and second validation value.
References
More filters
Proceedings ArticleDOI
Yago: a core of semantic knowledge
TL;DR: YAGO as discussed by the authors is a light-weight and extensible ontology with high coverage and quality, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE).
Posted Content
Exploiting Similarities among Languages for Machine Translation
TL;DR: This method can translate missing word and phrase entries by learning language structures based on large monolingual data and mapping between languages from small bilingual data and uses distributed representation of words and learns a linear mapping between vector spaces of languages.
Patent
Methods and apparatus for finding semantic information, such as usage logs, similar to a query using a pattern lattice data space
TL;DR: A pattern lattice data space as a framework for analyzing data, in which both schema-based and statistical analysis are accommodated, is defined in this paper, where ways to manage the size of the lattice structures in the pattern lattices data space are described Utilities to classify or cluster, search (find similar data), or relate data using lattice fragments in the Pattern lattice space are also described Superpattern cone or lattice generation function, which may be used by the classification and clustering functions, is also described In addition, a sub-pattern cone and lattice
Proceedings ArticleDOI
Evaluating similarity measures for emergent semantics of social tagging
TL;DR: An evaluation framework to compare various general folksonomy-based similarity measures, which are derived from several established information-theoretic, statistical, and practical measures and provides an external grounding by user-validated semantic proxies based on WordNet and the Open Directory Project.