Open Access
Multilingual Content Extraction Extended with Background Knowledge for Military Intelligence
TLDR
The combined deep and shallow parsing approach with Head-driven Phrase Structured Grammars, the inference process is introduced and it is shown how background knowledge is integrated into the logical inferences to increase the extent, quality, and accuracy of the content extraction.Abstract:
: Written information for military purposes is available in abundance. Documents are written in many languages. The question is how we can automate the content extraction of these documents. One possible approach is based on shallow parsing (information extraction) with application specific combination of analysis results. One example of this, the ZENON research system, does a partial content analysis of some English, Dari, and Tajik texts. Another principal approach for content extraction is based on a combination of deep and shallow parsing with logical inferences on the analysis results. In the project "Multilingual content analysis with semantic inference on military relevant texts" (mIE) we followed the second approach. In this paper, we present the results of the mIE project. First, we briefly contrast the ZENON project to the mIE project. In the main part of the paper, the mIE project is presented. After explaining the combined deep and shallow parsing approach with Head-driven Phrase Structured Grammars, the inference process is introduced. Then we show how background knowledge (WordNet, YAGO) is integrated into the logical inferences to increase the extent, quality, and accuracy of the content extraction. The prototype also is presented. The presentation includes briefing charts.read more
Citations
More filters
Proceedings Article
NLP as an essential ingredient of effective OSINT frameworks
TL;DR: This work has conceptualized an analysis framework with a strong focus on various techniques of natural language processing to aggregate, manipulate, and analyze intelligence information.
Proceedings Article
Automatic exploitation of multilingual information for military intelligence purposes
Sandra Noubours,Matthias Hecking +1 more
TL;DR: It is argued that multilingual NLP technology can strongly support military operations.
References
More filters
Proceedings ArticleDOI
An Introduction to the Syntax and Content of Cyc
TL;DR: Spring Symposium on Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering, Stanford, CA, March 2006.
Proceedings ArticleDOI
Linguistically Motivated Large-Scale NLP with C&C and Boxer
TL;DR: An NLP system which is based on syntactic and semantic formalisms from theoretical linguistics, and which is used to analyse the entire Gigaword corpus in less than 5 days using only 18 processors, represents a break-through in NLP technology.
Proceedings ArticleDOI
Recognising Textual Entailment with Logical Inference
Johan Bos,Katja Markert +1 more
TL;DR: This work incorporates model building, a technique borrowed from automated reasoning, and shows that it is a useful robust method to approximate entailment, and uses machine learning to combine these deep semantic analysis techniques with simple shallow word overlap.
Proceedings ArticleDOI
The grammar matrix: an open-source starter-kit for the rapid development of cross-linguistically consistent broad-coverage precision grammars
TL;DR: The grammar matrix is an open-source starter-kit for the development of broad-coverage HPSGs that facilitates not only quick start-up but also rapid growth towards the wide coverage necessary for robust natural language processing and the precision parses and semantic representations necessary for natural language understanding.
Introduction to Information Extraction Technology
Douglas E. Appelt,David Israel +1 more
TL;DR: An introduction to pinch technology linhoffmarch, an introduction to information extraction itl nist gov, and a gentle introduction to blockchain technology web.
Related Papers (5)
A friendly merger of conceptual expectations and linguistic analysis in a text processing system
P.S. Jacobs,L.F. Rau +1 more