scispace - formally typeset
Search or ask a question
Topic

Knowledge extraction

About: Knowledge extraction is a research topic. Over the lifetime, 20251 publications have been published within this topic receiving 413401 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A knowledge acquisition and representation approach using the fuzzy evidential reasoning approach and dynamic adaptive FPNs to solve the problems of domain experts' diversity experience and reason the rule-based knowledge more intelligently is presented.
Abstract: The two most important issues of expert systems are the acquisition of domain experts' professional knowledge and the representation and reasoning of the knowledge rules that have been identified. First, during expert knowledge acquisition processes, the domain expert panel often demonstrates different experience and knowledge from one another and produces different types of knowledge information such as complete and incomplete, precise and imprecise, and known and unknown because of its cross-functional and multidisciplinary nature. Second, as a promising tool for knowledge representation and reasoning, fuzzy Petri nets (FPNs) still suffer a couple of deficiencies. The parameters in current FPN models could not accurately represent the increasingly complex knowledge-based systems, and the rules in most existing knowledge inference frameworks could not be dynamically adjustable according to propositions' variation as human cognition and thinking. In this paper, we present a knowledge acquisition and representation approach using the fuzzy evidential reasoning approach and dynamic adaptive FPNs to solve the problems mentioned above. As is illustrated by the numerical example, the proposed approach can well capture experts' diversity experience, enhance the knowledge representation power, and reason the rule-based knowledge more intelligently.

112 citations

15 Jun 2001
TL;DR: The concept of a knowledge-rich context, its major types and its components, and a methodology for developing extraction tools that is based on lexical, grammatical and paralinguistic patterns are defined.
Abstract: Knowledge-rich contexts express conceptual information for a term. Terminographers need such contexts to construct definitions, and to acquire domain knowledge. This paper summarizes what we have learned about extracting knowledge-rich contexts semi-automatically. First, we define the concept of a knowledge-rich context, its major types and its components. Second, we describe a methodology for developing extraction tools that is based on lexical, grammatical and paralinguistic patterns. Third, we outline the most problematic research issues that must be addressed before semi-automatic knowledge extraction can become a fully mature field.

112 citations

Journal ArticleDOI
TL;DR: An LS approach to generate IF-THEN rules for causal databases is proposed and both type-1 and interval type-2 fuzzy sets are considered, and the degree of reliability is especially valuable for finding the most reliable and representative rules.
Abstract: Linguistic summarization (LS) is a data mining or knowledge discovery approach to extract patterns from databases. Many authors have used this technique to generate summaries like “Most senior workers have high salary,” which can be used to better understand and communicate about data; however, few of them have used it to generate IF-THEN rules like “IF X is large and Y is medium, THEN Z is small,” which not only facilitate understanding and communication of data but can also be used in decision-making. In this paper, an LS approach to generate IF-THEN rules for causal databases is proposed. Both type-1 and interval type-2 fuzzy sets are considered. Five quality measures-the degrees of truth, sufficient coverage, reliability, outlier, and simplicity-are defined. Among them, the degree of reliability is especially valuable for finding the most reliable and representative rules, and the degree of outlier can be used to identify outlier rules and data for close-up investigation. An improved parallel coordinates approach for visualizing the IF-THEN rules is also proposed. Experiments on two datasets demonstrate our LS and rule visualization approaches. Finally, the relationships between our LS approach and the Wang-Mendel (WM) method, perceptual reasoning, and granular computing are pointed out.

112 citations

Book
20 Jan 2010
TL;DR: Domain Driven Data Mining enhances the actionability and wider deployment of existing data-centered data mining through a combination of domain and business oriented factors, constraints and intelligence.
Abstract: In the present thriving global economy a need has evolved for complex data analysis to enhance an organizations production systems, decision-making tactics, and performance. In turn, data mining has emerged as one of the most active areas in information technologies. Domain Driven Data Mining offers state-of the-art research and development outcomes on methodologies, techniques, approaches and successful applications in domain driven, actionable knowledge discovery. About this book: Enhances the actionability and wider deployment of existing data-centered data mining through a combination of domain and business oriented factors, constraints and intelligence. Examines real-world challenges to and complexities of the current KDD methodologies and techniques. Details a paradigm shift from "data-centered pattern mining" to "domain driven actionable knowledge discovery" for next-generation KDD research and applications. Bridges the gap between business expectations and research output through detailed exploration of the findings, thoughts and lessons learned in conducting several large-scale, real-world data mining business applications Includes techniques, methodologies and case studies in real-life enterprise data mining Addresses new areas such as blog mining Domain Driven Data Mining is suitable for researchers, practitioners and university students in the areas of data mining and knowledge discovery, knowledge engineering, human-computer interaction, artificial intelligence, intelligent information processing, decision support systems, knowledge management, and KDD project management.

112 citations

Patent
05 Jul 2005
TL;DR: In this paper, a system and method of making unstructured data available to structured data analysis tools is presented, which includes middleware software that can be used in combination with structured data tools to perform analysis on both structured and unstructural data.
Abstract: A system and method of making unstructured data available to structured data analysis tools. The system includes middleware software that can be used in combination with structured data tools to perform analysis on both structured and unstructured data. Data can be read from a wide variety of unstructured sources. The data may then be transformed with commercial data transformation products that may, for example, extract individual pieces of data and determine relationships between the extracted data. The transformed data and relationships may then be passed through an extraction/transform/load (ETL) layer and placed in a structured schema. The structured schema may then be made available to commercial or proprietary structured data analysis tools.

111 citations


Network Information
Related Topics (5)
Cluster analysis
146.5K papers, 2.9M citations
90% related
Support vector machine
73.6K papers, 1.7M citations
90% related
Artificial neural network
207K papers, 4.5M citations
87% related
Fuzzy logic
151.2K papers, 2.3M citations
86% related
Feature extraction
111.8K papers, 2.1M citations
86% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023120
2022285
2021506
2020660
2019740
2018683