Home
/
Topics
/
Knowledge extraction

Topic

Knowledge extraction

About: Knowledge extraction is a research topic. Over the lifetime, 20251 publications have been published within this topic receiving 413401 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1972
1970

Papers

PDF

Open Access

More filters

Patent•

Method and system of knowledge based search engine using text mining

[...]

Hongfeng Yin

12 Feb 2003

TL;DR: In this article, a method of text mining is disclosed for automatically building text knowledge base, which is applied to the web pages downloaded from internet/intranet or other text documents to extract phrases information.

...read moreread less

Abstract: A method of text mining is disclosed for automatically building text knowledge base. First, the text mining is applied to the web pages downloaded from internet/intranet or other text documents to extract phrases information. Then, the phrases are classified using automatic classification method or using existed classification information. In addition, the weights between the phrases are trained by using the text information in the web pages or the documents. A knowledge base system is built using the text mining results. The knowledge base is used to directly provide knowledge for a search. Also, the knowledge base helps search engine refine search results.

...read moreread less

82 citations

Journal Article•DOI•

Schema mapping discovery from data instances

[...]

Georg Gottlob¹, Pierre Senellart²•Institutions (2)

University of Oxford¹, Télécom ParisTech²

08 Feb 2010-Journal of the ACM

TL;DR: A theoretical framework for discovering relationships between two database instances over distinct and unknown schemata is introduced and it is shown that this definition yields “intuitive” results when applied on database instances derived from each other by basic operations.

...read moreread less

Abstract: We introduce a theoretical framework for discovering relationships between two database instances over distinct and unknown schemata. This framework is grounded in the context of data exchange. We formalize the problem of understanding the relationship between two instances as that of obtaining a schema mapping so that a minimum repair of this mapping provides a perfect description of the target instance given the source instance. We show that this definition yields “intuitive” results when applied on database instances derived from each other by basic operations. We study the complexity of decision problems related to this optimality notion in the context of different logical languages and show that, even in very restricted cases, the problem is of high complexity.

...read moreread less

82 citations

Journal Article•DOI•

Data mining using rule extraction from Kohonen self-organising maps

[...]

James Malone¹, Kenneth McGarry¹, Stefan Wermter¹, Chris Bowerman¹•Institutions (1)

University of Sunderland¹

01 Mar 2006-Neural Computing and Applications

TL;DR: This paper presents a technique which can be used to extract propositional IF..THEN type rules from the SOM network’s internal parameters and can provide a human understandable description of the discovered clusters.

...read moreread less

Abstract: The Kohonen self-organising feature map (SOM) has several important properties that can be used within the data mining/knowledge discovery and exploratory data analysis process. A key characteristic of the SOM is its topology preserving ability to map a multi-dimensional input into a two-dimensional form. This feature is used for classification and clustering of data. However, a great deal of effort is still required to interpret the cluster boundaries. In this paper we present a technique which can be used to extract propositional IF..THEN type rules from the SOM network’s internal parameters. Such extracted rules can provide a human understandable description of the discovered clusters.

...read moreread less

82 citations

Proceedings Article•DOI•

From “folklore” to “living design memory”

[...]

Loren Terveen¹, Peter G. Selfridge¹, David Long¹•Institutions (1)

Bell Labs¹

01 May 1993

TL;DR: A tool is built that serves as a living design memory for a large software development organization that delivers knowledge to developers effectively and is embedded in organizational practice to ensure that the knowledge it contains evolves as necessary.

...read moreread less

Abstract: We identify an important type of software design knowledge that we call community specific folklore and show problems with current approaches to managing it. We built a tool that serves as a living design memory for a large software development organization. The tool delivers knowledge to developers effectively and is embedded in organizational practice to ensure that the knowledge it contains evolves as necessary. This work illustrates important lessons in building knowledge management systems, integrating novel technology into organizational practice, and managing research-development partnerships.

...read moreread less

82 citations

Book Chapter•DOI•

Rough Set Analysis of Preference-Ordered Data

[...]

Roman Słowiński¹, Salvatore Greco², Benedetto Matarazzo²•Institutions (2)

Poznań University of Technology¹, University of Catania²

14 Oct 2002-Lecture Notes in Computer Science

TL;DR: The paper proposes a new approach to knowledge discovery from data, taking into account prior knowledge about preference semantics in patterns to be discovered, called Dominance-based Rough Set Approach (DRSA), able to approximate this partition by means of dominance relations.

...read moreread less

Abstract: The paper is devoted to knowledge discovery from data, taking into account prior knowledge about preference semantics in patterns to be discovered. The data concern a set of situations (objects, states, examples) described by a set of attributes (properties, features, characteristics). The attributes are, in general, divided into condition and decision attributes, corresponding to input and output of a situation. The situations are partitioned by decision attributes into decision classes. A pattern discovered from the data has a symbolic form of decision rule or decision tree. In many practical problems, some condition attributes are defined on preference-ordered scales and the decision classes are also preference-ordered. The known methods of knowledge discovery ignore, unfortunately, this preference information, taking thus a risk of drawing wrong patterns. To deal with preference-ordered data we propose to use a new approach called Dominance-based Rough Set Approach (DRSA). Given a set of situations described by at least one condition attribute with preference-ordered scale and partitioned into preference-ordered classes, the new rough set approach is able to approximate this partition by means of dominance relations. The rough approximation of this partition is a starting point for induction of "if..., then..." decision rules. The syntax of these rules is adapted to represent preference orders. The DRSA analyses only facts present in data and possible inconsistencies are identified. It preserves the concept of granular computing, however, the granules are dominance cones in evaluation space, and not bounded sets. It is also concordant with the paradigm of computing with words, as it exploits ordinal, and not necessarily cardinal, character of data.

...read moreread less

82 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
…
183
184
185
186
187
188
189
…
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

20,644

Papers

453,302

Citations

No. of papers in the topic in previous years
Year	Papers
2023	120
2022	285
2021	506
2020	660
2019	740
2018	683

Knowledge extraction

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics