Home
/
Authors
/
Krys J. Kochut

Author

Krys J. Kochut

Bio: Krys J. Kochut is an academic researcher from University of Georgia. The author has contributed to research in topics: Workflow & Ontology (information science). The author has an hindex of 29, co-authored 83 publications receiving 3630 citations.

Papers published on a yearly basis

2020
2019
2018
2017
2016
2015
2014
2013
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1993
1992
1991
1990

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Quality of Service for Workflows and Web Service Processes

[...]

Jorge Cardoso¹, Amit P. Sheth², John A. Miller², Jonathan Arnold², Krys J. Kochut² - Show less +1 more•Institutions (2)

University of Madeira¹, University of Georgia²

01 Apr 2004-Journal of Web Semantics

TL;DR: In this article, the authors present a predictive QoS model that makes it possible to compute the quality of service (QoS) for workflows automatically based on atomic task QoS attributes.

...read moreread less

807 citations

Posted Content•

A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques

[...]

Mehdi Allahyari, Seyedamin Pouriyeh, Mehdi Assefi, Saeid Safaei, Elizabeth D. Trippe, Juan B. Gutierrez, Krys J. Kochut - Show less +3 more

10 Jul 2017-arXiv: Computation and Language

TL;DR: Several of the most fundamental text mining tasks and techniques including text pre-processing, classification and clustering are described, which briefly explain text mining in biomedical and health care domains.

...read moreread less

Abstract: The amount of text that is generated every day is increasing dramatically. This tremendous volume of mostly unstructured text cannot be simply processed and perceived by computers. Therefore, efficient and effective techniques and algorithms are required to discover useful patterns. Text mining is the task of extracting meaningful information from text, which has gained significant attentions in recent years. In this paper, we describe several of the most fundamental text mining tasks and techniques including text pre-processing, classification and clustering. Additionally, we briefly explain text mining in biomedical and health care domains.

...read moreread less

422 citations

Journal Article•DOI•

Text Summarization Techniques: A Brief Survey

[...]

Mehdi Allahyari, Seyedamin Pouriyeh, Mehdi Assefi, Saeid Safaei, Elizabeth D. Trippe, Juan B. Gutierrez, Krys J. Kochut - Show less +3 more

07 Jul 2017-International Journal of Advanced Computer Science and Applications

Abstract: In recent years, there has been a explosion in the amount of text data from a variety of sources. This volume of text is an invaluable source of information and knowledge which needs to be effectively summarized to be useful. Text summarization is the task of shortening a text document into a condensed version keeping all the important information and content of the original document. In this review, the main approaches to automatic text summarization are described. We review the different processes for summarization and describe the effectiveness and shortcomings of the different methods.

...read moreread less

303 citations

Book Chapter•DOI•

SPARQLeR: Extended Sparql for Semantic Association Discovery

[...]

Krys J. Kochut¹, Maciej Janik¹•Institutions (1)

University of Georgia¹

03 Jun 2007

TL;DR: SPARQLeR is a novel extension of the SPARQL query language which adds the support for semantic path queries and allows easy and natural formulation of queries involving a wide variety of regular path patterns in RDF graphs.

...read moreread less

Abstract: Complex relationships, frequently referred to as semantic associa-tions, are the essence of the Semantic Web. Query and retrieval of semantic associations has been an important task in many analytical and scientific activities, such as detecting money laundering and querying for metabolic pathways in biochemistry. We believe that support for semantic path queries should be an integral component of RDF query languages. In this paper, we present SPARQLeR, a novel extension of the SPARQL query language which adds the support for semantic path queries. The proposed extension fits seamlessly within the overall syntax and semantics of SPARQL and allows easy and natural formulation of queries involving a wide variety of regular path patterns in RDF graphs. SPARQLeR's path patterns can capture many low-level details of the queried associations. We also present an implementation of SPARQLeR and its initial performance results. Our implementation is built over BRAHMS, our own RDF storage system.

...read moreread less

186 citations

Journal Article•DOI•

Exception Handling in Workflow Systems

[...]

Zongwei Luo¹, Amit P. Sheth¹, Krys J. Kochut¹, John A. Miller¹•Institutions (1)

University of Georgia¹

18 Aug 2000-Applied Intelligence

TL;DR: In this paper, a defeasible workflow framework is proposed to support exception handling for workflow management using ECA rules to capture more contexts in workflow modeling, and a case-based reasoning mechanism with integrated human involvement is used to improve the exception handling capabilities.

...read moreread less

Abstract: In this paper, defeasible workflow is proposed as a framework to support exception handling for workflow management. By using the “justified” ECA rules to capture more contexts in workflow modeling, defeasible workflow uses context dependent reasoning to enhance the exception handling capability of workflow management systems. In particular, this limits possible alternative exception handler candidates in dealing with exceptional situations. Furthermore, a case-based reasoning (CBR) mechanism with integrated human involvement is used to improve the exception handling capabilities. This involves collecting cases to capture experiences in handling exceptions, retrieving similar prior exception handling cases, and reusing the exception handling experiences captured in those cases in new situations.

...read moreread less

163 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17

Collapse

Cited by

PDF

Open Access

More filters

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Journal Article•

Patterns of Somatic Mutation in Human Cancer Genomes

[...]

Michael R. Stratton¹•Institutions (1)

Wellcome Trust Sanger Institute¹

15 Nov 2007-Clinical Cancer Research

TL;DR: In this paper, the coding exons of the family of 518 protein kinases were sequenced in 210 cancers of diverse histological types to explore the nature of the information that will be derived from cancer genome sequencing.

...read moreread less

Abstract: AACR Centennial Conference: Translational Cancer Medicine-- Nov 4-8, 2007; Singapore PL02-05 All cancers are due to abnormalities in DNA. The availability of the human genome sequence has led to the proposal that resequencing of cancer genomes will reveal the full complement of somatic mutations and hence all the cancer genes. To explore the nature of the information that will be derived from cancer genome sequencing we have sequenced the coding exons of the family of 518 protein kinases, ~1.3Mb DNA per cancer sample, in 210 cancers of diverse histological types. Despite the screen being directed toward the coding regions of a gene family that has previously been strongly implicated in oncogenesis, the results indicate that the majority of somatic mutations detected are “passengers”. There is considerable variation in the number and pattern of these mutations between individual cancers, indicating substantial diversity of processes of molecular evolution between cancers. The imprints of exogenous mutagenic exposures, mutagenic treatment regimes and DNA repair defects can all be seen in the distinctive mutational signatures of individual cancers. This systematic mutation screen and others have previously yielded a number of cancer genes that are frequently mutated in one or more cancer types and which are now anticancer drug targets (for example BRAF , PIK3CA , and EGFR ). However, detailed analyses of the data from our screen additionally suggest that there exist a large number of additional “driver” mutations which are distributed across a substantial number of genes. It therefore appears that cells may be able to utilise mutations in a large repertoire of potential cancer genes to acquire the neoplastic phenotype. However, many of these genes are employed only infrequently. These findings may have implications for future anticancer drug development.

...read moreread less

2,737 citations

Journal Article•

Fast Tree: Computing Large Minimum-Evolution Trees with Profiles instead of a Distance Matrix

[...]

Morgan N. Price, Paramvir S. Dehal, Adam P. Arkin

18 Jun 2009-Lawrence Berkeley National Laboratory

TL;DR: FastTree as mentioned in this paper uses sequence profiles of internal nodes in the tree to implement neighbor-joining and uses heuristics to quickly identify candidate joins, then uses nearest-neighbor interchanges to reduce the length of the tree.

...read moreread less

Abstract: Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement neighbor-joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest-neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N^2) space and O(N^2 L) time, but FastTree requires just O( NLa + N sqrt(N) ) memory and O( N sqrt(N) log(N) L a ) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 hours and 2.4 gigabytes of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 hours and 50 gigabytes of memory. In simulations, FastTree was slightly more accurate than neighbor joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.

...read moreread less

2,436 citations

Journal Article•DOI•

Survey of graph database models

[...]

Renzo Angles¹, Claudio Gutierrez¹•Institutions (1)

University of Chile¹

22 Feb 2008-ACM Computing Surveys

TL;DR: The main objective of this survey is to present the work that has been conducted in the area of graph database modeling, concentrating on data structures, query languages, and integrity constraints.

...read moreread less

Abstract: Graph database models can be defined as those in which data structures for the schema and instances are modeled as graphs or generalizations of them, and data manipulation is expressed by graph-oriented operations and type constructors. These models took off in the eighties and early nineties alongside object-oriented models. Their influence gradually died out with the emergence of other database models, in particular geographical, spatial, semistructured, and XML. Recently, the need to manage information with graph-like nature has reestablished the relevance of this area. The main objective of this survey is to present the work that has been conducted in the area of graph database modeling, concentrating on data structures, query languages, and integrity constraints.

...read moreread less

1,669 citations

Journal Article•DOI•

Genome sequencing and analysis of Aspergillus oryzae

[...]

Masayuki Machida¹, Kiyoshi Asai¹, Motoaki Sano¹, Toshihiro Tanaka², Toshitaka Kumagai¹, Goro Terai¹, Goro Terai³, Ken Ichi Kusumoto, Toshihide Arima, Osamu Akita, Yutaka Kashiwagi, Keietsu Abe⁴, Katsuya Gomi⁴, Hiroyuki Horiuchi⁵, Katsuhiko Kitamoto⁵, Tetsuo Kobayashi⁶, Michio Takeuchi⁷, David W. Denning⁸, James E. Galagan⁹, William C. Nierman¹⁰, Jiujiang Yu¹¹, David B. Archer¹², Joan W. Bennett¹³, Deepak Bhatnagar¹¹, Thomas E. Cleveland¹¹, Natalie D. Fedorova¹⁴, Osamu Gotoh¹, Hiroshi Horikawa², Akira Hosoyama², Masayuki Ichinomiya⁵, Rie Igarashi², Kazuhiro Iwashita, Praveen R. Juvvadi⁵, Masashi Kato⁶, Yumiko Kato², Taishin Kin¹, Akira Kokubun², Hiroshi Maeda⁴, Noriko Maeyama², Jun-ichi Maruyama⁵, Hideki Nagasaki¹, Tasuku Nakajima⁴, Ken Oda, Kinya Okada¹, Ian T. Paulsen¹⁴, Kazutoshi Sakamoto, Toshihiko Sawano², Mikio Takahashi², Kumiko Takase¹, Yasunobu Terabayashi¹, Jennifer R. Wortman¹⁴, Osamu Yamada, Youhei Yamagata⁴, Hideharu Anazawa, Yoji Hata, Yoshinao Koide, Takashi Komori³, Yasuji Koyama¹⁵, Toshitaka Minetoki, Sivasundaram Suharnan, Akimitsu Tanaka, Katsumi Isono², Satoru Kuhara¹⁶, Naotake Ogasawara¹⁷, Hisashi Kikuchi² - Show less +61 more•Institutions (17)

National Institute of Advanced Industrial Science and Technology¹, National Institute of Technology and Evaluation², Intec, Inc.³, Tohoku University⁴, University of Tokyo⁵, Nagoya University⁶, Tokyo University of Agriculture and Technology⁷, University of Manchester⁸, Broad Institute⁹, George Washington University¹⁰, Agricultural Research Service¹¹, University of Nottingham¹², Tulane University¹³, J. Craig Venter Institute¹⁴, Kikkoman¹⁵, Kyushu University¹⁶, Nara Institute of Science and Technology¹⁷

22 Dec 2005-Nature

TL;DR: Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.

...read moreread less

Abstract: The genome of Aspergillus oryzae, a fungus important for the production of traditional fermented foods and beverages in Japan, has been sequenced. The ability to secrete large amounts of proteins and the development of a transformation system have facilitated the use of A. oryzae in modern biotechnology. Although both A. oryzae and Aspergillus flavus belong to the section Flavi of the subgenus Circumdati of Aspergillus, A. oryzae, unlike A. flavus, does not produce aflatoxin, and its long history of use in the food industry has proved its safety. Here we show that the 37-megabase (Mb) genome of A. oryzae contains 12,074 genes and is expanded by 7-9 Mb in comparison with the genomes of Aspergillus nidulans and Aspergillus fumigatus. Comparison of the three aspergilli species revealed the presence of syntenic blocks and A. oryzae-specific blocks (lacking synteny with A. nidulans and A. fumigatus) in a mosaic manner throughout the genome of A. oryzae. The blocks of A. oryzae-specific sequence are enriched for genes involved in metabolism, particularly those for the synthesis of secondary metabolites. Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.

...read moreread less

1,149 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse