Author

Su-Cheng Haw

Bio: Su-Cheng Haw is an academic researcher from Multimedia University. The author has contributed to research in topics: XML database & XML. The author has an h-index of 10 and has co-authored 88 publications receiving 472 citations.


Papers
Journal ArticleDOI
TL;DR: An approach for ontology extraction on top of RDB that incorporates concept hierarchy as background knowledge is proposed; it is more efficient than current approaches and can be applied in fields such as eGovernment and eCommerce.
Abstract: Relational Database (RDB) has been widely used as the back-end database of information systems. Containing a wealth of high-quality information, RDB provides the conceptual model and metadata needed in ontology construction. However, most existing ontology building approaches convert the RDB schema without considering the knowledge residing in the database. This paper proposes an approach for ontology extraction on top of RDB by incorporating concept hierarchy as background knowledge. Incorporating the background knowledge in the building process of Web Ontology Language (OWL) ontology gives two main advantages: (1) it accelerates the building process, thereby minimizing the conversion cost; (2) the background knowledge guides the extraction of knowledge residing in the database. The experimental simulation using a gold standard shows that the Taxonomic F-measure (TF) evaluation reaches 90% while Relation Overlap (RO) is 83.33%. In terms of processing time, this approach is more efficient than the current approaches. In addition, our approach can be applied in fields such as eGovernment and eCommerce.

92 citations

Journal ArticleDOI
TL;DR: An indexing classification scheme is suggested, and some of the current trends in indexing methods, which indicate a clear shift towards hybrid indexing, are discussed.
Abstract: With the rapid emergence of XML as a data exchange standard over the Web, storing and querying XML data have become critical issues. The two main approaches to storing XML data are (1) to employ traditional storage such as relational database, object-oriented database and so on, and (2) to create an XML-specific native storage. The storage representation affects the efficiency of query processing. In this paper, firstly, we review the two approaches for storing XML data. Secondly, we review various query optimization techniques such as indexing, labeling and join algorithms to enhance query processing in both approaches. Next, we suggest an indexing classification scheme and discuss some of the current trends in indexing methods, which indicate a clear shift towards hybrid indexing.

57 citations

Journal ArticleDOI
TL;DR: TwigX-Guide is presented, a hybrid system, which takes advantage of the beautiful features of path summary in DataGuide and region encoding in TwigStack to improve complex query processing.

36 citations

Proceedings ArticleDOI
07 May 2007
TL;DR: An extensive comparative study and benchmarking on the popular XML parsers found in the market today is done and a non-validating SAX based XML parser, xParse, is proposed, which proves the viability of the approach.
Abstract: Due to its flexibility and efficiency in transmission of data, XML has become the emerging standard of data transfer and data exchange across the Internet. An XML document must always be checked for well-formedness before data transfer and exchange can take place. Choosing the right parser for an organization's system is critical, since an improper parser will lead to degradation in performance and a decrease in productivity. In this paper, we do an extensive comparative study and benchmarking on the popular XML parsers found in the market today. In addition, we also propose a non-validating SAX based XML parser, xParse. We implemented our technique and present the performance results, which prove the viability of our approach.

26 citations
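
The well-formedness check mentioned in the abstract above can be illustrated with a minimal sketch. It uses the SAX interface from Python's standard library, not the xParse parser from the paper, whose API is not described here; the function name `is_well_formed` is a hypothetical helper chosen for illustration.

```python
import xml.sax


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses without a well-formedness error.

    Assumption: this relies on Python's built-in non-validating SAX
    parser as a stand-in; it is not the xParse implementation.
    """
    try:
        # A bare ContentHandler ignores all events; we only care
        # whether the parser reports a fatal error.
        xml.sax.parseString(xml_text.encode("utf-8"), xml.sax.ContentHandler())
        return True
    except xml.sax.SAXParseException:
        return False
```

Because the parser is non-validating, a document only has to be well-formed to pass; no DTD or schema conformance is checked.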

Journal Article
TL;DR: The storage representation for XML document is surveyed, the XML query processing and optimization techniques with respect to the particular storage instance are reviewed, and the advantages and limitations of optimization techniques are reviewed.
Abstract: Over the past few years, XML (eXtensible Mark-up Language) has emerged as the standard for information representation and data exchange over the Internet. This paper provides a kick-start for new researchers venturing into the XML database field. We survey the storage representation for XML documents and review the XML query processing and optimization techniques with respect to the particular storage instance. Various optimization technologies have been developed to solve the query retrieval and updating problems. In later years, most researchers proposed hybrid optimization techniques. A hybrid system opens the possibility of covering each technology's weakness with the strengths of the others. This paper reviews the advantages and limitations of optimization techniques. Keywords: indexing, labeling scheme, query optimization, XML storage.

20 citations


Cited by
Proceedings ArticleDOI
13 Nov 2009
TL;DR: The method gives 90% accuracy and 100% recall in detecting abnormality at patient level; and achieves an average precision of 91% and recall of 90% at the slice level.
Abstract: Computed tomographic (CT) images are widely used in the diagnosis of stroke. In this paper, we present an automated method to detect and classify an abnormality into acute infarct, chronic infarct and hemorrhage at the slice level of non-contrast CT images. The proposed method consists of three main steps: image enhancement, detection of mid-line symmetry and classification of abnormal slices. A windowing operation is performed on the intensity distribution to enhance the region of interest. Domain knowledge about the anatomical structure of the skull and the brain is used to detect abnormalities in a rotation- and translation-invariant manner. A two-level classification scheme is used to detect abnormalities using features derived in the intensity and the wavelet domain. The proposed method has been evaluated on a dataset of 15 patients (347 image slices). The method gives 90% accuracy and 100% recall in detecting abnormality at patient level; and achieves an average precision of 91% and recall of 90% at the slice level.

113 citations

Journal ArticleDOI
TL;DR: A processing framework is proposed that seeks to optimize the searching efficiency of typed resources in terms of IoT data, information and knowledge inside an integrated architecture, and the framework includes Data Graph, Information Graph and Knowledge Graph.
Abstract: Web services are middleware designed to support the interoperation between different software systems and devices over the Web. Today, we encounter a variety of situations in which services deployed on the Internet of things (IoT), such as wireless sensor networks, ZigBee networks, and mobile edge computing frameworks, have become a widely used infrastructure that has become more flexible, intelligent and automated. This system supports multimedia applications, E-commerce transactions, business collaborations and information processing. However, how to manage these services has been a popular topic in IoT research. Existing research covers numerous resource models, based on sensors or human interactions. For everything as a service, things available as a service include products, processes, resource management and security provision. To cope with the challenge of how to manage these services, we present an extension of Data, Information, Knowledge and Wisdom architecture as a resource expression model to construct a systematic approach to modeling both entity and relationship elements. The entity elements are formalized from a fully typed, multiple-related dimensions perspective to obtain a whole frequency-value-based representation of entities in the real world. A relationship model is extended and applied to define resource models based on relationships defined from a semantics perspective that is based on our proposed existence-level reasoning. Then, a processing framework is proposed that seeks to optimize the searching efficiency of typed resources in terms of IoT data, information and knowledge inside an integrated architecture, and the framework includes Data Graph, Information Graph and Knowledge Graph. We concentrate on improving performance in accessing and processing resources and providing resource security protection by utilizing the cost difference of both type conversions of resources and traversing on resources. Finally, an application scenario is simulated to illustrate the usage of the proposed framework. This scenario shows the feasibility and effectiveness of our method, considering the conversion, traversing and storage costs. Our method can help improve the optimization of services and scheduling resources of multimedia systems.

90 citations

Journal ArticleDOI
TL;DR: The application of the proposed method for early detection of ischemic stroke is demonstrated to improve efficiency and accuracy of clinical practice and the results are quantitatively evaluated by a human expert.

88 citations

Patent
11 Jan 2013
TL;DR: In this paper, a hybrid execution plan can be generated by replacing the procedural pattern with the equivalent declarative operator, and a query execution plan processing cost can be assigned to execution of the hybrid plan.
Abstract: A procedural pattern in a received query execution plan can be matched to a stored pattern for which an equivalent declarative operator has been pre-defined. The query execution plan can describe a query for accessing data. A hybrid execution plan can be generated by replacing the procedural pattern with the equivalent declarative operator. A hybrid execution plan processing cost can be assigned to execution of the hybrid execution plan and a query execution plan processing cost can be assigned to execution of the query execution plan. The assigning can include evaluating a cost model for the hybrid execution plan and the query execution plan. The query can be executed using the hybrid execution plan if the hybrid execution plan processing cost is less than the query execution plan processing cost or the query execution plan if the hybrid execution plan processing cost is greater than the query execution plan processing cost. Related systems, methods, and articles of manufacture are disclosed.

73 citations
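
The cost comparison described in the patent abstract above can be sketched as follows. This is only an illustration under stated assumptions: the `Plan` class, the `choose_plan` function and the numeric costs are hypothetical, the cost model itself is not shown, and the abstract does not say what happens when the two costs are equal, so this sketch keeps the original query execution plan in that case.

```python
from dataclasses import dataclass


@dataclass
class Plan:
    """Hypothetical stand-in for an execution plan with an assigned cost."""
    name: str
    cost: float  # assumed to come from evaluating some cost model


def choose_plan(query_plan: Plan, hybrid_plan: Plan) -> Plan:
    """Pick the hybrid plan only when its cost is strictly lower.

    Assumption: on equal cost the original query plan is kept;
    the abstract leaves this case unspecified.
    """
    if hybrid_plan.cost < query_plan.cost:
        return hybrid_plan
    return query_plan
```

For example, `choose_plan(Plan("original", 12.0), Plan("hybrid", 7.5))` returns the hybrid plan, while swapping the two costs returns the original plan.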

Journal ArticleDOI
TL;DR: This paper proposes some weighted averaging and geometric MSM aggregation operators to address the uncertainties in the medical diagnosis problems and handle the gesture quantification.

70 citations