scispace - formally typeset
Search or ask a question
Author

Huang-Chia Shih

Bio: Huang-Chia Shih is an academic researcher from Yuan Ze University. The author has contributed to research in topics: Image segmentation & Video tracking. The author has an hindex of 13, co-authored 66 publications receiving 770 citations. Previous affiliations of Huang-Chia Shih include National Tsing Hua University & Tsinghua University.


Papers
More filters
Journal ArticleDOI
TL;DR: This paper focuses on the video content analysis techniques applied in sportscasts over the past decade from the perspectives of fundamentals and general review, a content hierarchical model, and trends and challenges.
Abstract: Sports data analysis is becoming increasingly large scale, diversified, and shared, but difficulty persists in rapidly accessing the most crucial information. Previous surveys have focused on the methodologies of sports video analysis from the spatiotemporal viewpoint instead of a content-based viewpoint, and few of these studies have considered semantics. This paper develops a deeper interpretation of content-aware sports video analysis by examining the insight offered by research into the structure of content under different scenarios. On the basis of this insight, we provide an overview of the themes particularly relevant to the research on content-aware systems for broadcast sports. Specifically, we focus on the video content analysis techniques applied in sportscasts over the past decade from the perspectives of fundamentals and general review, a content hierarchical model, and trends and challenges. Content-aware analysis methods are discussed with respect to object-, event-, and context-oriented groups. In each group, the gap between sensation and content excitement must be bridged using proper strategies. In this regard, a content-aware approach is required to determine user demands. Finally, this paper summarizes the future trends and challenges for sports video analysis. We believe that our findings can advance the field of research on content-aware video analysis for broadcast sports.

179 citations

Journal ArticleDOI
TL;DR: A semantic analysis system based on Bayesian network (BN) and dynamic Bayesiannetwork (DBN) that can identify the special events in soccer games such as goal event, corner kick event, penaltyKick event, and card event is introduced.
Abstract: Video semantic analysis is formulated based on the low-level image features and the high-level knowledge which is encoded in abstract, nongeometric representations. This paper introduces a semantic analysis system based on Bayesian network (BN) and dynamic Bayesian network (DBN). It is validated in the particular domain of soccer game videos. Based on BN/DBN, it can identify the special events in soccer games such as goal event, corner kick event, penalty kick event, and card event. The video analyzer extracts the low-level evidences, whereas the semantic analyzer uses BN/DBN to interpret the high-level semantics. Different from previous shot-based semantic analysis approaches, the proposed semantic analysis is frame-based for each input frame, it provides the current semantics of the event nodes as well as the hidden nodes. Another contribution is that the BN and DBN are automatically generated by the training process instead of determined by ad hoc. The last contribution is that we introduce a so-called temporal intervening network to improve the accuracy of the semantics output

164 citations

Journal ArticleDOI
TL;DR: This paper presents a robust day-and-night people tracking and counting algorithm based on an image-based depth sensor and a programmable pan-tilt-zoom camera that enables the continuous detection and tracking of the occupants, even under dim-lighting conditions.

82 citations

Journal ArticleDOI
TL;DR: A semantic highlight detection scheme using a Multi-level Semantic Network (MSN) for baseball video interpretation is proposed and the probabilistic structure can be applied for highlight detection and shot classification.
Abstract: The information processing of sports video yields valuable semantics for content delivery over narrowband networks. Traditional image/video processing is formulated in terms of low-level features describing image/video structure and intensity, while the high-level knowledge such as common sense and human perceptual knowledge are encoded in abstract and nongeometric representations. The management of semantic information in video becomes more and more difficult because of the large difference in representations, levels of knowledge, and abstract episodes. This paper proposes a semantic highlight detection scheme using a Multi-level Semantic Network (MSN) for baseball video interpretation. The probabilistic structure can be applied for highlight detection and shot classification. Satisfactory results will be shown to illustrate better performance compared with the traditional ones.

37 citations

Journal ArticleDOI
TL;DR: A novel algorithm for improved fingerprinting-based indoor localization that decomposes the CSI sequence using the multilevel discrete wavelet transform (MDWT) and normalizes the wavelet coefficients by employing histogram equalization is proposed.
Abstract: Recently, channel state information (CSI) has been adopted as an enhanced wireless channel measurement instead of received signal strength (RSS) for indoor WiFi positioning systems. However, although CSI contains richer location information, a challenging problem is the severe dynamic range and fluctuation among the high-dimensional channels, which may degrade accuracy and cause overfitting problems. This paper proposes a novel algorithm for improved fingerprinting-based indoor localization. The proposed algorithm decomposes the CSI sequence using the multilevel discrete wavelet transform (MDWT) and normalizes the wavelet coefficients by employing histogram equalization. The robust features were then extracted by reconstructing CSI through the inverse MDWT of the normalized coefficients. We demonstrate the effectiveness of the proposed algorithm through experiments. The results show that the proposed algorithm outperforms traditional RSS, CSI, and two CSI-based algorithms, FIFS and MIMO.

35 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The purpose of this article is to provide a systematic classification of various ideas and techniques proposed towards the effective abstraction of video contents, and identify and detail, for each approach, the underlying components and how they are addressed in specific works.
Abstract: The demand for various multimedia applications is rapidly increasing due to the recent advance in the computing and network infrastructure, together with the widespread use of digital video technology. Among the key elements for the success of these applications is how to effectively and efficiently manage and store a huge amount of audio visual information, while at the same time providing user-friendly access to the stored data. This has fueled a quickly evolving research area known as video abstraction. As the name implies, video abstraction is a mechanism for generating a short summary of a video, which can either be a sequence of stationary images (keyframes) or moving images (video skims). In terms of browsing and navigation, a good video abstract will enable the user to gain maximum information about the target video sequence in a specified time constraint or sufficient information in the minimum time. Over past years, various ideas and techniques have been proposed towards the effective abstraction of video contents. The purpose of this article is to provide a systematic classification of these works. We identify and detail, for each approach, the underlying components and how they are addressed in specific works.

879 citations

01 Jan 2014
TL;DR: This article surveys the new trend of channel response in localization and investigates a large body of recent works and classify them overall into three categories according to how to use CSI, highlighting the differences between CSI and RSSI.
Abstract: The spatial features of emitted wireless signals are the basis of location distinction and determination for wireless indoor localization. Available in mainstream wireless signal measurements, the Received Signal Strength Indicator (RSSI) has been adopted in vast indoor localization systems. However, it suffers from dramatic performance degradation in complex situations due to multipath fading and temporal dynamics. Break-through techniques resort to finer-grained wireless channel measurement than RSSI. Different from RSSI, the PHY layer power feature, channel response, is able to discriminate multipath characteristics, and thus holds the potential for the convergence of accurate and pervasive indoor localization. Channel State Information (CSI, reflecting channel response in 802.11 a/g/n) has attracted many research efforts and some pioneer works have demonstrated submeter or even centimeter-level accuracy. In this article, we survey this new trend of channel response in localization. The differences between CSI and RSSI are highlighted with respect to network layering, time resolution, frequency resolution, stability, and accessibility. Furthermore, we investigate a large body of recent works and classify them overall into three categories according to how to use CSI. For each category, we emphasize the basic principles and address future directions of research in this new and largely open area.

612 citations

Journal ArticleDOI
TL;DR: It is argued that video summarisation would benefit from greater incorporation of external information, particularly user based information that is unobtrusively sourced, in order to overcome longstanding challenges such as the semantic gap and providing video summaries that have greater relevance to individual users.

468 citations

Journal ArticleDOI
TL;DR: In this article, an ontology to represent energy-related occupant behavior in buildings is presented, based on four key components: i) the drivers of behavior, ii) the needs of the occupants, iii) the actions carried out by the occupants and iv) the building systems acted upon by occupants.

250 citations

Journal ArticleDOI
TL;DR: In this paper, the authors developed an indirect data mining approach using office appliance power consumption data to learn the occupant "passive" behavior in a medium office building, where the average percentage of correctly classified individual behavior instances is 90.29%.

242 citations