scispace - formally typeset
Search or ask a question
Author

Hong Yan

Bio: Hong Yan is an academic researcher from City University of Hong Kong. The author has contributed to research in topics: Cluster analysis & Image segmentation. The author has an hindex of 58, co-authored 809 publications receiving 15508 citations. Previous affiliations of Hong Yan include Yale University & China Academy of Engineering Physics.


Papers
More filters
Book
01 Oct 1996
TL;DR: Fuzzy rules and defuzzification: rules based on experience learning from examples decision tree approach neural network approach minimization of fuzzy rulesdefuzzification and optimization applications concluding remarks.
Abstract: Introduction: Fuzzy sets probability and fuzziness fuzzy models Membership functions: heuristic selections clustering approaches adjustment and toning applications concluding remarks Fuzzy clustering: clustering and fuzzy partition fuzzy c-means algorithm fuzzy cohonen clustering networks cluster validity and optimal fuzzy clustering applications concluding remarks Fuzzy rules and defuzzification: rules based on experience learning from examples decision tree approach neural network approach minimization of fuzzy rules defuzzification and optimization applications concluding remarks Fuzzy classifiers: fuzzy nearest neighbour classifier fuzzy multilayer perceptron fuzy decision trees fuzzy string matching applications concluding remarks Combined clasifications: introduction voting schemes maximum poteriori probability Dempster-Shafer evidence theory trained perceptron neural networks applications concluding remarks

528 citations

Journal ArticleDOI
TL;DR: An adaptive spatial fuzzy c-means clustering algorithm is presented in this paper for the segmentation of three-dimensional (3-D) magnetic resonance (MR) images that takes into account the spatial continuity constraints by using a dissimilarity index that allows spatial interactions between image voxels.
Abstract: An adaptive spatial fuzzy c-means clustering algorithm is presented in this paper for the segmentation of three-dimensional (3-D) magnetic resonance (MR) images. The input images may be corrupted by noise and intensity nonuniformity (INU) artifact. The proposed algorithm takes into account the spatial continuity constraints by using a dissimilarity index that allows spatial interactions between image voxels. The local spatial continuity constraint reduces the noise effect and the classification ambiguity. The INU artifact is formulated as a multiplicative bias field affecting the true MR imaging signal. By modeling the log bias field as a stack of smoothing B-spline surfaces, with continuity enforced across slices, the computation of the 3-D bias field reduces to that of finding the B-spline coefficients, which can be obtained using a computationally efficient two-stage algorithm. The efficacy of the proposed algorithm is demonstrated by extensive segmentation experiments using both simulated and real MR images and by comparison with other published algorithms.

347 citations

Journal ArticleDOI
TL;DR: The corner detection scheme introduced in this paper can provide accurate information about the corners and accurately locate the templates in relation to the eye images and greatly reduce the processing time for the templates.

345 citations

Journal ArticleDOI
TL;DR: An analytic-to-holistic approach which can identify faces at different perspective variations is proposed, and it is shown that this approach can achieve a similar level of performance from different viewing directions of a face.
Abstract: We propose an analytic-to-holistic approach which can identify faces at different perspective variations. The database for the test consists of 40 frontal-view faces. The first step is to locate 15 feature points on a face. A head model is proposed, and the rotation of the face can be estimated using geometrical measurements. The positions of the feature points are adjusted so that their corresponding positions for the frontal view are approximated. These feature points are then compared with the feature points of the faces in a database using a similarity transform. In the second step, we set up windows for the eyes, nose, and mouth. These feature windows are compared with those in the database by correlation. Results show that this approach can achieve a similar level of performance from different viewing directions of a face. Under different perspective variations, the overall recognition rates are over 84 percent and 96 percent for the first and the first three likely matched faces, respectively.

269 citations

Journal ArticleDOI
TL;DR: A new approach in text watermarking is proposed in which interword spaces of different text lines are slightly modified, and the average spaces of various lines have the characteristics of a sine wave and the wave constitutes a mark.
Abstract: Digital watermarking is widely believed to be a valid means to discourage illicit distribution of information content. Digital watermarking methods for text documents are limited because of the binary nature of text documents. A distinct feature of a text document is its space patterning. We propose a new approach in text watermarking in which interword spaces of different text lines are slightly modified. After the modification, the average spaces of various lines have the characteristics of a sine wave and the wave constitutes a mark. Both nonblind and blind watermarking algorithms are discussed. Preliminary experiments have shown promising results. Our experiments suggest that space patterning of text documents can be a useful tool in digital watermarking.

223 citations


Cited by
More filters
Journal ArticleDOI

[...]

08 Dec 2001-BMJ
TL;DR: There is, I think, something ethereal about i —the square root of minus one, which seems an odd beast at that time—an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Journal ArticleDOI
TL;DR: In this paper, the authors provide an up-to-date critical survey of still-and video-based face recognition research, and provide some insights into the studies of machine recognition of faces.
Abstract: As one of the most successful applications of image analysis and understanding, face recognition has recently received significant attention, especially during the past several years. At least two reasons account for this trend: the first is the wide range of commercial and law enforcement applications, and the second is the availability of feasible technologies after 30 years of research. Even though current machine recognition systems have reached a certain level of maturity, their success is limited by the conditions imposed by many real applications. For example, recognition of face images acquired in an outdoor environment with changes in illumination and/or pose remains a largely unsolved problem. In other words, current systems are still far away from the capability of the human perception system.This paper provides an up-to-date critical survey of still- and video-based face recognition research. There are two underlying motivations for us to write this survey paper: the first is to provide an up-to-date review of the existing literature, and the second is to offer some insights into the studies of machine recognition of faces. To provide a comprehensive survey, we not only categorize existing recognition techniques but also present detailed descriptions of representative methods within each category. In addition, relevant topics such as psychophysical studies, system evaluation, and issues of illumination and pose variation are covered.

6,384 citations

Journal ArticleDOI
TL;DR: A critical review of the nature of the problem, the state-of-the-art technologies, and the current assessment metrics used to evaluate learning performance under the imbalanced learning scenario is provided.
Abstract: With the continuous expansion of data availability in many large-scale, complex, and networked systems, such as surveillance, security, Internet, and finance, it becomes critical to advance the fundamental understanding of knowledge discovery and analysis from raw data to support decision-making processes. Although existing knowledge discovery and data engineering techniques have shown great success in many real-world applications, the problem of learning from imbalanced data (the imbalanced learning problem) is a relatively new challenge that has attracted growing attention from both academia and industry. The imbalanced learning problem is concerned with the performance of learning algorithms in the presence of underrepresented data and severe class distribution skews. Due to the inherent complex characteristics of imbalanced data sets, learning from such data requires new understandings, principles, algorithms, and tools to transform vast amounts of raw data efficiently into information and knowledge representation. In this paper, we provide a comprehensive review of the development of research in learning from imbalanced data. Our focus is to provide a critical review of the nature of the problem, the state-of-the-art technologies, and the current assessment metrics used to evaluate learning performance under the imbalanced learning scenario. Furthermore, in order to stimulate future research in this field, we also highlight the major opportunities and challenges, as well as potential important research directions for learning from imbalanced data.

6,320 citations

Journal ArticleDOI
TL;DR: Clustering algorithms for data sets appearing in statistics, computer science, and machine learning are surveyed, and their applications in some benchmark data sets, the traveling salesman problem, and bioinformatics, a new field attracting intensive efforts are illustrated.
Abstract: Data analysis plays an indispensable role for understanding various phenomena. Cluster analysis, primitive exploration with little or no prior knowledge, consists of research developed across a wide variety of communities. The diversity, on one hand, equips us with many tools. On the other hand, the profusion of options causes confusion. We survey clustering algorithms for data sets appearing in statistics, computer science, and machine learning, and illustrate their applications in some benchmark data sets, the traveling salesman problem, and bioinformatics, a new field attracting intensive efforts. Several tightly related topics, proximity measure, and cluster validation, are also discussed.

5,744 citations