Home
/
Authors
/
Stan Z. Li

Author

Stan Z. Li

Other affiliations: Microsoft, Macau University of Science and Technology, Beihang University ...read more

Bio: Stan Z. Li is an academic researcher from Westlake University. The author has contributed to research in topics: Facial recognition system & Face detection. The author has an hindex of 97, co-authored 532 publications receiving 41793 citations. Previous affiliations of Stan Z. Li include Microsoft & Macau University of Science and Technology.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1990

Papers

PDF

Open Access

More filters

Journal Article•DOI•

RFold: Towards Simple yet Effective RNA Secondary Structure Prediction

[...]

Cheng Tan, Zhan Gao, Stan Z. Li

arXiv.org

TL;DR: RFold as mentioned in this paper adopts attention maps as informative representations instead of designing hand-crafted features in the pre-processing step, which achieves competitive performance and about eight times faster inference capability than the state-of-the-art method.

...read moreread less

Abstract: The secondary structure of ribonucleic acid (RNA) is more stable and accessible in the cell than its tertiary structure, making it essential in functional prediction. Though deep learning has shown promising results in this ﬁeld, current methods suffer from either the post-processing step with a poor generalization or the pre-processing step with high complexity. In this work, we present RFold, a simple yet effective RNA secondary structure prediction in an end-to-end manner. RFold introduces novel Row-Col Softmax and Row-Col Argmax functions to replace the complicated post-processing step while the output is guaranteed to be valid. Moreover, RFold adopts attention maps as informative representations instead of designing hand-crafted features in the pre-processing step. Extensive experiments demonstrate that RFold achieves competitive performance and about eight times faster inference efﬁciency than the state-of-the-art method. The code and Colab demo are available in github.com/A4Bio/RFold.

...read moreread less

6 citations

Proceedings Article•DOI•

Nearest Feature Line: A Tangent Approximation

[...]

Ran He¹, Meng Ao¹, Shiming Xiang¹, Stan Z. Li¹•Institutions (1)

Chinese Academy of Sciences¹

31 Oct 2008

TL;DR: It is illustrated that NFL, nearest feature plane (NFP) and nearest feature space (NFS) are special cases of tangent approximation, and under the assumption of manifold, localized NFL (LNFL) and closest feature spline (NFB) are introduced to further enhance classification ability and reduce computational complexity.

...read moreread less

Abstract: Nearest feature line (NFL) (S.Z. Li and J. Lu, 1999) is an efficient yet simple classification method for pattern recognition. This paper presents a theoretical analysis and interpretation of NFL from the perspective of manifold analysis, and explains the geometric nature of NFL based similarity measures. It is illustrated that NFL, nearest feature plane (NFP) and nearest feature space (NFS) are special cases of tangent approximation. Under the assumption of manifold, we introduce localized NFL (LNFL) and nearest feature spline (NFB) to further enhance classification ability and reduce computational complexity. The LNFL extends NFL's Euclidean distance to a manifold distance. And for NFB, feature lines are constructed along with a manifold's variation which is defined on a tangent bundle. The proposed methods are validated on a synthetic dataset and two standard face recognition databases (FRGC version 2 and FERET). Experimental results illustrate its efficiency and effectiveness.

...read moreread less

6 citations

Proceedings Article•DOI•

DLME: Deep Local-flatness Manifold Embedding

[...]

Zelin Zang, Siyuan Li, Di Wu, Ge Wang, Lei Shang, Baigui Sun, Haoyang Li, Stan Z. Li - Show less +4 more

07 Jul 2022

TL;DR: This work proposes Deep Local-ﬂatness Manifold Embedding (DLME), a novel ML framework to obtain reliable manifold embedding by reducing distortion and shows that DLME outperforms SOTA ML & contrastive learning (CL) methods.

...read moreread less

Abstract: Manifold learning (ML) aims to ﬁnd low-dimensional embedding from high-dimensional data. Previous works focus on handcraft or easy datasets with simple and ideal scenarios; however, we ﬁnd they perform poorly on real-world datasets with under-sampling data. Generally, ML methods primarily model data structure and subsequently process a low-dimensional embedding, where the poor local connectivity of under-sampling data in the former step and inappropriate optimization objectives in the later step will lead to structural distortion and underconstrained embedding . To solve this problem, we propose Deep Local-ﬂatness Manifold Embedding (DLME), a novel ML framework to obtain reliable manifold embedding by reducing distortion. Our proposed DLME constructs semantic manifolds by data augmentation and overcomes structural distortion problems with the help of its smooth framework. To overcome underconstrained embedding , we design a speciﬁc loss for DLME and mathematically demonstrate that it leads to a more suitable embedding based on our proposed Local Flatness Assumption. In the experiments, by showing the effectiveness of DLME on downstream classiﬁcation, clustering, and visualization tasks with three types of datasets (toy, biological, and image), our experimental results show that DLME outperforms SOTA ML & contrastive learning (CL) methods. that can overcome structural distortions by introducing prior knowledge of data augmentation. In this experiment, all are mapped to a 2-D latent space to facilitate the visualization. All compared ML methods (t-SNE, PUMAP, ivis, and PHA) will fail in the CIFAR dataset; we only show the results of UMAP.

...read moreread less

6 citations

Journal Article•DOI•

Phenotype Classification using Proteome Data in a Data-Independent Acquisition Tensor Format.

[...]

Fangfei Zhang¹, Shaoyang Yu¹, Shaoyang Yu², Lirong Wu¹, Zelin Zang¹, Xiao Yi¹, Jiang Zhu³, Cong Lu³, Ping Sun³, Yaoting Sun¹, Sathiyamoorthy Selvarajan⁴, Lirong Chen⁵, X D Teng⁵, Yongfu Zhao⁶, Guangzhi Wang⁶, Junhong Xiao, Shiang Huang³, Oi Lian Kon, N. Gopalakrishna Iyer, Stan Z. Li¹, Zhongzhi Luan², Tiannan Guo¹ - Show less +18 more•Institutions (6)

Westlake University¹, Beihang University², Huazhong University of Science and Technology³, Singapore General Hospital⁴, Zhejiang University⁵, Dalian Medical University⁶

26 Oct 2020-Journal of the American Society for Mass Spectrometry

TL;DR: A new strategy for DIA data analysis based on a novel data format called DIAT, which enables facile two-dimensional visualization of DIA proteomics data, and surpassed the deep-learning model based on peptide and protein matrices generated by OpenSWATH.

...read moreread less

6 citations

Proceedings Article•DOI•

Efficient object matching using affine-invariant deformable contour

[...]

Zhong Xue, Stan Z. Li¹, Eam Khwang Teoh¹•Institutions (1)

Nanyang Technological University¹

03 Sep 2000

TL;DR: Experiments show that the proposed affine-invariant deformable contour model for object matching is more robust and insensitive to the positions, viewpoints, and large deformations of object shapes, than the active shape model (ASM) and the AI-snake model.

...read moreread less

Abstract: An affine-invariant deformable contour model for object matching, called affine-invariant eigensnake (AI-ES), is presented in the Bayesian framework. In AI-ES, the prior distribution of object shapes is estimated and utilized to constrain the prototype contour, which is dynamically adjustable in the matching process. Also, an affine-invariant internal energy is presented to define the global and local shape deformation of the contours between the shape domain and the image domain. Experiments on real object matching show that the proposed method is more robust and insensitive to the positions, viewpoints, and large deformations of object shapes, than the active shape model (ASM) and the AI-snake model.

...read moreread less

6 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
…
86
87
88
89
90
91
92
…
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

You Only Look Once: Unified, Real-Time Object Detection

[...]

Joseph Redmon¹, Santosh K. Divvala², Ross Girshick³, Ali Farhadi²•Institutions (3)

University of Washington¹, Allen Institute for Artificial Intelligence², Facebook³

27 Jun 2016

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Abstract: We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background. Finally, YOLO learns very general representations of objects. It outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

27,256 citations

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Journal Article•DOI•

Robust Face Recognition via Sparse Representation

[...]

John Wright¹, Allen Y. Yang², Arvind Ganesh¹, S. Shankar Sastry², Yi Ma¹ - Show less +1 more•Institutions (2)

University of Illinois at Urbana–Champaign¹, University of California, Berkeley²

01 Feb 2009-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by C1-minimization.

...read moreread less

Abstract: We consider the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise. We cast the recognition problem as one of classifying among multiple linear regression models and argue that new theory from sparse signal representation offers the key to addressing this problem. Based on a sparse representation computed by C1-minimization, we propose a general classification algorithm for (image-based) object recognition. This new framework provides new insights into two crucial issues in face recognition: feature extraction and robustness to occlusion. For feature extraction, we show that if sparsity in the recognition problem is properly harnessed, the choice of features is no longer critical. What is critical, however, is whether the number of features is sufficiently large and whether the sparse representation is correctly computed. Unconventional features such as downsampled images and random projections perform just as well as conventional features such as eigenfaces and Laplacianfaces, as long as the dimension of the feature space surpasses certain threshold, predicted by the theory of sparse representation. This framework can handle errors due to occlusion and corruption uniformly by exploiting the fact that these errors are often sparse with respect to the standard (pixel) basis. The theory of sparse representation helps predict how much occlusion the recognition algorithm can handle and how to choose the training images to maximize robustness to occlusion. We conduct extensive experiments on publicly available databases to verify the efficacy of the proposed algorithm and corroborate the above claims.

...read moreread less

9,658 citations

Journal Article•DOI•

Integrating single-cell transcriptomic data across different conditions, technologies, and species.

[...]

Andrew Butler, Paul J. Hoffman, Peter Smibert, Efthymia Papalexi¹, Rahul Satija¹ - Show less +1 more•Institutions (1)

New York University¹

02 Apr 2018-Nature Biotechnology

TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.

...read moreread less

Abstract: Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

...read moreread less

7,741 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse