Home
/
Authors
/
Shou-De Lin

Author

Shou-De Lin

Other affiliations: University of Southern California, Chung Shan Medical University, University of Michigan ...read more

Bio: Shou-De Lin is an academic researcher from National Taiwan University. The author has contributed to research in topics: Social network & Ranking (information retrieval). The author has an hindex of 26, co-authored 131 publications receiving 7235 citations. Previous affiliations of Shou-De Lin include University of Southern California & Chung Shan Medical University.

Papers published on a yearly basis

2023
2022
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2006
2005
2004
2003
2001
2000
1970

Papers

PDF

Open Access

More filters

Journal Article•DOI•

An efficient heuristic procedure for partitioning graphs

[...]

Brian W. Kernighan¹, Shou-De Lin•Institutions (1)

Center for Information Technology¹

01 Feb 1970-Bell System Technical Journal

TL;DR: A heuristic method for partitioning arbitrary graphs which is both effective in finding optimal partitions, and fast enough to be practical in solving large problems is presented.

...read moreread less

Abstract: We consider the problem of partitioning the nodes of a graph with costs on its edges into subsets of given sizes so as to minimize the sum of the costs on all edges cut. This problem arises in several physical situations — for example, in assigning the components of electronic circuits to circuit boards to minimize the number of connections between boards. This paper presents a heuristic method for partitioning arbitrary graphs which is both effective in finding optimal partitions, and fast enough to be practical in solving large problems.

...read moreread less

5,082 citations

Journal Article•DOI•

Designing the market game for a trading agent competition

[...]

Michael P. Wellman¹, Peter R. Wurman¹, Kevin O'Malley¹, R. Bangera¹, Shou-De Lin¹, Daniel M. Reeves¹, William E. Walsh¹ - Show less +3 more•Institutions (1)

University of Michigan¹

01 Mar 2001-IEEE Internet Computing

TL;DR: The authors discuss the design and operation of a trading agent competition, focusing on the game structure and some of the key technical issues in running and playing the game.

...read moreread less

Abstract: The authors discuss the design and operation of a trading agent competition, focusing on the game structure and some of the key technical issues in running and playing the game. They also describe the competition's genesis, its technical infrastructure, and its organization. The article by A. Greenwald and P. Stone (2001), describes the competition from a participant's perspective and describes the strategies of some of the top-placing agents. A visualization of the competition and a description of the preliminary and final rounds of the TAC are available in IC Online (http://computer.org/internet/tac.htm).

...read moreread less

170 citations

Proceedings Article•

Feature Engineering and Classifier Ensemble for KDD Cup 2010

[...]

Hsiang-Fu Yu¹, Hung-Yi Lo², Hsun-Ping Hsieh¹, Jing-Kai Lou², Todd G. McKenzie, Jung-Wei Chou¹, Po-Han Chung, Chia-Hua Ho¹, Chun-Fu Chang¹, Jui-Yu Weng¹, En-Syu Yan, Che-Wei Chang, Tsung-Ting Kuo¹, Chien-Yuan Wang¹, Yi-Hung Huang, Yu-Xun Ruan¹, Yu-Shi Lin, Shou-De Lin¹, Hsuan-Tien Lin¹, Chih-Jen Lin¹ - Show less +16 more•Institutions (2)

National Taiwan University¹, Academia Sinica²

01 Jan 2010

TL;DR: This team is the first prize winner of both tracks (all teams and student teams) of KDD Cup 2010 and combined results of student sub-teams by regularized linear regression.

...read moreread less

Abstract: KDD Cup 2010 is an educational data mining competition. Participants are asked to learn a model from students' past behavior and then predict their future performance. At National Taiwan University, we organized a course for this competition. Most student sub-teams expanded features by various binarization and discretization techniques. The resulting sparse feature sets were trained by logistic regression (using LIBLINEAR). One sub-team considered condensed features using simple statistical techniques and applied Random Forest (through Weka) for training. Initial development was conducted on an internal split of training data for training and validation. We identied some useful feature combinations to improve performance. For the nal submission, we combined results of student sub-teams by regularized linear regression. Our team is the rst prize winner of both tracks (all teams and student teams) of KDD Cup 2010.

...read moreread less

168 citations

Proceedings Article•DOI•

Inferring Air Quality for Station Location Recommendation Based on Urban Big Data

[...]

Hsun-Ping Hsieh¹, Shou-De Lin¹, Yu Zheng²•Institutions (2)

National Taiwan University¹, Microsoft²

10 Aug 2015

TL;DR: A semi-supervised inference model utilizing existing monitoring data together with heterogeneous city dynamics, including meteorology, human mobility, structure of road networks, and point of interests is designed and an entropy-minimization model is proposed to suggest the best locations to establish new monitoring stations.

...read moreread less

Abstract: This paper tries to answer two questions. First, how to infer real-time air quality of any arbitrary location given environmental data and historical air quality data from very sparse monitoring locations. Second, if one needs to establish few new monitoring stations to improve the inference quality, how to determine the best locations for such purpose? The problems are challenging since for most of the locations (>99%) in a city we do not have any air quality data to train a model from. We design a semi-supervised inference model utilizing existing monitoring data together with heterogeneous city dynamics, including meteorology, human mobility, structure of road networks, and point of interests (POIs). We also propose an entropy-minimization model to suggest the best locations to establish new monitoring stations. We evaluate the proposed approach using Beijing air quality data, resulting in clear advantages over a series of state-of-the-art and commonly used methods.

...read moreread less

150 citations

Journal Article•

A trading agent competition

[...]

Michael P. Wellman, Peter R. Wurman, Kevin O'Malley, Roshan Bangera, Shou-De Lin, Daniel M. Reeves, William E. Walsh - Show less +3 more

01 Jan 2000-IEEE Internet Computing

TL;DR: This report specifies the Trading Agent Competition Ad Auction game (TAC/AA), a TAC market game in the domain of sponsored search, where agents play the role of search engine advertisers, who compete with each other on ad placement for search results.

...read moreread less

Abstract: We specify the Trading Agent Competition Ad Auction game (TAC/AA), a TAC market game in the domain of sponsored search. Agents play the role of search engine advertisers, who compete with each other on ad placement for search results. This report corresponds to the 2010 version of the game rules.

...read moreread less

112 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Machine learning

[...]

Thomas G. Dietterich¹•Institutions (1)

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

...read moreread less

13,246 citations

[신간의 별자리x] 우리/미술, 그리고 ‘슬픔의 박물관’

[...]

이화영

01 Jan 2015

12,972 citations

Journal Article•DOI•

Finding and evaluating community structure in networks.

[...]

Mark Newman¹, Mark Newman², Michelle Girvan², Michelle Girvan³•Institutions (3)

University of Michigan¹, Santa Fe Institute², Cornell University³

26 Feb 2004-Physical Review E

TL;DR: It is demonstrated that the algorithms proposed are highly effective at discovering community structure in both computer-generated and real-world network data, and can be used to shed light on the sometimes dauntingly complex structure of networked systems.

...read moreread less

Abstract: We propose and study a set of algorithms for discovering community structure in networks-natural divisions of network nodes into densely connected subgroups. Our algorithms all share two definitive features: first, they involve iterative removal of edges from the network to split it into communities, the edges removed being identified using any one of a number of possible "betweenness" measures, and second, these measures are, crucially, recalculated after each removal. We also propose a measure for the strength of the community structure found by our algorithms, which gives us an objective metric for choosing the number of communities into which a network should be divided. We demonstrate that our algorithms are highly effective at discovering community structure in both computer-generated and real-world network data, and show how they can be used to shed light on the sometimes dauntingly complex structure of networked systems.

...read moreread less

12,882 citations

Journal Article•DOI•

Modularity and community structure in networks

[...]

Mark Newman¹•Institutions (1)

University of Michigan¹

06 Jun 2006-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this article, the modularity of a network is expressed in terms of the eigenvectors of a characteristic matrix for the network, which is then used for community detection.

...read moreread less

Abstract: Many networks of interest in the sciences, including social networks, computer networks, and metabolic and regulatory networks, are found to divide naturally into communities or modules. The problem of detecting and characterizing this community structure is one of the outstanding issues in the study of networked systems. One highly effective approach is the optimization of the quality function known as “modularity” over the possible divisions of a network. Here I show that the modularity can be expressed in terms of the eigenvectors of a characteristic matrix for the network, which I call the modularity matrix, and that this expression leads to a spectral algorithm for community detection that returns results of demonstrably higher quality than competing methods in shorter running times. I illustrate the method with applications to several published network data sets.

...read moreread less

10,137 citations

Journal Article•

Data Mining Practical Machine Learning Tools and Techniques

[...]

อนิรุธ สืบสิงห์

01 Jan 2014-Journal of management science

9,185 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse