Author

Kai Puolamäki

Other affiliations: Helsinki Institute of Physics, Helsinki Institute for Information Technology, Finnish Institute of Occupational Health ...

Bio: Kai Puolamäki is an academic researcher from the University of Helsinki. The author has contributed to research on topics including supersymmetry and exploratory data analysis. The author has an h-index of 26 and has co-authored 122 publications receiving 2259 citations. Previous affiliations of Kai Puolamäki include Helsinki Institute of Physics & Helsinki Institute for Information Technology.

Papers published on a yearly basis (chart omitted; years shown: 1997–2001 and 2004–2023)

Papers

Proceedings Article

Combining eye movements and collaborative filtering for proactive information retrieval

Kai Puolamäki¹, Jarkko Salojärvi¹, Eerika Savia¹, Jaana Simola², Samuel Kaski³

Helsinki University of Technology¹, Aalto University², University of Helsinki³

15 Aug 2005

TL;DR: The best prediction accuracy still leaves room for improvement but shows that proactive information retrieval and combination of many sources of relevance feedback is feasible.

Abstract: We study a new task, proactive information retrieval by combining implicit relevance feedback and collaborative filtering. We have constructed a controlled experimental setting, a prototype application, in which the users try to find interesting scientific articles by browsing their titles. Implicit feedback is inferred from eye movement signals, with discriminative hidden Markov models estimated from existing data in which explicit relevance feedback is available. Collaborative filtering is carried out using the User Rating Profile model, a state-of-the-art probabilistic latent variable model, computed using Markov Chain Monte Carlo techniques. For new document titles the prediction accuracy with eye movements, collaborative filtering, and their combination was significantly better than by chance. The best prediction accuracy still leaves room for improvement but shows that proactive information retrieval and combination of many sources of relevance feedback is feasible.

97 citations

Journal Article

Evolution of Neogene Mammals in Eurasia: Environmental Forcing and Biotic Interactions

Mikael Fortelius, Jussi T. Eronen, Ferhat Kaya, Hui Tang, Pasquale Raia¹, Kai Puolamäki²,³

University of Naples Federico II¹, Aalto University², Finnish Institute of Occupational Health³

05 Jun 2014-Annual Review of Earth and Planetary Sciences

TL;DR: It is suggested that species with evolutionary novelties arise predominantly in “species factories” that develop under harsh environmental conditions, under dominant physical forcing, whereas exceptionally mild environments give rise to “oases in the desert,” characterized by strong competition and survival of relics.

Abstract: The relative weights of physical forcing and biotic interaction as drivers of evolutionary change have been debated in evolutionary theory. The recent finding that species, genera, clades, and chronofaunas all appear to exhibit a symmetrical pattern of waxing and waning lends support to the view that biotic interactions shape the history of life. Yet, there is similarly abundant evidence that these primary units of biological evolution arise and wane in coincidence with major climatic change. We review these patterns and the process-level explanations offered for them. We also propose a tentative synthesis, characterized by interdependence between physical forcing and biotic interactions. We suggest that species with evolutionary novelties arise predominantly in “species factories” that develop under harsh environmental conditions, under dominant physical forcing, whereas exceptionally mild environments give rise to “oases in the desert,” characterized by strong competition and survival of relics.

92 citations

Journal Article

Significance testing of word frequencies in corpora

Jefrey Lijffijt¹, Terttu Nevalainen², Tanja Säily², Panagiotis Papapetrou³, Kai Puolamäki⁴, Heikki Mannila⁵

University of Bristol¹, University of Helsinki², Stockholm University³, Finnish Institute of Occupational Health⁴, Aalto University⁵

01 Jun 2016-Digital Scholarship in the Humanities

TL;DR: The significance estimates of various statistical tests are compared in a controlled resampling experiment and in a practical setting, studying differences between texts produced by male and female fiction writers in the British National Corpus to conclude that significance testing can be used to find consequential differences between corpora.

Abstract: Finding out whether a word occurs significantly more often in one text or corpus than in another is an important question in analysing corpora. As noted by Kilgarriff (Language is never, ever, ever, random, Corpus Linguistics and Linguistic Theory, 2005; 1(2): 263–76), the use of the χ² and log-likelihood ratio tests is problematic in this context, as they are based on the assumption that all samples are statistically independent of each other. However, words within a text are not independent. As pointed out in Kilgarriff (Comparing corpora, International Journal of Corpus Linguistics, 2001; 6(1): 1–37) and Paquot and Bestgen (Distinctive words in academic writing: a comparison of three statistical tests for keyword extraction. In Jucker, A., Schreier, D., and Hundt, M. (eds), Corpora: Pragmatics and Discourse. Amsterdam: Rodopi, 2009, pp. 247–69), it is possible to represent the data differently and employ other tests, such that we assume independence at the level of texts rather than individual words. This allows us to account for the distribution of words within a corpus. In this article we compare the significance estimates of various statistical tests in a controlled resampling experiment and in a practical setting, studying differences between texts produced by male and female fiction writers in the British National Corpus. We find that the choice of the test, and hence data representation, matters. We conclude that significance testing can be used to find consequential differences between corpora, but that assuming independence between all words may lead to overestimating the significance of the observed differences, especially for poorly dispersed words. We recommend the use of the t-test, Wilcoxon rank-sum test, or bootstrap test for comparing word frequencies across corpora.

86 citations
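
The text-level representation described in the abstract above can be illustrated with a minimal sketch. It is not the authors' code: the function names, the tokenised-text input format, and the choice of a normal approximation without tie correction are all assumptions made here for illustration. Each text contributes one relative frequency, and the two corpora are compared with a Wilcoxon rank-sum test, one of the tests the abstract recommends.

```python
from statistics import NormalDist


def relative_frequencies(texts, word):
    """Per-text relative frequency of `word`; each text is a list of tokens.

    Hypothetical helper: one observation per text, so independence is
    assumed between texts rather than between individual words.
    """
    return [text.count(word) / len(text) for text in texts if text]


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def rank_sum_test(sample_a, sample_b):
    """Two-sided Wilcoxon rank-sum test (normal approximation, no tie correction).

    Returns (z, p). A simplification assumed for this sketch; a statistics
    library would normally be used in practice.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    ranks = average_ranks(list(sample_a) + list(sample_b))
    u = sum(ranks[:n_a]) - n_a * (n_a + 1) / 2
    mean_u = n_a * n_b / 2
    sd_u = (n_a * n_b * (n_a + n_b + 1) / 12) ** 0.5
    z = (u - mean_u) / sd_u
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p
```

A call such as `rank_sum_test(relative_frequencies(corpus_a, "she"), relative_frequencies(corpus_b, "she"))` would compare the two corpora at the level of texts; `corpus_a` and `corpus_b` are placeholder names for lists of tokenised texts.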

Journal Article

Exploring the mammalian sensory space: co-operations and trade-offs among senses

Sirpa Nummela¹, Henry Pihlström¹, Kai Puolamäki, Mikael Fortelius¹, Simo Hemilä¹, Tom Reuter¹

University of Helsinki¹

17 Sep 2013-Journal of Comparative Physiology A: Neuroethology, Sensory, Neural, and Behavioral Physiology

TL;DR: The method allows morphologists to identify sensory organ combinations that are characteristic of particular ecological niches, and observes a strong correlation between eyes and ears, indicating that co-operation between vision and hearing is a general mammalian feature.

Abstract: The evolution of a particular sensory organ is often discussed with no consideration of the roles played by other senses. Here, we treat mammalian vision, olfaction and hearing as an interconnected whole, a three-dimensional sensory space, evolving in response to ecological challenges. Until now, there has been no quantitative method for estimating how much a particular animal invests in its different senses. We propose an anatomical measure based on sensory organ sizes. Dimensions of functional importance are defined and measured, and normalized in relation to animal mass. For 119 taxonomically and ecologically diverse species, we can define the position of the species in a three-dimensional sensory space. Thus, we can ask questions related to possible trade-off vs. co-operation among senses. More generally, our method allows morphologists to identify sensory organ combinations that are characteristic of particular ecological niches. After normalization for animal size, we note that arboreal mammals tend to have larger eyes and smaller noses than terrestrial mammals. On the other hand, we observe a strong correlation between eyes and ears, indicating that co-operation between vision and hearing is a general mammalian feature. For some groups of mammals we note a correlation, and possible co-operation between olfaction and whiskers.

82 citations

Proceedings Article

Tell me something I don't know: randomization strategies for iterative data mining

Sami Hanhijärvi¹, Markus Ojala¹, Niko Vuokko¹, Kai Puolamäki¹, Nikolaj Tatti¹, Heikki Mannila¹

Helsinki University of Technology¹

28 Jun 2009

TL;DR: This paper considers the problem of randomizing data so that previously discovered patterns or models are taken into account; the results indicate that in many cases, the results of, e.g., clustering actually imply the results of, say, frequent pattern discovery.

Abstract: There is a wide variety of data mining methods available, and it is generally useful in exploratory data analysis to use many different methods for the same dataset. This, however, leads to the problem of whether the results found by one method are a reflection of the phenomenon shown by the results of another method, or whether the results depict in some sense unrelated properties of the data. For example, using clustering can give indication of a clear cluster structure, and computing correlations between variables can show that there are many significant correlations in the data. However, it can be the case that the correlations are actually determined by the cluster structure. In this paper, we consider the problem of randomizing data so that previously discovered patterns or models are taken into account. The randomization methods can be used in iterative data mining. At each step in the data mining process, the randomization produces random samples from the set of data matrices satisfying the already discovered patterns or models. That is, given a data set and some statistics (e.g., cluster centers or co-occurrence counts) of the data, the randomization methods sample data sets having similar values of the given statistics as the original data set. We use Metropolis sampling based on local swaps to achieve this. We describe experiments on real data that demonstrate the usefulness of our approach. Our results indicate that in many cases, the results of, e.g., clustering actually imply the results of, say, frequent pattern discovery.

81 citations
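
The idea of Metropolis sampling with local swaps, as summarised in the abstract above, can be sketched roughly as follows. This is an illustrative toy, not the paper's algorithm: the swap move (exchanging two entries within one column), the statistic (per-group mean vectors standing in for cluster centers), the exponential acceptance rule, and every name and parameter (`weight`, `steps`, `seed`) are assumptions chosen for this sketch.

```python
import math
import random


def group_means(matrix, labels, n_groups):
    """Mean row vector of each group (a stand-in for cluster centers)."""
    n_cols = len(matrix[0])
    sums = [[0.0] * n_cols for _ in range(n_groups)]
    counts = [0] * n_groups
    for row, g in zip(matrix, labels):
        counts[g] += 1
        for c, value in enumerate(row):
            sums[g][c] += value
    return [[s / max(counts[g], 1) for s in sums[g]] for g in range(n_groups)]


def energy(matrix, labels, target, n_groups):
    """Squared distance between current group means and the original ones."""
    means = group_means(matrix, labels, n_groups)
    return sum(
        (m - t) ** 2
        for mean_row, target_row in zip(means, target)
        for m, t in zip(mean_row, target_row)
    )


def metropolis_swap_sample(matrix, labels, n_groups, steps=2000, weight=50.0, seed=0):
    """Return a randomized copy of `matrix` whose group means stay close to
    the original ones. Requires at least two rows.

    Each step swaps two entries in one column (so every column keeps its
    values) and accepts the swap with the Metropolis rule for a density
    proportional to exp(-weight * energy).
    """
    rng = random.Random(seed)
    target = group_means(matrix, labels, n_groups)
    sample = [row[:] for row in matrix]
    current = 0.0  # the unmodified copy matches the target exactly
    n_rows, n_cols = len(sample), len(sample[0])
    for _ in range(steps):
        r1, r2 = rng.sample(range(n_rows), 2)
        c = rng.randrange(n_cols)
        sample[r1][c], sample[r2][c] = sample[r2][c], sample[r1][c]
        proposed = energy(sample, labels, target, n_groups)
        if proposed <= current or rng.random() < math.exp(-weight * (proposed - current)):
            current = proposed
        else:
            # Rejected: undo the swap.
            sample[r1][c], sample[r2][c] = sample[r2][c], sample[r1][c]
    return sample
```

Recomputing the energy from scratch at every step keeps the sketch short; an incremental update of the affected group means would be the natural optimisation for larger matrices.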

Cited by

Journal Article

The Theory of Island Biogeography

Jeff Swinebroad, Robert H. MacArthur, Edward O. Wilson

01 Oct 1969-Journal of Wildlife Management

Abstract: Contents: Preface to the Princeton Landmarks in Biology Edition; Preface; Symbols Used; 1. The Importance of Islands; 2. Area and Number of Species; 3. Further Explanations of the Area-Diversity Pattern; 4. The Strategy of Colonization; 5. Invasibility and the Variable Niche; 6. Stepping Stones and Biotic Exchange; 7. Evolutionary Changes Following Colonization; 8. Prospect; Glossary; References; Index.

14,171 citations

Journal Article

Machine learning

Thomas G. Dietterich¹

Oregon State University¹

01 Dec 1996-ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. 
Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Pattern Recognition and Machine Learning

Christopher M. Bishop¹

Microsoft¹

01 Jan 2006

TL;DR: The book covers probability distributions, linear models for regression and classification, neural networks, kernel methods, graphical models, mixture models and EM, approximate inference, sampling methods, continuous latent variables, sequential data, and combining models.

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

Book

Machine Learning: A Probabilistic Perspective

Kevin P. Murphy

24 Aug 2012

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

Abstract: Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package--PMTK (probabilistic modeling toolkit)--that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

8,059 citations

Computer vision: a modern approach

David Forsyth, Jean Ponce

01 Jan 2004

TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.

Abstract: From the Publisher: The accessible presentation of this book gives both a general view of the entire computer vision enterprise and also offers sufficient detail to be able to build useful applications. Users learn techniques that have proven to be useful by first-hand experience and a wide range of mathematical methods. A CD-ROM with every copy of the text contains source code for programming practice, color images, and illustrative movies. Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance. Topics are discussed in substantial and increasing depth. Application surveys describe numerous important application areas such as image based rendering and digital libraries. Many important algorithms are broken down and illustrated in pseudo code. Appropriate for use by engineers as a comprehensive reference to the computer vision enterprise.

3,627 citations
