Search or ask a question

Showing papers by "Kai Puolamäki published in 2015"

PDF

Open Access

Book Chapter•DOI•

GoldenEye++: A Closer Look into the Black Box

[...]

Andreas Henelius¹, Kai Puolamäki¹, Isak Karlsson², Jing Zhao², Lars Asker², Henrik Boström², Panagiotis Papapetrou² - Show less +3 more•Institutions (2)

Finnish Institute of Occupational Health¹, Stockholm University²

20 Apr 2015

TL;DR: An empirical investigation comparing the two algorithms shows that the new version of the algorithm in several cases finds groups of medically relevant interacting attributes, corresponding to prescribed drugs, undetected by the previous version, suggesting that the GoldenEye++ algorithm can be a useful tool for finding novel (adverse) drug interactions.

...read moreread less

Abstract: Models with high predictive performance are often opaque, i.e., they do not allow for direct interpretation, and are hence of limited value when the goal is to understand the reasoning behind predictions. A recently proposed algorithm, GoldenEye, allows detection of groups of interacting variables exploited by a model. We employed this technique in conjunction with random forests generated from data obtained from electronic patient records for the task of detecting adverse drug events (ADEs). We propose a refined version of the GoldenEye algorithm, called GoldenEye++, utilizing a more sensitive grouping metric. An empirical investigation comparing the two algorithms on 27 datasets related to detecting ADEs shows that the new version of the algorithm in several cases finds groups of medically relevant interacting attributes, corresponding to prescribed drugs, undetected by the previous version. This suggests that the GoldenEye++ algorithm can be a useful tool for finding novel (adverse) drug interactions.

...read moreread less

15 citations

Journal Article•DOI•

Size matters: choosing the most informative set of window lengths for mining patterns in event sequences

[...]

Jefrey Lijffijt¹, Panagiotis Papapetrou², Kai Puolamäki³•Institutions (3)

University of Bristol¹, Stockholm University², Finnish Institute of Occupational Health³

01 Nov 2015-Data Mining and Knowledge Discovery

TL;DR: This work introduces the problem of finding the best set of window lengths for analysis of event sequences for algorithms with real-valued output and shows that the problem is NP-hard in general, but that it can be approximated efficiently and even analytically in certain cases.

...read moreread less

Abstract: In order to find patterns in data, it is often necessary to aggregate or summarise data at a higher level of granularity. Selecting the appropriate granularity is a challenging task and often no principled solutions exist. This problem is particularly relevant in analysis of data with sequential structure. We consider this problem for a specific type of data, namely event sequences. We introduce the problem of finding the best set of window lengths for analysis of event sequences for algorithms with real-valued output. We present suitable criteria for choosing one or multiple window lengths and show that these naturally translate into a computational optimisation problem. We show that the problem is NP-hard in general, but that it can be approximated efficiently and even analytically in certain cases. We give examples of tasks that demonstrate the applicability of the problem and present extensive experiments on both synthetic data and real data from several domains. We find that the method works well in practice, and that the optimal sets of window lengths themselves can provide new insight into the data.

...read moreread less

5 citations

Re: Know Ecosystem Plan

[...]

Tuukka Lehtiniemi, Antti Poikola, Kai Kuikkaniemi, Patrik Floréen, Marko Turpeinen, Ben Cowley, Kristian Lukander, Kai Puolamäki - Show less +4 more

01 Jan 2015