scispace - formally typeset
Search or ask a question
Institution

Helsinki Institute for Information Technology

FacilityEspoo, Finland
About: Helsinki Institute for Information Technology is a facility organization based out in Espoo, Finland. It is known for research contribution in the topics: Population & Bayesian network. The organization has 630 authors who have published 1962 publications receiving 63426 citations.


Papers
More filters
Book ChapterDOI
15 Sep 2014
TL;DR: It is proved that the problem of discovering communities in interaction networks, which are dense and whose edges occur in short time intervals, is NP-hard, and effective algorithms are provided by adapting techniques used to find dense subgraphs.
Abstract: Online social networks are often defined by considering interactions over large time intervals, e.g., consider pairs of individuals who have called each other at least once in a mobilie-operator network, or users who have made a conversation in a social-media site. Although such a definition can be valuable in many graph-mining tasks, it suffers from a severe limitation: it neglects the precise time that the interaction between network nodes occurs. In this paper we study interaction networks, where one considers not only the social-network topology, but also the exact time that nodes interact. In an interaction network an edge is associated with a time stamp, and multiple edges may occur for the same pair of nodes. Consequently, interaction networks offer a more fine-grained representation that can be used to reveal otherwise hidden dynamic phenomena in the network. We consider the problem of discovering communities in interaction networks, which are dense and whose edges occur in short time intervals. Such communities represent groups of individuals who interact with each other in some specific time instances, for example, a group of employees who work on a project and whose interaction intensifies before certain project milestones.We prove that the problem we define is NP-hard, and we provide effective algorithms by adapting techniques used to find dense subgraphs. We perform extensive evaluation of the proposed methods on synthetic and real datasets, which demonstrates the validity of our concepts and the good performance of our algorithms.

28 citations

Journal ArticleDOI
TL;DR: A new statistical method for discovery of a causal ordering using non-normality of observed variables is developed to provide a partial solution to the problem.

28 citations

Book ChapterDOI
01 Jan 2009
TL;DR: An open source computer program is developed which uses Monte Carlo sampling to compute the upper and lower bounds of these curves for one or more levels of statistical significance, and is able to confirm the hypothesis that the productivity of -ity, as measured by type counts, is significantly low in letters written by women.
Abstract: This work is a case study of applying nonparametric statistical methods to corpus data. We show how to use ideas from permutation testing to answer linguistic questions related to morphological productivity and type richness. In particular, we study the use of the suffixes -ity and -ness in the 17th-century part of the Corpus of Early English Correspondence within the framework of historical sociolinguistics. Our hypothesis is that the productivity of -ity, as measured by type counts, is significantly low in letters written by women. To test such hypotheses, and to facilitate exploratory data analysis, we take the approach of computing accumulation curves for types and hapax legomena. We have developed an open source computer program which uses Monte Carlo sampling to compute the upper and lower bounds of these curves for one or more levels of statistical significance. By comparing the type accumulation from women’s letters with the bounds, we are able to confirm our hypothesis.

28 citations

Journal ArticleDOI
TL;DR: In this paper, a cued-recall method was used to induce involuntary musical imagery (INMI) and delayed self-reports in a large sample of people, and the prevalence of the phenomenon was considerable.
Abstract: It is still a mystery why we sometimes experience the repetition of memories in our minds. This phenomenon seems to be particularly prominent in music. We believe that present lack of knowledge relates to the lack of methods available for the study of this topic. To improve the understanding of involuntary musical imagery (INMI), this paper proposes a novel method to induce it in experimental settings. We report three experiments that were conducted to evaluate two research questions related to INMI: Can it be experimentally induced, and if so, which factors influence its emergence? Investigation particularly focused on how recent activation of musical memory might predict INMI. The questions were tested in single-trial experiments conducted over the internet. The experiments utilized a cued-recall method to induce INMI and delayed self-reports. Among a large sample of people, the prevalence of the phenomenon was considerable. When the familiarity with the stimuli was controlled for, inducing INMI experim...

28 citations

01 Jan 2014
TL;DR: In this paper, an integer linear program (ILP) is proposed for learning bounded tree-width Bayesian networks, which can be solved by an anytime algorithm which provides upper bounds to assess the quality of the found solutions.
Abstract: In many applications one wants to compute conditional probabilities given a Bayesian network. This inference problem is NP-hard in general but becomes tractable when the network has low tree-width. Since the inference problem is common in many application areas, we provide a practical algorithm for learning bounded tree-width Bayesian networks. We cast this problem as an integer linear program (ILP). The program can be solved by an anytime algorithm which provides upper bounds to assess the quality of the found solutions. A key component of our program is a novel integer linear formulation for bounding tree-width of a graph. Our tests clearly indicate that our approach works in practice, as our implementation was able to nd an optimal or nearly optimal network for most of the data sets.

28 citations


Authors

Showing all 632 results

NameH-indexPapersCitations
Dimitri P. Bertsekas9433285939
Olli Kallioniemi9035342021
Heikki Mannila7229526500
Jukka Corander6641117220
Jaakko Kangasjärvi6214617096
Aapo Hyvärinen6130144146
Samuel Kaski5852214180
Nadarajah Asokan5832711947
Aristides Gionis5829219300
Hannu Toivonen5619219316
Nicola Zamboni5312811397
Jorma Rissanen5215122720
Tero Aittokallio522718689
Juha Veijola5226119588
Juho Hamari5117616631
Network Information
Related Institutions (5)
Google
39.8K papers, 2.1M citations

93% related

Microsoft
86.9K papers, 4.1M citations

93% related

Carnegie Mellon University
104.3K papers, 5.9M citations

91% related

Facebook
10.9K papers, 570.1K citations

91% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
20231
20224
202185
202097
2019140
2018127