Institution

Helsinki Institute for Information Technology

Facility•Espoo, Finland•

About: Helsinki Institute for Information Technology is a facility organization based out in Espoo, Finland. It is known for research contribution in the topics: Population & Bayesian network. The organization has 630 authors who have published 1962 publications receiving 63426 citations.

...read moreread less

Topics: Population, Bayesian network, The Internet, Mobile computing, Cluster analysis ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Discovering dynamic communities in interaction networks

[...]

Polina Rozenshtein¹, Nikolaj Tatti¹, Aristides Gionis¹•Institutions (1)

Helsinki Institute for Information Technology¹

15 Sep 2014

TL;DR: It is proved that the problem of discovering communities in interaction networks, which are dense and whose edges occur in short time intervals, is NP-hard, and effective algorithms are provided by adapting techniques used to find dense subgraphs.

...read moreread less

Abstract: Online social networks are often defined by considering interactions over large time intervals, e.g., consider pairs of individuals who have called each other at least once in a mobilie-operator network, or users who have made a conversation in a social-media site. Although such a definition can be valuable in many graph-mining tasks, it suffers from a severe limitation: it neglects the precise time that the interaction between network nodes occurs. In this paper we study interaction networks, where one considers not only the social-network topology, but also the exact time that nodes interact. In an interaction network an edge is associated with a time stamp, and multiple edges may occur for the same pair of nodes. Consequently, interaction networks offer a more fine-grained representation that can be used to reveal otherwise hidden dynamic phenomena in the network. We consider the problem of discovering communities in interaction networks, which are dense and whose edges occur in short time intervals. Such communities represent groups of individuals who interact with each other in some specific time instances, for example, a group of employees who work on a project and whose interaction intensifies before certain project milestones.We prove that the problem we define is NP-hard, and we provide effective algorithms by adapting techniques used to find dense subgraphs. We perform extensive evaluation of the proposed methods on synthetic and real datasets, which demonstrates the validity of our concepts and the good performance of our algorithms.

...read moreread less

28 citations

Journal Article•DOI•

Finding a causal ordering via independent component analysis

[...]

Shohei Shimizu¹, Aapo Hyvärinen¹, Patrik O. Hoyer¹, Yutaka Kano²•Institutions (2)

Helsinki Institute for Information Technology¹, Osaka University²

01 Jul 2006-Computational Statistics & Data Analysis

TL;DR: A new statistical method for discovery of a causal ordering using non-normality of observed variables is developed to provide a partial solution to the problem.

...read moreread less

28 citations

Book Chapter•DOI•

Comparing type counts: The case of women, men and -ity in early English letters

[...]

Tanja Säily¹, Jukka Suomela²•Institutions (2)

University of Helsinki¹, Helsinki Institute for Information Technology²

01 Jan 2009

TL;DR: An open source computer program is developed which uses Monte Carlo sampling to compute the upper and lower bounds of these curves for one or more levels of statistical significance, and is able to confirm the hypothesis that the productivity of -ity, as measured by type counts, is significantly low in letters written by women.

...read moreread less

Abstract: This work is a case study of applying nonparametric statistical methods to corpus data. We show how to use ideas from permutation testing to answer linguistic questions related to morphological productivity and type richness. In particular, we study the use of the suffixes -ity and -ness in the 17th-century part of the Corpus of Early English Correspondence within the framework of historical sociolinguistics. Our hypothesis is that the productivity of -ity, as measured by type counts, is significantly low in letters written by women. To test such hypotheses, and to facilitate exploratory data analysis, we take the approach of computing accumulation curves for types and hapax legomena. We have developed an open source computer program which uses Monte Carlo sampling to compute the upper and lower bounds of these curves for one or more levels of statistical significance. By comparing the type accumulation from women’s letters with the bounds, we are able to confirm our hypothesis.

...read moreread less

28 citations

Journal Article•DOI•

Inducing involuntary musical imagery: An experimental study

[...]

Lassi A. Liikkanen¹, Lassi A. Liikkanen²•Institutions (2)

Helsinki Institute for Information Technology¹, Stanford University²

18 May 2012-Musicae Scientiae

TL;DR: In this paper, a cued-recall method was used to induce involuntary musical imagery (INMI) and delayed self-reports in a large sample of people, and the prevalence of the phenomenon was considerable.

...read moreread less

Abstract: It is still a mystery why we sometimes experience the repetition of memories in our minds. This phenomenon seems to be particularly prominent in music. We believe that present lack of knowledge relates to the lack of methods available for the study of this topic. To improve the understanding of involuntary musical imagery (INMI), this paper proposes a novel method to induce it in experimental settings. We report three experiments that were conducted to evaluate two research questions related to INMI: Can it be experimentally induced, and if so, which factors influence its emergence? Investigation particularly focused on how recent activation of musical memory might predict INMI. The questions were tested in single-trial experiments conducted over the internet. The experiments utilized a cued-recall method to induce INMI and delayed self-reports. Among a large sample of people, the prevalence of the phenomenon was considerable. When the familiarity with the stimuli was controlled for, inducing INMI experim...

...read moreread less

28 citations

Learning Bounded Tree-width Bayesian Networks using Integer Linear Programming

[...]

Pekka Parviainen¹, Hossein Farahani², Jens Lagergren²•Institutions (2)

Helsinki Institute for Information Technology¹, Royal Institute of Technology²

01 Jan 2014

TL;DR: In this paper, an integer linear program (ILP) is proposed for learning bounded tree-width Bayesian networks, which can be solved by an anytime algorithm which provides upper bounds to assess the quality of the found solutions.

...read moreread less

Abstract: In many applications one wants to compute conditional probabilities given a Bayesian network. This inference problem is NP-hard in general but becomes tractable when the network has low tree-width. Since the inference problem is common in many application areas, we provide a practical algorithm for learning bounded tree-width Bayesian networks. We cast this problem as an integer linear program (ILP). The program can be solved by an anytime algorithm which provides upper bounds to assess the quality of the found solutions. A key component of our program is a novel integer linear formulation for bounding tree-width of a graph. Our tests clearly indicate that our approach works in practice, as our implementation was able to nd an optimal or nearly optimal network for most of the data sets.

...read moreread less

28 citations

Collapse

Authors

Showing all 632 results

Name	H-index	Papers	Citations
Dimitri P. Bertsekas	94	332	85939
Olli Kallioniemi	90	353	42021
Heikki Mannila	72	295	26500
Jukka Corander	66	411	17220
Jaakko Kangasjärvi	62	146	17096
Aapo Hyvärinen	61	301	44146
Samuel Kaski	58	522	14180
Nadarajah Asokan	58	327	11947
Aristides Gionis	58	292	19300
Hannu Toivonen	56	192	19316
Nicola Zamboni	53	128	11397
Jorma Rissanen	52	151	22720
Tero Aittokallio	52	271	8689
Juha Veijola	52	261	19588
Juho Hamari	51	176	16631

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

38.6K papers, 1.3M citations

92% related

Carnegie Mellon University

104.3K papers, 5.9M citations

91% related

Facebook

10.9K papers, 570.1K citations

91% related

Performance

Metrics

1,967

Papers

76,126

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	1
2022	4
2021	85
2020	97
2019	140
2018	127