Home
/
Institutions
/
Helsinki Institute for Information Technology

Institution

Helsinki Institute for Information Technology

Facility•Espoo, Finland•

About: Helsinki Institute for Information Technology is a facility organization based out in Espoo, Finland. It is known for research contribution in the topics: Population & Bayesian network. The organization has 630 authors who have published 1962 publications receiving 63426 citations.

...read moreread less

Topics: Population, Bayesian network, The Internet, Mobile computing, Cluster analysis ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
1996

Papers

PDF

Open Access

More filters

Proceedings Article•

Polya-gamma augmentations for factor models

[...]

Arto Klami¹•Institutions (1)

Helsinki Institute for Information Technology¹

26 Nov 2014

TL;DR: This paper describes how Gibbs sampling and mean-eld variational approximation for various latent factor models can be implemented for these cases, presenting easy-to-implement and ecient inference schemas.

...read moreread less

Abstract: Bayesian inference for latent factor models, such as principal component and canonical correlation analysis, is easy for Gaussian likelihoods with conjugate priors using both Gibbs sampling and mean-eld variational approximation. For other likelihood potentials one needs to either resort to more complex sampling schemes or to specifying dedicated forms for variational lower bounds. Recently, however, it was shown that for specic likelihoods related to the logistic function it is possible to augment the joint density with auxiliary variables following a P olya-Gamma distribution, leading to closed-form updates for binary and over-dispersed count models. In this paper we describe how Gibbs sampling and mean-eld variational approximation for various latent factor models can be implemented for these cases, presenting easy-to-implement and ecient inference schemas.

...read moreread less

16 citations

Posted Content•

Fully Dynamic de Bruijn Graphs

[...]

Djamal Belazzougui, Travis Gagie¹, Travis Gagie², Veli Mäkinen¹, Veli Mäkinen², Marco Previtali³ - Show less +2 more•Institutions (3)

Helsinki Institute for Information Technology¹, University of Helsinki², University of Milano-Bicocca³

17 Jul 2016-arXiv: Data Structures and Algorithms

TL;DR: In this paper, a space and time-efficient fully dynamic implementation of de Bruijn graphs is presented, which can also support fixed-length jumbled pattern matching, and can be used with fixed length jumbled patterns.

...read moreread less

Abstract: We present a space- and time-efficient fully dynamic implementation de Bruijn graphs, which can also support fixed-length jumbled pattern matching.

...read moreread less

16 citations

Proceedings Article•DOI•

DiMaS: distributing multimedia on peer-to-peer file sharing networks

[...]

Tommo Reti¹, Risto Sarvas¹•Institutions (1)

Helsinki Institute for Information Technology¹

10 Oct 2004

TL;DR: DiMaS proves as a concept that it is possible to make a system for multimedia producing communities to publish their work on highly popular P2P networks and importantly, the system enables producers to insert content metadata, to manage intellectual property and usage rights, and to charge for the consumption.

...read moreread less

Abstract: This demonstration presents the Digital Content Distribution Management System (DiMaS). DiMaS proves as a concept that it is possible to make a system for multimedia producing communities to publish their work on highly popular P2P networks, and importantly, the system enables producers to insert content metadata, to manage intellectual property and usage rights, and to charge for the consumption. All this can be done without introducing another new content or metadata file format and a dedicated client application to read the format.

...read moreread less

16 citations

Journal Article•DOI•

Explaining a Weighted DAG with Few Paths for Solving Genome-Guided Multi-Assembly

[...]

Alexandru I. Tomescu¹, Travis Gagie¹, Alexandru Popa², Romeo Rizzi³, Anna Kuosmanen¹, Veli Mäkinen¹ - Show less +2 more•Institutions (3)

Helsinki Institute for Information Technology¹, Nazarbayev University², University of Verona³

01 Nov 2015-IEEE/ACM Transactions on Computational Biology and Bioinformatics

TL;DR: The approximability of this problem is studied, and a fully polynomial-time approximation scheme (FPTAS) is given for the case when the fitting function penalizes the maximum ratio between the weights of the arcs and their predicted coverage.

...read moreread less

Abstract: RNA-Seq technology offers new high-throughput ways for transcript identification and quantification based on short reads, and has recently attracted great interest. This is achieved by constructing a weighted DAG whose vertices stand for exons, and whose arcs stand for split alignments of the RNA-Seq reads to the exons. The task consists of finding a number of paths, together with their expression levels, which optimally explain the weights of the graph under various fitting functions, such as least sum of squared residuals. In (Tomescu et al. BMC Bioinformatics, 2013) we studied this genome-guided multi-assembly problem when the number of allowed solution paths was linear in the number of arcs. In this paper, we further refine this problem by asking for a bounded number $k$ of solution paths, which is the setting of most practical interest. We formulate this problem in very broad terms, and show that for many choices of the fitting function it becomes NP-hard. Nevertheless, we identify a natural graph parameter of a DAG $G$ , which we call arc-width and denote $\langle G\rangle$ , and give a dynamic programming algorithm running in time $O(W^k\langle G\rangle ^k(\langle G\rangle + k)n)$ , where $n$ is the number of vertices and $W$ is the maximum weight of $G$ . This implies that the problem is fixed-parameter tractable (FPT) in the parameters $W$ , $\langle G\rangle$ , and $k$ . We also show that the arc-width of DAGs constructed from simulated and real RNA-Seq reads is small in practice. Finally, we study the approximability of this problem, and, in particular, give a fully polynomial-time approximation scheme (FPTAS) for the case when the fitting function penalizes the maximum ratio between the weights of the arcs and their predicted coverage.

...read moreread less

16 citations

Proceedings Article•

Partial order MCMC for structure discovery in Bayesian networks

[...]

Teppo Niinimäki¹, Pekka Parviainen¹, Mikko Koivisto¹•Institutions (1)

Helsinki Institute for Information Technology¹

14 Jul 2011

TL;DR: In this article, a Markov chain Monte Carlo method for estimating posterior probabilities of structural features in Bayesian networks is presented, which draws samples from the posterior distribution of partial orders on the nodes.

...read moreread less

Abstract: We present a new Markov chain Monte Carlo method for estimating posterior probabilities of structural features in Bayesian networks. The method draws samples from the posterior distribution of partial orders on the nodes; for each sampled partial order, the conditional probabilities of interest are computed exactly. We give both analytical and empirical results that suggest the superiority of the new method compared to previous methods, which sample either directed acyclic graphs or linear orders on the nodes.

...read moreread less

15 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
…
166
167
168
169
170
171
172
…
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Authors

Showing all 632 results

Name	H-index	Papers	Citations
Dimitri P. Bertsekas	94	332	85939
Olli Kallioniemi	90	353	42021
Heikki Mannila	72	295	26500
Jukka Corander	66	411	17220
Jaakko Kangasjärvi	62	146	17096
Aapo Hyvärinen	61	301	44146
Samuel Kaski	58	522	14180
Nadarajah Asokan	58	327	11947
Aristides Gionis	58	292	19300
Hannu Toivonen	56	192	19316
Nicola Zamboni	53	128	11397
Jorma Rissanen	52	151	22720
Tero Aittokallio	52	271	8689
Juha Veijola	52	261	19588
Juho Hamari	51	176	16631

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

38.6K papers, 1.3M citations

92% related

Carnegie Mellon University

104.3K papers, 5.9M citations

91% related

Facebook

10.9K papers, 570.1K citations

91% related

Performance

Metrics

1,967

Papers

76,126

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	1
2022	4
2021	85
2020	97
2019	140
2018	127