Institution

Helsinki Institute for Information Technology

Facility•Espoo, Finland•

About: Helsinki Institute for Information Technology is a facility organization based out in Espoo, Finland. It is known for research contribution in the topics: Population & Bayesian network. The organization has 630 authors who have published 1962 publications receiving 63426 citations.

...read moreread less

Topics: Population, Bayesian network, The Internet, Mobile computing, Cluster analysis ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Book Chapter•DOI•

Composite Repetition-Aware Data Structures

[...]

Djamal Belazzougui¹, Djamal Belazzougui², Fabio Cunial¹, Fabio Cunial², Travis Gagie², Travis Gagie¹, Nicola Prezza³, Mathieu Raffinot⁴ - Show less +4 more•Institutions (4)

University of Helsinki¹, Helsinki Institute for Information Technology², University of Udine³, Paris Diderot University⁴

29 Jun 2015

TL;DR: Two data structures are described whose size depends on multiple measures of repetition at once, and that provide competitive tradeoffs between the time for counting and reporting all the exact occurrences of a pattern, and the space taken by the structure.

...read moreread less

Abstract: In highly repetitive strings, like collections of genomes from the same species, distinct measures of repetition all grow sublinearly in the length of the text, and indexes targeted to such strings typically depend only on one of these measures. We describe two data structures whose size depends on multiple measures of repetition at once, and that provide competitive tradeoffs between the time for counting and reporting all the exact occurrences of a pattern, and the space taken by the structure. The key component of our constructions is the run-length encoded BWT (RLBWT), which takes space proportional to the number of BWT runs: rather than augmenting RLBWT with suffix array samples, we combine it with data structures from LZ77 indexes, which take space proportional to the number of LZ77 factors, and with the compact directed acyclic word graph (CDAWG), which takes space proportional to the number of extensions of maximal repeats. The combination of CDAWG and RLBWT enables also a new representation of the suffix tree, whose size depends again on the number of extensions of maximal repeats, and that is powerful enough to support matching statistics and constant-space traversal.

...read moreread less

74 citations

Proceedings Article•

Kernelized Bayesian Matrix Factorization

[...]

Mehmet G nen¹, Mehmet G nen², Suleiman A. Khan², Suleiman A. Khan¹, Samuel Kaski², Samuel Kaski³, Samuel Kaski¹ - Show less +3 more•Institutions (3)

Helsinki Institute for Information Technology¹, Aalto University², University of Helsinki³

16 Jun 2013

TL;DR: In this article, a fully conjugate probabilistic formulation of the kernelized matrix factorization problem is proposed, which enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches.

...read moreread less

Abstract: We extend kernelized matrix factorization with a fully Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernel functions have been introduced to matrix factorization to integrate side information about the rows and columns (e.g., objects and users in recommender systems), which is necessary for making out-of-matrix (i.e., cold start) predictions. We discuss specifically bipartite graph inference, where the output matrix is binary, but extensions to more general matrices are straightforward. We extend the state of the art in two key aspects: (i) A fully conjugate probabilistic formulation of the kernelized matrix factorization problem enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches. (ii) Multiple side information sources are included, treated as different kernels in multiple kernel learning that additionally reveals which side information sources are informative. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. We then show that our framework can also be used for solving multilabel learning problems by considering samples and labels as the two domains where matrix factorization operates on. Our algorithm obtains the lowest Hamming loss values on 10 out of 14 multilabel classification data sets compared to five state-of-the-art multilabel learning algorithms.

...read moreread less

74 citations

Proceedings Article•

Multiview Triplet Embedding: Learning Attributes in Multiple Maps

[...]

Ehsan Amid¹, Antti Ukkonen²•Institutions (2)

Helsinki Institute for Information Technology¹, Finnish Institute of Occupational Health²

06 Jul 2015

TL;DR: The Multiview Triplet Embedding (MVTE) algorithm is proposed that produces a number of low-dimensional maps, each corresponding to one of the hidden attributes in a set of relative distance judgments in the form of triplets.

...read moreread less

Abstract: For humans, it is usually easier to make statements about the similarity of objects in relative, rather than absolute terms. Moreover, subjective comparisons of objects can be based on a number of different and independent attributes. For example, objects can be compared based on their shape, color, etc. In this paper, we consider the problem of uncovering these hidden attributes given a set of relative distance judgments in the form of triplets. The attribute that was used to generate a particular triplet in this set is unknown. Such data occurs, e.g., in crowdsourcing applications where the triplets are collected from a large group of workers. We propose the Multiview Triplet Embedding (MVTE) algorithm that produces a number of low-dimensional maps, each corresponding to one of the hidden attributes. The method can be used to assess how many different attributes were used to create the triplets, as well as to assess the difficulty of a distance comparison task, and find objects that have multiple interpretations in relation to the other objects.

...read moreread less

73 citations

Journal Article•DOI•

Technical Section: Exploring the use of handheld AR for outdoor navigation

[...]

Andreas Dünser¹, Mark Billinghurst¹, James Wen¹, Ville Lehtinen², Antti Nurminen² - Show less +1 more•Institutions (2)

University of Canterbury¹, Helsinki Institute for Information Technology²

01 Dec 2012-Computers & Graphics

TL;DR: A user study comparing navigation with information typically provided by currently available handheld AR browsers, to navigation with a digital map, and a combined map and AR condition found no overall difference in task completion time, but found evidence that AR browsers are less useful for navigation in some environment conditions.

...read moreread less

73 citations

Journal Article•

Balancing Audience and Privacy Tensions on Social Network Sites: Strategies of Highly Engaged Users

[...]

Jessica Vitak¹, Stacy Blasiola², Sameer Patil³, Eden Litt⁴•Institutions (4)

University of Maryland College of Information Studies¹, University of Illinois at Chicago², Helsinki Institute for Information Technology³, Northwestern University⁴

14 May 2015-International Journal of Communication

TL;DR: The authors conducted a qualitative study of highly engaged Facebook users to understand how people conceptualize friendship online as well as how perceived audience affects privacy concerns and privacy management strategies and found that most participants in this sample still engaged in some degree of self-censorship.

...read moreread less

Abstract: As social network sites grow and diversify in both users and content, tensions between users’ audience composition and their disclosure practices become more prevalent. Users must navigate these spaces carefully to reap relational benefits while ensuring content is not shared with unintended audiences. Through a qualitative study of highly engaged Facebook users, this study provides insight into how people conceptualize friendship online as well as how perceived audience affects privacy concerns and privacy management strategies. Findings suggest an increasingly complex relationship between these variables, fueled by collapsing contexts and invisible audiences. Although a diverse range of strategies are available to manage privacy, most participants in this sample still engaged in some degree of self-censorship.

...read moreread less

72 citations

Collapse

Authors

Showing all 632 results

Name	H-index	Papers	Citations
Dimitri P. Bertsekas	94	332	85939
Olli Kallioniemi	90	353	42021
Heikki Mannila	72	295	26500
Jukka Corander	66	411	17220
Jaakko Kangasjärvi	62	146	17096
Aapo Hyvärinen	61	301	44146
Samuel Kaski	58	522	14180
Nadarajah Asokan	58	327	11947
Aristides Gionis	58	292	19300
Hannu Toivonen	56	192	19316
Nicola Zamboni	53	128	11397
Jorma Rissanen	52	151	22720
Tero Aittokallio	52	271	8689
Juha Veijola	52	261	19588
Juho Hamari	51	176	16631

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

38.6K papers, 1.3M citations

92% related

Carnegie Mellon University

104.3K papers, 5.9M citations

91% related

Facebook

10.9K papers, 570.1K citations

91% related

Performance

Metrics

1,967

Papers

76,126

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	1
2022	4
2021	85
2020	97
2019	140
2018	127