Institution

Helsinki Institute for Information Technology

Facility•Espoo, Finland•

About: Helsinki Institute for Information Technology is a facility organization based out in Espoo, Finland. It is known for research contribution in the topics: Population & Bayesian network. The organization has 630 authors who have published 1962 publications receiving 63426 citations.

...read moreread less

Topics: Population, Bayesian network, The Internet, Mobile computing, Cluster analysis ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Information Retrieval by Inferring Implicit Queries from Eye Movements

[...]

David R. Hardoon¹, John Shawe-Taylor¹, Antti Ajanki², Kai Puolamäki², Samuel Kaski² - Show less +1 more•Institutions (2)

University College London¹, Helsinki Institute for Information Technology²

11 Mar 2007

TL;DR: A new search strategy, in which the information retrieval (IR) query is inferred from eye movements measured when the user is reading text during an IR task, such that relevance predictions for a large set of unseen documents are ranked significantly better than by random guessing.

...read moreread less

Abstract: We introduce a new search strategy, in which the information retrieval (IR) query is inferred from eye movements measured when the user is reading text during an IR task. In training phase, we know the users' interest, that is, the relevance of training documents. We learn a predictor that produces a "query" given the eye movements; the target of learning is an "optimal" query that is computed based on the known relevance of the training documents. Assuming the predictor is universal with respect to the users' interests, it can also be applied to infer the implicit query when we have no prior knowledge of the users' interests. The result of an empirical study is that it is possible to learn the implicit query from a small set of read documents, such that relevance predictions for a large set of unseen documents are ranked significantly better than by random guessing.

...read moreread less

33 citations

Proceedings Article•DOI•

A Scalable Topic-Based Open Source Search Engine

[...]

Wray Buntine¹, Jaakko Löfström¹, Jukka Perkiö¹, Sami Perttu¹, Vladimir Poroshin¹, Tomi Silander¹, Henry Tirri¹, Antti Tuominen¹, Ville Tuulos¹ - Show less +5 more•Institutions (1)

Helsinki Institute for Information Technology¹

20 Sep 2004

TL;DR: This paper outlines a scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.

...read moreread less

Abstract: Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.

...read moreread less

33 citations

Proceedings Article•DOI•

Q-learning algorithms for optimal stopping based on least squares

[...]

Huizhen Yu¹, Dimitri P. Bertsekas¹•Institutions (1)

Helsinki Institute for Information Technology¹

02 Jul 2007

TL;DR: This work considers the solution of discounted optimal stopping problems using linear function approximation methods and proposes alternative algorithms, which are based on projected value iteration ideas and least squares, which prove the convergence of some of these algorithms.

...read moreread less

Abstract: We consider the solution of discounted optimal stopping problems using linear function approximation methods. A Q-learning algorithm for such problems, proposed by Tsitsiklis and Van Roy, is based on the method of temporal differences and stochastic approximation. We propose alternative algorithms, which are based on projected value iteration ideas and least squares. We prove the convergence of some of these algorithms and discuss their properties.

...read moreread less

33 citations

Journal Article•DOI•

Subgraph queries by context-free grammars.

[...]

Petteri Sevon¹, Lauri Eronen•Institutions (1)

Helsinki Institute for Information Technology¹

25 Aug 2008-Journal of Integrative Bioinformatics

TL;DR: The results indicate that parsing the connection subgraph directly is much more effective than parsing individual paths separately, and it is shown that using a bidirectional parsing algorithm, in most cases, allows for searching twice as long paths as using a unidirectional search strategy.

...read moreread less

Abstract: We describe a method for querying vertex- and edge-labeled graphs using context-free grammars to specify the class of interesting paths. We introduce a novel problem, finding the connection subgraph induced by the set of matching paths between given two vertices or two sets of vertices. Such a subgraph provides a concise summary of the relationship between the vertices. We also present novel algorithms for parsing subgraphs directly without enumerating all the individual paths. We evaluate experimentally the presented parsing algorithms on a set of real graphs derived from publicly available biomedical databases and on randomly generated graphs. The results indicate that parsing the connection subgraph directly is much more effective than parsing individual paths separately. Furthermore, we show that using a bidirectional parsing algorithm, in most cases, allows for searching twice as long paths as using a unidirectional search strategy.

...read moreread less

33 citations

Journal Article•DOI•

Fast motif matching revisited: high-order PWMs, SNPs and indels

[...]

Janne H. Korhonen¹, Janne H. Korhonen², Janne H. Korhonen³, Kimmo Palin², Jussi Taipale², Esko Ukkonen³, Esko Ukkonen² - Show less +3 more•Institutions (3)

Reykjavík University¹, University of Helsinki², Helsinki Institute for Information Technology³

22 Dec 2016-Bioinformatics

TL;DR: This work formalizes a framework based on high‐order position weight matrices for generic representation of motif models with dinucleotide or general q‐mer dependencies, and adapt fast PWM matching algorithms to the high‐ order PWM framework, and shows how to incorporate different types of sequence variants, such as SNPs and indels, and their combined effects into efficient PWM matches.

...read moreread less

Abstract: Motivation While the position weight matrix (PWM) is the most popular model for sequence motifs, there is growing evidence of the usefulness of more advanced models such as first-order Markov representations, and such models are also becoming available in well-known motif databases. There has been lots of research of how to learn these models from training data but the problem of predicting putative sites of the learned motifs by matching the model against new sequences has been given less attention. Moreover, motif site analysis is often concerned about how different variants in the sequence affect the sites. So far, though, the corresponding efficient software tools for motif matching have been lacking. Results We develop fast motif matching algorithms for the aforementioned tasks. First, we formalize a framework based on high-order position weight matrices for generic representation of motif models with dinucleotide or general q -mer dependencies, and adapt fast PWM matching algorithms to the high-order PWM framework. Second, we show how to incorporate different types of sequence variants , such as SNPs and indels, and their combined effects into efficient PWM matching workflows. Benchmark results show that our algorithms perform well in practice on genome-sized sequence sets and are for multiple motif search much faster than the basic sliding window algorithm. Availability and Implementation Implementations are available as a part of the MOODS software package under the GNU General Public License v3.0 and the Biopython license ( http://www.cs.helsinki.fi/group/pssmfind ). Contact janne.h.korhonen@gmail.com.

...read moreread less

33 citations

Collapse

Authors

Showing all 632 results

Name	H-index	Papers	Citations
Dimitri P. Bertsekas	94	332	85939
Olli Kallioniemi	90	353	42021
Heikki Mannila	72	295	26500
Jukka Corander	66	411	17220
Jaakko Kangasjärvi	62	146	17096
Aapo Hyvärinen	61	301	44146
Samuel Kaski	58	522	14180
Nadarajah Asokan	58	327	11947
Aristides Gionis	58	292	19300
Hannu Toivonen	56	192	19316
Nicola Zamboni	53	128	11397
Jorma Rissanen	52	151	22720
Tero Aittokallio	52	271	8689
Juha Veijola	52	261	19588
Juho Hamari	51	176	16631

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

38.6K papers, 1.3M citations

92% related

Carnegie Mellon University

104.3K papers, 5.9M citations

91% related

Facebook

10.9K papers, 570.1K citations

91% related

Performance

Metrics

1,967

Papers

76,126

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	1
2022	4
2021	85
2020	97
2019	140
2018	127