Institution

Helsinki Institute for Information Technology

Facility•Espoo, Finland

About: Helsinki Institute for Information Technology is a facility based in Espoo, Finland. It is known for research contributions on the topics Population and Bayesian network. The organization has 630 authors who have published 1962 publications, which have received 63426 citations.

Topics: Population, Bayesian network, Mobile computing, The Internet, Approximation algorithm

Papers

Proceedings Article

Group-sparse Embeddings in Collective Matrix Factorization

Arto Klami¹, Guillaume Bouchard², Abhishek Tripathi²•Institutions (2)

Helsinki Institute for Information Technology¹, Xerox²

01 Apr 2014

TL;DR: This work presents a novel CMF solution that allows each of the matrices to have a separate low-rank structure that is independent of the other matrices, as well as structures that are shared only by a subset of them.

Abstract: CMF is a technique for simultaneously learning low-rank representations based on a collection of matrices with shared entities. A typical example is the joint modeling of user-item, item-property, and user-feature matrices in a recommender system. The key idea in CMF is that the embeddings are shared across the matrices, which enables transferring information between them. The existing solutions, however, break down when the individual matrices have low-rank structure not shared with others. In this work we present a novel CMF solution that allows each of the matrices to have a separate low-rank structure that is independent of the other matrices, as well as structures that are shared only by a subset of them. We compare MAP and variational Bayesian solutions based on alternating optimization algorithms and show that the model automatically infers the nature of each factor using group-wise sparsity. Our approach supports in a principled way continuous, binary and count observations and is efficient for sparse matrices involving missing data. We illustrate the solution on a number of examples, focusing in particular on an interesting use-case of augmented multi-view learning.
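The shared-embedding idea in this abstract can be illustrated with a minimal sketch. It is plain collective matrix factorization fitted by stochastic gradient descent, not the group-sparse Bayesian model of the paper. The toy matrices, the function names, and all hyperparameters are assumptions chosen for illustration. `None` marks a missing entry.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def loss(X, Y, U, V, W):
    """Squared error over the observed cells of both matrices."""
    total = 0.0
    for i, row in enumerate(X):
        for j, x in enumerate(row):
            if x is not None:
                total += (dot(U[i], V[j]) - x) ** 2
    for j, row in enumerate(Y):
        for k, y in enumerate(row):
            if y is not None:
                total += (dot(V[j], W[k]) - y) ** 2
    return total

def init_factors(X, Y, rank=2, seed=0):
    """Small random embeddings for users (U), items (V), properties (W)."""
    rng = random.Random(seed)
    make = lambda n: [[rng.gauss(0, 0.1) for _ in range(rank)] for _ in range(n)]
    return make(len(X)), make(len(Y)), make(len(Y[0]))

def fit(X, Y, U, V, W, steps=5000, lr=0.05, reg=0.01, seed=1):
    """SGD over observed cells; V appears in both X ~ U V^T and Y ~ V W^T."""
    rng = random.Random(seed)
    cells = [(U, V, X, i, j) for i, r in enumerate(X)
             for j, x in enumerate(r) if x is not None]
    cells += [(V, W, Y, j, k) for j, r in enumerate(Y)
              for k, y in enumerate(r) if y is not None]
    for _ in range(steps):
        A, B, M, a, b = rng.choice(cells)
        err = dot(A[a], B[b]) - M[a][b]
        for r in range(len(A[a])):
            pa, pb = A[a][r], B[b][r]
            A[a][r] -= lr * (err * pb + reg * pa)
            B[b][r] -= lr * (err * pa + reg * pb)

# Hypothetical toy data: 2 users x 3 items, and 3 items x 2 properties.
X = [[1.0, 0.0, None], [None, 1.0, 1.0]]
Y = [[1.0, 0.0], [0.0, 1.0], [None, 1.0]]

U, V, W = init_factors(X, Y)
before = loss(X, Y, U, V, W)
fit(X, Y, U, V, W)
after = loss(X, Y, U, V, W)
```

Because the item embeddings `V` are updated from both matrices, information in the item-property matrix can influence predictions for missing user-item cells.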

44 citations

Journal Article

Seasonal Variation in Genome-Wide DNA Methylation Patterns and the Onset of Seasonal Timing of Reproduction in Great Tits

Heidi M. Viitaniemi¹, Irene Verhagen, Marcel E. Visser, Antti Honkela¹,², Kees van Oers, Arild Husby¹,³,⁴•Institutions (4)

University of Helsinki¹, Helsinki Institute for Information Technology², Uppsala University³, Norwegian University of Science and Technology⁴

01 Mar 2019-Genome Biology and Evolution

TL;DR: Reduced representation bisulfite sequencing on red blood cell derived DNA showed genome-wide temporal changes in more than 40,000 of the 522,643 CpG sites examined. Sites that showed a temporal and treatment-specific response in DNA methylation are candidate sites of interest for future studies trying to understand the link between DNA methylation patterns and timing of reproduction.

Abstract: In seasonal environments, timing of reproduction is a trait with important fitness consequences, but we know little about the molecular mechanisms that underlie the variation in this trait. Recentl ...

44 citations

Journal Article

Locating potential enhancer elements by comparative genomics using the EEL software.

Kimmo Palin¹, Jussi Taipale¹, Esko Ukkonen²•Institutions (2)

University of Helsinki¹, Helsinki Institute for Information Technology²

01 Jan 2006-Nature Protocols

TL;DR: EEL will predict the location and structure of conserved enhancers after being provided with two orthologous DNA sequences and binding specificity matrices for the transcription factors (TFs) that are expected to contribute to the function of the enhancers to be identified.

Abstract: This protocol describes the use of Enhancer Element Locator (EEL), a computer program that was designed to locate distal enhancer elements in long mammalian sequences. EEL will predict the location and structure of conserved enhancers after being provided with two orthologous DNA sequences and binding specificity matrices for the transcription factors (TFs) that are expected to contribute to the function of the enhancers to be identified. The freely available EEL software can analyze two 1-Mb sequences with 100 TF motifs in about 15 min on a modern Windows, Linux or Mac computer. The output provides several hypotheses about enhancer location and structure for further evaluation by an expert on enhancer function.

44 citations

Journal Article

Liquid-chromatography retention order prediction for metabolite identification.

Eric Bach¹, Sandor Szedmak¹, Céline Brouard¹, Sebastian Böcker², Juho Rousu¹•Institutions (2)

Helsinki Institute for Information Technology¹, University of Jena²

01 Sep 2018

TL;DR: This work presents a machine learning method for predicting the retention order of molecules; that is, the order in which molecules elute from the LC column, and shows that retention order is much better conserved between instruments than retention time.

Abstract: Motivation: Liquid Chromatography (LC) followed by tandem Mass Spectrometry (MS/MS) is one of the predominant methods for metabolite identification. In recent years, machine learning has started to transform the analysis of tandem mass spectra and the identification of small molecules. In contrast, LC data is rarely used to improve metabolite identification, despite numerous published methods for retention time prediction using machine learning. Results: We present a machine learning method for predicting the retention order of molecules; that is, the order in which molecules elute from the LC column. Our method has important advantages over previous approaches: We show that retention order is much better conserved between instruments than retention time. To this end, our method can be trained using retention time measurements from different LC systems and configurations without tedious pre-processing, significantly increasing the amount of available training data. Our experiments demonstrate that retention order prediction is an effective way to learn retention behaviour of molecules from heterogeneous retention time data. Finally, we demonstrate how retention order prediction and MS/MS-based scores can be combined for more accurate metabolite identifications when analyzing a complete LC-MS/MS run. Availability and implementation: Implementation of the method is available at https://version.aalto.fi/gitlab/bache1/retention_order_prediction.git.
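The claim that retention times from different LC systems can be pooled without pre-processing can be illustrated with a small sketch: each system's measurements are converted into pairwise "elutes before" relations, which are comparable across systems even when the absolute times are not. This is only an illustration of the data preparation idea, not the authors' implementation. The function name `order_pairs`, the molecule labels, and the retention times are hypothetical.

```python
from itertools import combinations

def order_pairs(measurements):
    """Convert one system's retention times into (earlier, later) molecule pairs."""
    pairs = set()
    for (mol_a, rt_a), (mol_b, rt_b) in combinations(measurements.items(), 2):
        if rt_a < rt_b:
            pairs.add((mol_a, mol_b))
        elif rt_b < rt_a:
            pairs.add((mol_b, mol_a))
        # Equal retention times carry no order information and are skipped.
    return pairs

# Hypothetical retention times in minutes, measured on two different LC systems.
system_a = {"mol1": 1.2, "mol2": 3.4, "mol3": 7.9}
system_b = {"mol1": 0.8, "mol2": 2.1, "mol3": 4.0, "mol4": 5.5}

# Pairs from both systems are pooled directly; no rescaling of times is needed.
training_pairs = order_pairs(system_a) | order_pairs(system_b)
```

In this toy data the absolute times differ between the two systems, yet every order pair derived from `system_a` also appears among those derived from `system_b`.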

44 citations

Journal Article

MDL Denoising Revisited

Teemu Roos¹, Petri Myllymäki¹, Jorma Rissanen¹•Institutions (1)

Helsinki Institute for Information Technology¹

01 Sep 2009-IEEE Transactions on Signal Processing

TL;DR: This work shows that the denoising problem can be reformulated as a clustering problem, where the goal is to obtain separate clusters for informative and noninformative wavelet coefficients, respectively, and suggests two refinements, adding a code-length for the model index, and extending the model in order to account for subband-dependent coefficient distributions.

Abstract: We refine and extend an earlier minimum description length (MDL) denoising criterion for wavelet-based denoising. We start by showing that the denoising problem can be reformulated as a clustering problem, where the goal is to obtain separate clusters for informative and noninformative wavelet coefficients, respectively. This suggests two refinements, adding a code-length for the model index, and extending the model in order to account for subband-dependent coefficient distributions. A third refinement is the derivation of soft thresholding inspired by predictive universal coding with weighted mixtures. We propose a practical method incorporating all three refinements, which is shown to achieve good performance and robustness in denoising both artificial and natural signals.
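For readers unfamiliar with soft thresholding, the generic operator can be sketched as follows. It shrinks each coefficient toward zero by a threshold and zeroes those whose magnitude falls below it. This is the textbook operator only; the MDL-derived rule and the weighted-mixture derivation in the paper are not reproduced here, and the threshold value and coefficients are hypothetical.

```python
def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; small coefficients become 0."""
    out = []
    for c in coeffs:
        mag = abs(c) - t
        if mag <= 0:
            out.append(0.0)  # treated as noninformative
        else:
            out.append(mag if c > 0 else -mag)  # keep the sign, reduce magnitude
    return out

# Hypothetical wavelet coefficients and threshold.
denoised = soft_threshold([3.0, -0.5, -2.0, 0.2], 1.0)
```

Hard thresholding would instead keep the surviving coefficients unchanged; the soft variant also shrinks them by the threshold.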

44 citations

Authors

Showing all 632 results

Name	H-index	Papers	Citations
Dimitri P. Bertsekas	94	332	85939
Olli Kallioniemi	90	353	42021
Heikki Mannila	72	295	26500
Jukka Corander	66	411	17220
Jaakko Kangasjärvi	62	146	17096
Aapo Hyvärinen	61	301	44146
Samuel Kaski	58	522	14180
Nadarajah Asokan	58	327	11947
Aristides Gionis	58	292	19300
Hannu Toivonen	56	192	19316
Nicola Zamboni	53	128	11397
Jorma Rissanen	52	151	22720
Tero Aittokallio	52	271	8689
Juha Veijola	52	261	19588
Juho Hamari	51	176	16631

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

93% related

Microsoft

86.9K papers, 4.1M citations

38.6K papers, 1.3M citations

92% related

Carnegie Mellon University

104.3K papers, 5.9M citations

91% related

Facebook

10.9K papers, 570.1K citations

91% related

Performance

Metrics

Papers: 1,967

Citations: 76,126

No. of papers from the Institution in previous years
Year	Papers
2023	1
2022	4
2021	85
2020	97
2019	140
2018	127