scispace - formally typeset
S

Samuel Kaski

Researcher at University of Manchester

Publications -  528
Citations -  15419

Samuel Kaski is an academic researcher from University of Manchester. The author has contributed to research in topics: Inference & Cluster analysis. The author has an hindex of 58, co-authored 522 publications receiving 14180 citations. Previous affiliations of Samuel Kaski include Helsinki University of Technology & University of Helsinki.

Papers
More filters
Journal ArticleDOI

Self organization of a massive document collection

TL;DR: A system that is able to organize vast document collections according to textual similarities based on the self-organizing map (SOM) algorithm, based on 500-dimensional vectors of stochastic figures obtained as random projections of weighted word histograms.
Journal ArticleDOI

WEBSOM - Self-Organizing Maps of Document Collections

TL;DR: Special consideration is given to the computation of very large document maps which is possible with general-purpose computers if the dimensionality of the word category histograms is first reduced with a random mapping method and if computationally efficient algorithms are used in computing the SOMs.
Proceedings ArticleDOI

Dimensionality reduction by random mapping: fast similarity computation for clustering

TL;DR: It is demonstrated that the document classification accuracy obtained after the dimensionality has been reduced using a random mapping method will be almost as good as the original accuracy if the final dimensionality is sufficiently large.

Bibliography of Self-Organizing Map SOM) Papers: 1998-2001 Addendum

TL;DR: This work has provided a keyword index to help finding articles of interest, and additionally a modern automatically constructed variant of a thematic index: a WEBSOM interface to the whole article collection of years 1981-2000.
Journal Article

Information Retrieval Perspective to Nonlinear Dimensionality Reduction for Data Visualization

TL;DR: A rigorous definition for a specific visualization task is given, resulting in quantifiable goodness measures and new visualization methods, and it is shown empirically that the unsupervised version outperforms existing unsuper supervised dimensionality reduction methods in the visualization task, and the supervised version outper performs existing supervised methods.