scispace - formally typeset
Proceedings ArticleDOI

Cosine similarity to determine similarity measure: Study case in online essay assessment

TLDR
This research implemented the weighting of Term Frequency - Inverse Document Frequency (TF-IDF) method and Cosine Similarity with the measuring degree concept of similarity terms in a document to rank the document weight that have closesness match level with expert's document.
Abstract
Development of technology in educational field brings the easier ways through the variety of facilitation for learning process, sharing files, giving assignment and assessment. Automated Essay Scoring (AES) is one of the development systems for determining a score automatically from text document source to facilitate the correction and scoring by utilizing applications that run on the computer. AES process is used to help the lecturers to score efficiently and effectively. Besides it can reduce the subjectivity scoring problem. However, implementation of AES depends on many factors and cases, such as language and mechanism of scoring process especially for essay scoring. A number of methods implemented for weighting the terms from document and reaching the solutions for handling comparative level between documents answer and expert's document still defined. In this research, we implemented the weighting of Term Frequency — Inverse Document Frequency (TF-IDF) method and Cosine Similarity with the measuring degree concept of similarity terms in a document. Tests carried out on a number of Indonesian text-based documents that have gone through the stage of pre-processing for data extraction purposes. This process results is in a ranking of the document weight that have closesness match level with expert's document.

read more

Citations
More filters
Proceedings ArticleDOI

Movie Recommender System Using Collaborative Filtering

TL;DR: To prove the effectiveness, K-NN algorithms and collaborative filtering are used to mainly focus on enhancing the accuracy of results as compared to content-based filtering, based on cosine similarity using k-nearest neighbor with the help of a collaborative filtering technique.
Journal ArticleDOI

An Efficient movie recommendation algorithm based on improved k-clique

TL;DR: To further improve accuracy in the recommendation system, the k-clique methodology used to analyze social networks is presented to be the guidance of this system and results show that the proposed methods improve more accuracy of the movie recommendation system than any other methods used in this experiment.
Journal ArticleDOI

Word Sense Disambiguation based on Context Selection using Knowledge-based Word Similarity

TL;DR: A novel knowledge-based word-sense disambiguation (WSD) system to find an effective way to filter out unnecessary information by using word similarity, and proposes a novel encoding method for word vector representation by considering the graphical semantic relationships from the lexical knowledge bases.
Proceedings ArticleDOI

Automatic Thai Subjective Examination using Cosine Similarity

TL;DR: In comparing the results of the proposed automatic subjective examination system for Thai language based on semantic similarity, applying cosine similarity techniques together with the applicable synonym, it was found that the scores produced from the system were similar to those obtained from the expert.
References
More filters
Journal ArticleDOI

Educational Data Mining: A Review of the State of the Art

TL;DR: The most relevant studies carried out in educational data mining to date are surveyed and the different groups of user, types of educational environments, and the data they provide are described.
Journal ArticleDOI

A Survey of Text Similarity Approaches

TL;DR: This survey discusses the existing works on text similarity through partitioning them into three approaches; String-based, Corpus-based and Knowledge-based similarities, and samples of combination between these similarities are presented.
Journal ArticleDOI

Review: Educational data mining: A survey and a data mining-based analysis of recent works

TL;DR: This review pursues a twofold goal, to preserve and enhance the chronicles of recent educational data mining (EDM) advances development, and provides an analysis of the EDM strengths, weakness, opportunities, and threats, whose factors represent, in a sense, future work to be fulfilled.
Journal ArticleDOI

Educational Data Mining and Learning Analytics: Applications to Constructionist Research.

TL;DR: The relevance of a set of approaches broadly called “educational data mining” or “learning analytics” to help provide a basis for quantitative research on constructionist learning which does not abandon the richness seen as essential by many researchers in that paradigm is investigated.
Proceedings ArticleDOI

Learning Semantic Similarity for Very Short Texts

TL;DR: The conclusion is made that the combination of word embeddings and tf-idf information might lead to a better model for semantic content within very short text fragments, which is a first step towards a hybrid method that combines the strength of dense distributed representations -- as opposed to sparse term matching -- with the strength to automatically reduce the impact of less informative terms.
Related Papers (5)