scispace - formally typeset
J

Joseph D. Prusa

Researcher at Florida Atlantic University

Publications -  26
Citations -  845

Joseph D. Prusa is an academic researcher from Florida Atlantic University. The author has contributed to research in topics: Sentiment analysis & Feature selection. The author has an hindex of 11, co-authored 26 publications receiving 568 citations.

Papers
More filters
Journal ArticleDOI

Survey of review spam detection using machine learning techniques

TL;DR: A strong and comprehensive comparative study of current research on detecting review spam using various machine learning techniques and to devise methodology for conducting further investigation is provided.
Proceedings ArticleDOI

Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data

TL;DR: The experimental results show that Random Undersampling significantly improves classification performance in comparison to not using any data sampling, and there is no significant difference between selecting a 50:50 or 35:65 post-sampling class distribution ratio.
Journal ArticleDOI

Improving deep neural network design with new text data representations

TL;DR: This paper proposes a new method of creating character-level representations of text to reduce the computational costs associated with training a deep convolutional neural network and demonstrates that the method of character embedding greatly reduces training time and memory use, while significantly improving classification performance.
Proceedings Article

Impact of Feature Selection Techniques for Tweet Sentiment Classification.

TL;DR: This work extensively studies the impact of ten filter-based feature selection techniques on classification performance, using ten feature subset sizes and four different learners to demonstrate that feature selection can significantly improve classification performance in comparison to not using feature selection.
Proceedings ArticleDOI

The Effect of Dataset Size on Training Tweet Sentiment Classifiers

TL;DR: This is the first study to investigate how learners scale in respect to dataset size with results verified using statistical tests and multiple models trained for each learner and dataset size.