scispace - formally typeset
P

Paul Cook

Researcher at University of New Brunswick

Publications -  74
Citations -  3276

Paul Cook is an academic researcher from University of New Brunswick. The author has contributed to research in topics: Word (computer architecture) & SemEval. The author has an hindex of 26, co-authored 74 publications receiving 2938 citations. Previous affiliations of Paul Cook include University of Toronto & University of Melbourne.

Papers
More filters
Proceedings Article

Geolocation Prediction in Social Media Data by Finding Location Indicative Words

TL;DR: This paper shows that an information gain ratiobased approach surpasses other methods at LIW selection, outperforming state-of-the-art geolocation prediction methods by 10.6% in accuracy and reducing the mean and median of prediction error distance on a public dataset.
Journal ArticleDOI

Lexical normalization for social media text

TL;DR: This article targets out-of-vocabulary words in short text messages and proposes a method for identifying and normalizing lexical variants, which achieves state- of-the-art performance over an SMS corpus and a novel dataset based on Twitter.
Proceedings Article

Word Sense Induction for Novel Sense Detection

TL;DR: This work applies topic modelling to automatically induce word senses of a target word, and demonstrates that the proposed model can be used to automatically detect words with emergent novel senses, as well as token occurrences of those senses.
Proceedings ArticleDOI

An Unsupervised Model for Text Message Normalization

TL;DR: An unsupervised noisy-channel model for text message normalization is constructed on a sample of creative, non-standard text message word forms to determine frequent word formation processes in texting language.
Proceedings ArticleDOI

A Word Embedding Approach to Predicting the Compositionality of Multiword Expressions

TL;DR: Experimental results show that, in combination with a back-off method based on string similarity, word embeddings outperform a method using count-based distributional similarity.