Pushing the boundaries of crowd-enabled databases with query-driven schema expansion
Citations
2 citations
Cites methods from "Pushing the boundaries of crowd-ena..."
...Crowdsourcing has been used in database community for building hybrid humanmachine database systems [18, 19]....
[...]
2 citations
Cites background from "Pushing the boundaries of crowd-ena..."
...Generally, such hiring seeks intelligent information processing skills for numerous tasks, ranging from content annotation [1], information extraction [2], to more complex tasks like sentiment analysis [3] and crowd-enabled database retrieval [4]....
[...]
2 citations
Cites background or methods from "Pushing the boundaries of crowd-ena..."
...Our Perceptual Spaces introduced in [9] and [6] rely on a factor model using the following assumptions: Perceptual Spaces use the established assumption that item ratings in the Social Web are a result of a user’s preferences with respect to an item’s attributes [10]....
[...]
...In [9], we have shown that certain perceived properties (like the degree of funniness) can be made explicit with only minimal human input using crowdsourcing-based machine regression....
[...]
...As our experiments in [9] showed, quality of perceptual spaces increase with the involvement and activity of users: rating data obtained from a restaurant data set (where...
[...]
...In the following, we evaluate different review-based embeddings in comparison with our rating-based perceptual space [9] as a baseline....
[...]
2 citations
2 citations
References
12,443 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...Furthermore, we can show that approaches based on classification using metadata and LSI lead to surprisingly bad results (g-mean between 0.41 and 0.50), and show even worse accuracy than randomly applying labels....
[...]
...This is implemented by using Latent Semantic Indexing (LSI) [21] to generate a 100-dimensional “metadata space” from movie attributes like title, plot, main actors, directors, year, runtime, and country as recorded in IMDb....
[...]
10,696 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...Instead of relying on non-linear regression, we can use an SVM classifier [19]....
[...]
6,320 citations
"Pushing the boundaries of crowd-ena..." refers background in this paper
...A popular measure of classification performance in the presence of class imbalance is the g-mean measure [20], which is the geometric mean of sensitivity (accuracy on all movies truly belonging to the genre) and specificity (accuracy on all movies truly not belonging to the genre), As the g-mean punishes significant differences between sensitivity and specificity, the above naïve classifier would achieve 0% g-mean....
[...]
4,009 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...perceptual space, we suggest to use Support Vector Regression Machines (SVMs) [14], which are a highly flexible technique to perform non-linear regression and classification, and have been proven to be effective when dealing with perceptual data [15]....
[...]
3,773 citations