Pushing the boundaries of crowd-enabled databases with query-driven schema expansion
Citations
43 citations
Additional excerpts
...[12], [24], [27], [16])....
[...]
40 citations
38 citations
38 citations
28 citations
Cites background from "Pushing the boundaries of crowd-ena..."
...[91] explore how a stored movie collection can be extended with genre information extracted from social web sources, supplemented with input from crowdsourcing to train the extraction process....
[...]
References
12,443 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...Furthermore, we can show that approaches based on classification using metadata and LSI lead to surprisingly bad results (g-mean between 0.41 and 0.50), and show even worse accuracy than randomly applying labels....
[...]
...This is implemented by using Latent Semantic Indexing (LSI) [21] to generate a 100-dimensional “metadata space” from movie attributes like title, plot, main actors, directors, year, runtime, and country as recorded in IMDb....
[...]
10,696 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...Instead of relying on non-linear regression, we can use an SVM classifier [19]....
[...]
6,320 citations
"Pushing the boundaries of crowd-ena..." refers background in this paper
...A popular measure of classification performance in the presence of class imbalance is the g-mean measure [20], which is the geometric mean of sensitivity (accuracy on all movies truly belonging to the genre) and specificity (accuracy on all movies truly not belonging to the genre), As the g-mean punishes significant differences between sensitivity and specificity, the above naïve classifier would achieve 0% g-mean....
[...]
4,009 citations
"Pushing the boundaries of crowd-ena..." refers methods in this paper
...perceptual space, we suggest to use Support Vector Regression Machines (SVMs) [14], which are a highly flexible technique to perform non-linear regression and classification, and have been proven to be effective when dealing with perceptual data [15]....
[...]
3,773 citations