User-aware page classification in a search engine
Citations
19 citations
10 citations
6 citations
Cites methods from "User-aware page classification in a..."
...The first experiments are described in Aires et al. (2004), more advanced ones in Aires et al. (2005) and Aires (forthcoming)....
[...]
...…is important to emphasize that both corpora were used to train classifiers with a set of shallow parsing features (inspired by Biber’s work (1988)) and lexical general-content words, thus comparing several machine learning techniques, as reported in Aires et al. (2005) and Aires (forthcoming)....
[...]
3 citations
Cites background from "User-aware page classification in a..."
...(Aires et al, 2005c) Aires....
[...]
...SIGIR, agosto de 2005, Salvador - Brasil, 8 p. (Aires et al, 2005b) Aires, R.; Santos, D.; Aluísio....
[...]
...Além disso, foram também dadas instruções sobre cada tipo de necessidade, com exemplos e contra-exemplos de textos (Aires et al. 2005b)....
[...]
...1 3 9 11.3.4 Uso de marcadores estilísticos para a classificação em necessidades de textos em outras línguas Em Aires et al (2005a) apresenta-se um experimento para a classificação de textos de direito em inglês em textos para leigos ou para especialistas....
[...]
...(Aires et al. 2005a) Aires, R.; Aluísio, A.; Santos....
[...]
2 citations
Cites background from "User-aware page classification in a..."
...Esta lista foi criada manualmente, mas em experiências futuras poder-se-ão usar técnicas mais sofisticadas para classificar as páginas Web [Aires et al, 2005] ....
[...]
...Terminam-se com algumas considerações sobre a utilidade de sistemas de resposta automática a perguntas e o estado da arte nesta área para o português....
[...]
References
5,506 citations
5,350 citations
"User-aware page classification in a..." refers methods in this paper
...SMO implements Platts [13] sequential minimal optimisation algorithm for training a support vector classifier using scaled polynomial kernels, transforming the output of SVM into probabilities by applying a standard sigmoid function that is not fitted to the data....
[...]
5,019 citations
2,891 citations
"User-aware page classification in a..." refers background in this paper
...For comparison, note that Biber s 481 texts amounted to a corpus with approximately 960,000 words, which is larger in number of words because Web texts tend to be smaller....
[...]
...These features, which are mainly closed lists, were inspired by those proposed by Biber [10] and Karlgren [8], but checked in grammars and textbooks for Portuguese....
[...]
...[10] Biber, D.: Variation across speech and writing....
[...]
...Karlgren concluded that most users used the interface as intended and many searched for documents in the genres the results could be expected to show up in. Biber [10] has studied English text variation using several variables, and found that texts vary along five dimensions....
[...]
...We have also trained a classification scheme for texts in English using: (i) a corpus with 200 texts extracted from www.findlaw.com; (ii) the algorithms J48, SMO and LMT and (iii) 52 features taken from Biber and Karlgren [1, 2] which are the original features for English that were adapted for Portuguese (Figure 1) plus 3 types of modals, 2 of negation, nominalizations, besides reflexive and possessive pronouns....
[...]
2,094 citations
"User-aware page classification in a..." refers methods in this paper
...[4] Broder, A. "A Taxonomy of Web Search", SIGIR Forum 36 (2), Fall 2002, p.3-10....
[...]
...Inspired by Broder [4] and previous work in detecting user s goals in Web search [5], we devised a user need typology from a qualitative analysis of the TodoBr logs....
[...]
...Inspired by Broder [4] and previous work in detecting users goals in Web search [5], we devised a user need typology from a qualitative analysis of the TodoBr logs....
[...]