
Showing papers by "Patrick Haffner published in 2006"


Journal ArticleDOI
Patrick Haffner1
TL;DR: This paper provides an original and unified presentation of these algorithms within the framework of regularized and large margin linear classifiers, reviews some available optimization techniques, and offers practical solutions to scaling issues.

46 citations


Patent
20 Apr 2006
TL;DR: In this article, a method for identifying traffic to an application is described, comprising monitoring communication traffic in a network, identifying data from the communication traffic content, and constructing a model that maps the communication traffic to an application based on the identified data.
Abstract: A method for identifying traffic to an application including the steps of monitoring communication traffic in a network, identifying data from communication traffic content, and constructing a model for mapping the communication traffic for an application derived from data identified from the communication traffic content is described. A related system and computer readable medium for performing the method is also described. The described method and system has utility in a wide array of networks including IP networks.

31 citations


01 Jan 2006
TL;DR: AT&T participated in one evaluation task at TRECVID 2009, the content-based copy detection task, and submitted three runs: one for the NoFA (no false alarm) profile and two for the balanced profile.

18 citations


Proceedings ArticleDOI
Patrick Haffner1
25 Jun 2006
TL;DR: A new method based on transposition is proposed to speed up this computation on sparse data: instead of computing dot-products over sparse feature vectors, the method incrementally merges lists of training examples and minimizes access to the data.
Abstract: Kernel-based learning algorithms, such as Support Vector Machines (SVMs) or Perceptron, often rely on sequential optimization where a few examples are added at each iteration. Updating the kernel matrix usually requires matrix-vector multiplications. We propose a new method based on transposition to speed up this computation on sparse data. Instead of dot-products over sparse feature vectors, our computation incrementally merges lists of training examples and minimizes access to the data. Caching and shrinking are also optimized for sparsity. On very large natural language tasks (tagging, translation, text classification) with sparse feature representations, a 20 to 80-fold speedup over LIBSVM is observed using the same SMO algorithm. Theory and experiments explain what type of sparsity structure is needed for this approach to work, and why its adaptation to Maxent sequential optimization is inefficient.
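The transposition idea in the abstract can be sketched in a few lines: store the training set as an inverted index (feature → list of (example, value) postings), so that a linear-kernel row against all training examples is computed by merging only the posting lists of the new vector's non-zero features. This is an illustrative sketch (all names are assumptions, not the paper's implementation):

```python
from collections import defaultdict

def transpose(examples):
    """Build an inverted index: feature -> list of (example_id, value)."""
    index = defaultdict(list)
    for i, x in enumerate(examples):
        for f, v in x.items():
            index[f].append((i, v))
    return index

def linear_kernel_row(x, index, n_examples):
    """Dot products of sparse vector x against all training examples,
    obtained by merging the posting lists of x's non-zero features."""
    result = [0.0] * n_examples
    for f, v in x.items():
        for i, w in index.get(f, ()):  # touch only examples sharing feature f
            result[i] += v * w
    return result

examples = [{"a": 1.0, "b": 2.0}, {"b": 3.0}, {"c": 1.0}]
index = transpose(examples)
print(linear_kernel_row({"b": 1.0, "c": 2.0}, index, len(examples)))
# -> [2.0, 3.0, 2.0]
```

The key property is that work is proportional to the postings of the query's features rather than to the full kernel matrix, which is why the approach pays off only for sufficiently sparse data, as the paper's analysis discusses.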

13 citations


01 Jan 2006
TL;DR: A novel approach to machine translation is presented that uses a maximum entropy model for parameter estimation; its performance is compared to that of the finite-state translation model on the IWSLT Chinese-English data sets.
Abstract: In this paper, we present our system for statistical machine translation that is based on weighted finite-state transducers. We describe the construction of the transducer, the estimation of the weights, acquisition of phrases (locally ordered tokens) and the mechanism we use for global reordering. We also present a novel approach to machine translation that uses a maximum entropy model for parameter estimation and contrast its performance to the finite-state translation model on the IWSLT Chinese-English data sets.

4 citations


Patent
Patrick Haffner1
31 Dec 2006
TL;DR: In this article, a method and apparatus based on transposition to speed up learning computations on sparse data are disclosed, where the method receives a support vector comprising at least one feature represented by one non-zero entry.
Abstract: A method and apparatus based on transposition to speed up learning computations on sparse data are disclosed. For example, the method receives a support vector comprising at least one feature represented by one non-zero entry. The method then identifies at least one column within a matrix with non-zero entries, wherein the at least one column is identified in accordance with the at least one feature of the support vector. The method then performs kernel computations using successive list merging on the at least one identified column of the matrix and the support vector to derive a result vector, wherein the result vector is used in a data learning function.

3 citations


Journal ArticleDOI
TL;DR: The main modifications were changing the dependent variable in the training set to account for multiple PEs per patient, and incorporating neighborhood information through augmentation of the set of predictor variables, which resulted in measurable predictive improvement.
Abstract: Task 1 of the 2006 KDD Challenge Cup required classification of pulmonary embolisms (PEs) using variables derived from computed tomography angiography. We present our approach to the challenge and justification for our choices. We used boosted trees to perform the main classification task, but modified the algorithm to address idiosyncrasies of the scoring criteria. The two main modifications were: 1) changing the dependent variable in the training set to account for multiple PEs per patient, and 2) incorporating neighborhood information through augmentation of the set of predictor variables. Both of these resulted in measurable predictive improvement. In addition, we discuss a statistically based method for setting the classification threshold.
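The second modification above, incorporating neighborhood information by augmenting the predictor variables, can be sketched as follows. This is a hypothetical illustration, not the authors' code: each candidate's feature vector is extended with the mean features of its k nearest neighbors (function names, k, and the Euclidean distance metric are all assumptions):

```python
import math

def augment_with_neighbors(features, coords, k=2):
    """Append the mean feature vector of each candidate's k nearest
    neighbors (Euclidean distance on coords) to its own features."""
    n = len(features)
    augmented = []
    for i in range(n):
        # Rank all other candidates by distance to candidate i.
        dists = sorted(
            (math.dist(coords[i], coords[j]), j) for j in range(n) if j != i
        )
        neighbors = [features[j] for _, j in dists[:k]]
        means = [sum(col) / len(col) for col in zip(*neighbors)]
        augmented.append(features[i] + means)
    return augmented

# Three candidates with one feature each, located along a line.
print(augment_with_neighbors([[1.0], [2.0], [10.0]], [(0,), (1,), (10,)]))
# -> [[1.0, 6.0], [2.0, 5.5], [10.0, 1.5]]
```

The augmented vectors then feed the boosted-tree classifier unchanged, which is what makes this kind of feature augmentation a lightweight way to inject spatial context into a standard learner.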

3 citations