Proceedings ArticleDOI

Semantic Matching Research based on Interactive Multi-head Attention Mechanism

Yulin Liao, +1 more
TLDR
In this article, a multi-head attention mechanism is added to a Siamese model based on bidirectional LSTM to solve the problem of insufficient semantic extraction, and on this basis, it is proposed to add fine-grained matching information in the form of an interactive multi-head attention mechanism.
Abstract
Semantic matching plays a crucial technical supporting role in question answering systems. Current approaches mainly use neural networks to handle sentence representation and interaction for semantic matching. The Siamese network is a commonly used structure for this task, but because the two input sequences are encoded independently with shared parameters, it suffers from insufficient information exchange and incomplete semantic extraction. Therefore, in this paper, a multi-head attention mechanism is added to a Siamese model based on bidirectional LSTM to address the problem of insufficient semantic extraction, and on this basis, fine-grained matching information is added in the form of an interactive multi-head attention mechanism to address the interaction problem. Experimental results show that the performance of the model improves over previous deep learning models.
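The interactive attention step described above can be illustrated with a minimal sketch: each position of one sentence's BiLSTM encoding attends over the other sentence's encoding, split across several heads (scaled dot-product attention). This is not the paper's implementation; the function name, random projection matrices, and dimensions are illustrative assumptions.

```python
import numpy as np

def multi_head_cross_attention(q_seq, k_seq, num_heads, rng):
    """Hypothetical sketch of interactive multi-head attention:
    every position of q_seq attends over all positions of k_seq."""
    len_q, d_model = q_seq.shape
    assert d_model % num_heads == 0
    d_head = d_model // num_heads

    # Random projections stand in for learned parameter matrices.
    w_q = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
    w_k = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
    w_v = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)

    def split_heads(x):
        # (length, d_model) -> (num_heads, length, d_head)
        return x.reshape(x.shape[0], num_heads, d_head).transpose(1, 0, 2)

    q = split_heads(q_seq @ w_q)
    k = split_heads(k_seq @ w_k)
    v = split_heads(k_seq @ w_v)

    # Scaled dot-product scores: (num_heads, len_q, len_k)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over k positions

    out = weights @ v  # (num_heads, len_q, d_head)
    # Merge heads back: (len_q, d_model)
    return out.transpose(1, 0, 2).reshape(len_q, d_model)
```

Running both directions (sentence A attending to B, and B attending to A) yields the fine-grained, interaction-aware representations the abstract refers to; in a trained model the projections would be learned jointly with the BiLSTM encoders.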


References
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
Journal ArticleDOI

Latent Dirichlet Allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

A vector space model for automatic indexing

TL;DR: An approach based on space density computations is used to choose an optimum indexing vocabulary for a collection of documents, demonstrating the usefulness of the model.
Proceedings ArticleDOI

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

TL;DR: The authors introduced the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies, which outperformed all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).
Proceedings ArticleDOI

Learning deep structured semantic models for web search using clickthrough data

TL;DR: A series of new latent semantic models with a deep structure that project queries and documents into a common low-dimensional space where the relevance of a document given a query is readily computed as the distance between them are developed.
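The relevance readout described in this TL;DR can be sketched in a few lines: once a query and documents are projected into a common low-dimensional space, relevance is just cosine similarity. The function below is an illustrative assumption of that final step only, not the deep projection model itself.

```python
import numpy as np

def cosine_relevance(query_vec, doc_vecs):
    """Cosine similarity between one query vector and each document
    vector, all assumed to live in the same low-dimensional space."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    return d @ q  # one relevance score per document
```

Documents can then be ranked by this score; in the cited work the vectors themselves come from deep networks trained on clickthrough data.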