
Showing papers in "arXiv: Computation and Language in 2022"


Book ChapterDOI
TL;DR: The language model's capabilities for causal association among events expressed in natural language text are investigated using sentence context combined with event information, and by leveraging masked event context with in-domain and out-of-domain data distribution.
Abstract: Understanding causality between events is a critical natural language processing task that is helpful in many areas, including health care, business risk management, and finance. On close examination, one finds a huge amount of textual content, both in formal documents and in social media such as Twitter, dedicated to communicating and exploring various types of causality in the real world. Recognizing these "cause-effect" relationships between natural language events remains a challenge because causality is often expressed implicitly. Implicit causality is hard to detect with most techniques in the literature and can at times be perceived as ambiguous or vague. Moreover, although well-known datasets exist for this problem, their examples are limited in the range and complexity of the causal relationships they depict, especially for implicit relationships. Most contemporary methods are either based on lexico-semantic pattern matching or are feature-driven supervised algorithms. These methods are therefore geared towards explicit causal relationships, which leads to limited coverage of implicit relationships, and they are hard to generalize. In this paper, we investigate the language model's capabilities for causal association among events expressed in natural language text using sentence context combined with event information, and by leveraging masked event context with in-domain and out-of-domain data distributions. Our proposed methods achieve state-of-the-art performance on three different data distributions and can be leveraged to extract a causal diagram and/or build a chain of events from unstructured text.

21 citations


Book ChapterDOI
TL;DR: In this paper, the use of visual features in addition to linguistic features is demonstrated, and a detailed comparison shows that the models perform well against robust baselines and state-of-the-art models.
Abstract: Easier access to the Internet and social media has made disseminating information through online sources very easy. Sources like Facebook, Twitter, online news sites, and blogs of self-proclaimed journalists have become significant players in providing news content. The sheer amount of information and the speed at which it is generated online put it beyond the scope of human verification. There is, hence, a pressing need to develop technologies that can assist humans with automatic fact-checking and reliable identification of fake news. This paper summarises the multiple approaches that were undertaken and the experiments that were carried out for the task. Credibility information and metadata associated with the news article have been used for improved results. The experiments also show how modelling justification or evidence can lead to improved results. Additionally, the use of visual features in addition to linguistic features is demonstrated. A detailed comparison of the results is presented, showing that our models perform well when compared to robust baselines and state-of-the-art models.

8 citations


Book ChapterDOI
TL;DR: This paper proposes an abstractive text summarization deep learning model for the Telugu language, based on encoder-decoder sequential models with an attention mechanism, and reports good results measured qualitatively.
Abstract: Abstractive text summarization is the process of constructing semantically relevant shorter sentences that capture the essence of the overall meaning of the source text. Manually summarizing large documents is difficult and very time consuming for humans. Much of the work in abstractive text summarization is being done in English, and almost no significant work has been reported on Telugu abstractive text summarization. In this paper, we propose an abstractive text summarization deep learning model for the Telugu language. The proposed architecture is based on encoder-decoder sequential models with an attention mechanism. We have applied this model to a manually created dataset to generate a one-sentence summary of the source text and have obtained good results, measured qualitatively.

6 citations


Book ChapterDOI
TL;DR: This article investigated the impact of pre-training models (one T5, three Pegasuses, three ProphetNets) on several Wikipedia datasets in English and Indonesian language and compared the results to the Wikipedia systems' summaries.
Abstract: This paper surveys several recent abstractive summarization methods: T5, Pegasus, and ProphetNet. We implement the systems in two languages: English and Indonesian. We investigate the impact of pre-training models (one T5, three Pegasuses, three ProphetNets) on several Wikipedia datasets in English and Indonesian and compare the results to the Wikipedia systems' summaries. The T5-Large, the Pegasus-XSum, and the ProphetNet-CNNDM provide the best summarization. The most significant factors that influence ROUGE performance are coverage, density, and compression. The higher the scores, the better the summary. Other factors that influence the ROUGE scores are the pre-training goal, the dataset's characteristics, the dataset used for testing the pre-trained model, and the cross-lingual function. Suggestions for addressing this paper's limitations are: (1) ensure that the dataset used for the pre-training model is sufficiently large and contains adequate instances for cross-lingual purposes; (2) the advanced process (fine-tuning) should be reasonable. We recommend using a large dataset with comprehensive coverage of topics from many languages before applying advanced processes, such as the train-infer-train procedure for zero-shot translation, in the training stage of the pre-training model.
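As a rough illustration of the quantities this abstract names, the sketch below computes a unigram-overlap ROUGE-1 F1 score and a compression ratio in plain Python. It is not the paper's evaluation code: the function names `rouge1_f1` and `compression_ratio` are hypothetical, tokenization is simple lowercased whitespace splitting, and compression is taken as article length divided by summary length, which is one common definition and an assumption here.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate summary and a reference.

    Hypothetical helper: lowercased whitespace tokens, no stemming.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    # Clipped overlap: a token counts only as often as it appears in both.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def compression_ratio(article: str, summary: str) -> float:
    """Article length divided by summary length, in whitespace tokens.

    Assumed definition; the paper may compute compression differently.
    """
    summary_len = len(summary.split())
    if summary_len == 0:
        raise ValueError("summary must contain at least one token")
    return len(article.split()) / summary_len
```

Published ROUGE implementations typically add stemming and report ROUGE-2 and ROUGE-L as well, so scores from this sketch are not comparable to the numbers reported in the paper.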

2 citations


Book ChapterDOI
TL;DR: It is concluded that the research proposes a promising method for domain-controlled title generation, which used the pre-trained text-to-text transformer model and the additional token technique to generate titles that looked realistic, convincing, and very close to the ground truth.
Abstract: We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title allows you to get the attention that your research deserves. A title can be interpreted as a high-compression description of a document containing information on the implemented process. For domain-controlled titles, we used the pre-trained text-to-text transformer model and the additional token technique. Title tokens are sampled from a local distribution over the domain-specific vocabulary (a subset of the global vocabulary) rather than from the global vocabulary, thereby generating a catchy title that is closely linked to its corresponding abstract. Generated titles looked realistic, convincing, and very close to the ground truth. We performed automated evaluation using the ROUGE metric and human evaluation using five parameters to compare human and machine-generated titles. The titles produced were considered acceptable with higher metric ratings in contrast to the original titles. Thus we concluded that our research proposes a promising method for domain-controlled title generation.