Showing papers in "arXiv: Computation and Language in 2022"

Book Chapter•DOI•

Causal BERT: Language Models for Causality Detection Between Events Expressed in Text

Vivek Khetan¹, Roshni R. Ramnani¹, Mayuresh Anand², Shubhashis Sengupta¹, Andrew E. Fano¹•Institutions (2)

Accenture¹, University of California, Santa Barbara²

01 Jan 2022-arXiv: Computation and Language

TL;DR: The language model's capabilities for causal association among events expressed in natural language text are investigated using sentence context combined with event information, and by leveraging masked event context with in-domain and out-of-domain data distribution.

Abstract: Causality understanding between events is a critical natural language processing task that is helpful in many areas, including health care, business risk management, and finance. On close examination, one can find a huge amount of textual content, both in formal documents and in content arising from social media like Twitter, dedicated to communicating and exploring various types of causality in the real world. Recognizing these "cause-effect" relationships between natural language events remains a challenge because they are often expressed implicitly. Implicit causality is hard to detect through most of the techniques employed in the literature and can also, at times, be perceived as ambiguous or vague. Also, although well-known datasets do exist for this problem, the examples in them are limited in the range and complexity of the causal relationships they depict, especially when related to implicit relationships. Most of the contemporary methods are either based on lexico-semantic pattern matching or are feature-driven supervised algorithms. Therefore, these methods are more geared towards handling explicit causal relationships, leading to limited coverage for implicit relationships, and are hard to generalize. In this paper, we investigate the language model's capabilities for causal association among events expressed in natural language text using sentence context combined with event information, and by leveraging masked event context with in-domain and out-of-domain data distribution. Our proposed methods achieve state-of-the-art performance in three different data distributions and can be leveraged for extraction of a causal diagram and/or building a chain of events from unstructured text.
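
As a rough illustration of the two input styles the abstract mentions (sentence context combined with event information, and masked event context), the following minimal Python sketch prepares both views of a sentence. It is not the authors' implementation: the marker tokens `<e1>`/`<e2>`, the `[MASK]` placeholder, the function names, and the example sentence are all assumptions made for illustration.

```python
def mark_events(tokens, e1, e2):
    """Wrap two event spans in marker tokens (hypothetical markers).

    Spans are (start, end) token indices with end exclusive.
    """
    out = []
    for i, tok in enumerate(tokens):
        if i == e1[0]:
            out.append("<e1>")
        if i == e2[0]:
            out.append("<e2>")
        out.append(tok)
        if i == e1[1] - 1:
            out.append("</e1>")
        if i == e2[1] - 1:
            out.append("</e2>")
    return " ".join(out)


def mask_events(tokens, e1, e2, mask="[MASK]"):
    """Replace every token inside either event span with a mask token,
    leaving only the surrounding sentence context visible."""
    out = []
    for i, tok in enumerate(tokens):
        inside = e1[0] <= i < e1[1] or e2[0] <= i < e2[1]
        out.append(mask if inside else tok)
    return " ".join(out)


# Illustrative sentence and spans (not taken from any dataset in the paper).
tokens = "The flood caused severe damage".split()
event_1 = (1, 2)  # "flood"
event_2 = (3, 5)  # "severe damage"

marked = mark_events(tokens, event_1, event_2)
masked = mask_events(tokens, event_1, event_2)
print(marked)
print(masked)
```

The marked view keeps the event words and tells a classifier where they are; the masked view hides them, so a model can only rely on context such as the connective "caused".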

21 citations

Book Chapter•DOI•

Fake News Detection: Experiments and Approaches Beyond Linguistic Features

Shaily Bhatt¹, Naman Goenka¹, Sakshi Kalra¹, Yashvardhan Sharma¹•Institutions (1)

Birla Institute of Technology and Science¹

01 Jan 2022-arXiv: Computation and Language

TL;DR: This paper demonstrates the use of visual features in addition to linguistic features and presents a detailed comparison showing that the models perform well against robust baselines and state-of-the-art models.

Abstract: Easier access to the Internet and social media has made disseminating information through online sources very easy. Sources like Facebook, Twitter, online news sites, and blogs of self-proclaimed journalists have become significant players in providing news content. The sheer amount of information and the speed at which it is generated online put it beyond the scope of human verification. There is, hence, a pressing need to develop technologies that can assist humans with automatic fact-checking and reliable identification of fake news. This paper summarises the multiple approaches that were undertaken and the experiments that were carried out for the task. Credibility information and metadata associated with the news article have been used for improved results. The experiments also show how modelling justification or evidence can lead to improved results. Additionally, the use of visual features in addition to linguistic features is demonstrated. A detailed comparison of the results, showing that our models perform well when compared to robust baselines and state-of-the-art models, is presented.

8 citations

Book Chapter•DOI•

Neural Abstractive Text Summarizer for Telugu Language

B. Mohan Bharath¹, B. Aravindh Gowtham², M. Akhil³•Institutions (3)

Indian Institute of Information Technology, Allahabad¹, Shiv Nadar University², University of Hyderabad³

01 Jan 2022-arXiv: Computation and Language

TL;DR: This paper proposes an abstractive text summarization deep learning model for the Telugu language, based on encoder-decoder sequential models with an attention mechanism, and reports good results as measured qualitatively.

Abstract: Abstractive text summarization is the process of constructing semantically relevant shorter sentences that capture the essence of the overall meaning of the source text. It is difficult and very time-consuming for humans to manually summarize large text documents. Much of the work in abstractive text summarization is being done in English, and almost no significant work has been reported in Telugu abstractive text summarization. So, we would like to propose an abstractive text summarization approach for the Telugu language using deep learning. In this paper, we propose an abstractive text summarization deep learning model for the Telugu language. The proposed architecture is based on encoder-decoder sequential models with an attention mechanism. We have applied this model to a manually created dataset to generate a one-sentence summary of the source text and have obtained good results as measured qualitatively.

6 citations

Book Chapter•DOI•

A Survey of Recent Abstract Summarization Techniques

Diyah Puspitaningrum¹•Institutions (1)

University of Bengkulu¹

01 Jan 2022-arXiv: Computation and Language

TL;DR: This article investigated the impact of pre-training models (one T5, three Pegasuses, three ProphetNets) on several Wikipedia datasets in English and Indonesian language and compared the results to the Wikipedia systems' summaries.

Abstract: This paper surveys several recent abstract summarization methods: T5, Pegasus, and ProphetNet. We implement the systems in two languages: English and Indonesian. We investigate the impact of pre-training models (one T5, three Pegasuses, three ProphetNets) on several Wikipedia datasets in English and Indonesian and compare the results to the Wikipedia systems' summaries. The T5-Large, the Pegasus-XSum, and the ProphetNet-CNNDM provide the best summarization. The most significant factors that influence ROUGE performance are coverage, density, and compression. The higher the scores, the better the summary. Other factors that influence the ROUGE scores are the pre-training goal, the dataset's characteristics, the dataset used for testing the pre-trained model, and the cross-lingual function. Several suggestions to address this paper's limitations are: (1) ensure that the dataset used for the pre-training model is sufficiently large and contains adequate instances for handling cross-lingual purposes; (2) the advanced process (fine-tuning) should be reasonable. We recommend using a large dataset with comprehensive coverage of topics from many languages before implementing advanced processes such as the train-infer-train procedure to the zero-shot translation in the training stage of the pre-training model.
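
To make the ROUGE and compression terms above concrete, here is a minimal Python sketch of a simplified ROUGE-1 score and a compression ratio. It is a sketch under stated assumptions, not the evaluation code used in the paper: it tokenizes on whitespace, lowercases, and applies no stemming, and it takes compression to be the source-to-summary word-count ratio, a definition the abstract itself does not spell out. The function names and example strings are illustrative.

```python
from collections import Counter


def rouge1(candidate, reference):
    """Simplified ROUGE-1: clipped unigram overlap between two texts.

    Returns (precision, recall, f1). Whitespace tokenization and
    lowercasing only; no stemming or stopword handling.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    # Counter intersection keeps the minimum count of each shared unigram.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    precision = overlap / len(cand) if cand else 0.0
    recall = overlap / len(ref) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def compression(source, summary):
    """Assumed definition: words in the source divided by words in the summary."""
    summary_len = len(summary.split())
    if summary_len == 0:
        raise ValueError("summary must contain at least one word")
    return len(source.split()) / summary_len


p, r, f = rouge1("the cat sat", "the cat sat on the mat")
ratio = compression("a b c d e f", "a b c")
print(p, r, f, ratio)
```

In this example every candidate word appears in the reference, so precision is 1.0, while only half of the reference words are covered, so recall is 0.5.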

2 citations

Book Chapter•DOI•

Domain-Controlled Title Generation with Human Evaluation

Abdul Waheed¹, Muskan Goyal¹, Nimisha Mittal¹, Deepak Gupta¹•Institutions (1)

Maharaja Agrasen Institute of Technology¹

01 Jan 2022-arXiv: Computation and Language

TL;DR: The research proposes a promising method for domain-controlled title generation that uses a pre-trained text-to-text transformer model and the additional token technique to generate titles that look realistic, convincing, and very close to the ground truth.

Abstract: We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title allows you to get the attention that your research deserves. A title can be interpreted as a high-compression description of a document containing information on the implemented process. For domain-controlled titles, we used the pre-trained text-to-text transformer model and the additional token technique. Title tokens are sampled from a local distribution (which is a subset of global vocabulary) of the domain-specific vocabulary and not global vocabulary, thereby generating a catchy title and closely linking it to its corresponding abstract. Generated titles looked realistic, convincing, and very close to the ground truth. We have performed automated evaluation using ROUGE metric and human evaluation using five parameters to make a comparison between human and machine-generated titles. The titles produced were considered acceptable with higher metric ratings in contrast to the original titles. Thus we concluded that our research proposes a promising method for domain-controlled title generation.
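
One possible reading of the two ideas in this abstract, the additional domain token and sampling title tokens from a domain-specific subset of the vocabulary, can be sketched in Python as follows. This is an illustration under assumptions, not the authors' method: the `<domain>` token format, the function names, and the toy probabilities are hypothetical, and no real model is involved.

```python
def add_domain_token(abstract, domain):
    """Prepend a hypothetical domain control token to the model input."""
    return f"<{domain}> {abstract.strip()}"


def restrict_to_domain(token_probs, domain_vocab):
    """Keep only tokens in the domain vocabulary and renormalize,
    so that sampling draws from a local (domain-specific) distribution."""
    kept = {t: p for t, p in token_probs.items() if t in domain_vocab}
    total = sum(kept.values())
    if total == 0:
        raise ValueError("no probability mass on domain vocabulary")
    return {t: p / total for t, p in kept.items()}


# Toy next-token distribution over a tiny "global" vocabulary.
global_probs = {"neural": 0.2, "network": 0.2, "banana": 0.6}
domain_vocab = {"neural", "network"}

model_input = add_domain_token("  We study automatic title generation.  ", "cs")
local_probs = restrict_to_domain(global_probs, domain_vocab)
print(model_input)
print(local_probs)
```

After restriction, the out-of-domain token "banana" can no longer be sampled, and the remaining probabilities again sum to one.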
