scispace - formally typeset

What is the current state of research on information extraction using large language models? A literature review across various studies.


Best insight from top research papers

The current state of research on information extraction using large language models (LLMs) is highly promising and rapidly evolving. Studies have shown that LLMs, such as GPT-4, are capable of significantly enhancing biomedical knowledge curation. These models excel in tasks like recognizing protein interactions, pathways, and gene regulatory relations, offering automated extraction of crucial biological knowledge from scientific literature. Moreover, LLMs have revolutionized the field of Information Retrieval (IR), providing advanced solutions for text understanding, generation, and knowledge inference, while also posing challenges like computational costs and ethical considerations. Bibliometric analyses reveal the extensive impact of LLMs across various domains, showcasing their potential to transform science and technology.

Answers from top 5 papers

The paper provides a comprehensive review of large language models (LLMs) research from 2017 to 2023, covering core algorithm developments, NLP tasks, applications in diverse fields, and evolving research trends.
Open-access preprint (posted 3 Apr 2023, 1 citation)
Various studies survey information extraction with LLMs, charting trends, applications, and research collaborations across the field.
The current state of research on information extraction using Large Language Models (LLMs) is evolving, with LLMs enhancing text understanding, generation, and knowledge inference in Information Retrieval (IR) research.
The research evaluates large language models for extracting molecular interactions and pathway knowledge, highlighting their effectiveness in recognizing protein interactions, pathways, and gene regulatory relations.
Large language models like GPT-4 can enhance biomedical knowledge extraction through distillation, achieving significant gains in tasks like adverse drug event extraction without labeled data, showcasing promising advancements in information extraction.

Related Questions

What are the latest advancements in Large Language Models for natural language processing tasks?
5 answers
The latest advancements in Large Language Models (LLMs) for natural language processing tasks include exploring diverse attributed prompts for training data generation, developing specialized models like ClinicalGPT for medical applications, and introducing challenging benchmarks for assessing LLM capabilities across various dimensions like processing long documents, domain-specific knowledge utilization, multilingual understanding, and multitasking. These advancements aim to enhance model performance, address domain-specific requirements, and push the boundaries of LLM capabilities. Additionally, ongoing research focuses on refining LLM architectures, training strategies, and performance evaluations to improve training stability, generalization, and overall model effectiveness.
How do language models contribute to the efficiency and accuracy of information extraction processes?
5 answers
Language models (LMs) play a crucial role in enhancing the efficiency and accuracy of information extraction processes. They achieve this by leveraging pre-trained models like large language models (LLMs) and incorporating various techniques such as transformer-based models and graph neural networks. LMs aid in tasks like text classification, word sense disambiguation, named entity recognition, and relation extraction. Additionally, LMs reduce the need for data labeling and training multiple models by enabling end-to-end solutions for information extraction, thus streamlining the process. Furthermore, LMs like LLMs can be prompted to directly extract values from documents or synthesize code for extraction, showcasing a cost-quality tradeoff where code synthesis can be more cost-effective but less accurate than direct extraction. This demonstrates how language models significantly contribute to the efficiency and accuracy of information extraction tasks.
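The cost–quality tradeoff above hinges on how the model is prompted and how its reply is parsed. A minimal sketch of direct value extraction via prompting, assuming a generic `llm` callable; the `fake_llm` stand-in, the field names, and the JSON-only reply format are illustrative assumptions, not any cited paper's actual method:

```python
import json

def build_extraction_prompt(document: str, fields: list[str]) -> str:
    """Compose an instruction asking the model to reply with JSON only."""
    field_list = ", ".join(fields)
    return (
        "Extract the following fields from the document and reply with a "
        f"single JSON object containing exactly these keys: {field_list}.\n"
        f"Document:\n{document}"
    )

def parse_extraction(reply: str) -> dict:
    """Parse the model reply, tolerating stray text around the JSON object."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end == -1:
        raise ValueError("no JSON object in model reply")
    return json.loads(reply[start:end + 1])

def extract(document, fields, llm):
    # `llm` is any callable mapping a prompt string to a reply string,
    # e.g. a thin wrapper around a hosted chat-completion endpoint.
    return parse_extraction(llm(build_extraction_prompt(document, fields)))

# A canned stand-in model so the sketch runs without network access.
fake_llm = lambda prompt: 'Sure! {"drug": "aspirin", "event": "nausea"}'
print(extract("Patient reported nausea after aspirin.", ["drug", "event"], fake_llm))
# → {'drug': 'aspirin', 'event': 'nausea'}
```

The robustness of `parse_extraction` to chatter around the JSON is one reason direct extraction tends to be more accurate (if costlier) than synthesizing extraction code.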
How effective are large language models in named entity extraction compared to traditional machine learning algorithms?
10 answers
Large Language Models (LLMs) have shown significant promise in the field of Named Entity Recognition (NER), a crucial task in Natural Language Processing (NLP) that involves identifying specific entities within text. However, their effectiveness compared to traditional machine learning algorithms varies depending on the context and the specific requirements of the task. Recent studies have demonstrated that while LLMs, such as ChatGPT, can perform NER tasks with a degree of success, they still lag behind fine-tuned models like BioClinicalBERT in terms of overall performance. For instance, in the context of extracting rare disease phenotypes, fine-tuning BioClinicalBERT achieved a higher F1 score compared to ChatGPT in both zero- and few-shot settings. Nonetheless, ChatGPT showed potential in achieving similar or even higher accuracy for certain entity types with minimal labeled data, suggesting that with appropriate prompt engineering, LLMs could match or outperform fine-tuned models for specific entities. The introduction of methodologies like GPT-NER, which bridges the gap between sequence labeling tasks and text-generation models, has further enhanced the capabilities of LLMs in NER tasks. GPT-NER, for example, has demonstrated comparable performances to fully supervised baselines and has shown greater ability in low-resource and few-shot setups. Moreover, LLMs have been explored for their utility in extracting structured tabular data from textual medical reports, showcasing their potential beyond traditional text classification models. However, challenges such as the "hallucination" issue, where LLMs might over-confidently label null inputs as entities, necessitate strategies like self-verification to ensure reliability.
In comparison, traditional machine learning models, when fine-tuned and combined with LLMs, can offer a synergistic approach. For instance, an adaptive filter-then-rerank paradigm that leverages both small Pre-trained Language Models (SLMs) and LLMs has shown promising improvements in information extraction tasks. In summary, while LLMs have made remarkable strides in NER and related tasks, their effectiveness is enhanced when combined with traditional machine learning algorithms or when specific methodologies, such as GPT-NER, are employed to address their inherent limitations. This hybrid approach, leveraging the strengths of both LLMs and traditional algorithms, appears to be a promising direction for future research and application.
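The filter-then-rerank idea can be sketched as routing only low-confidence candidates from the small model to the expensive LLM; the `small_score` confidences, threshold, and `llm_rerank` stub below are hypothetical placeholders, not the cited paradigm's actual components:

```python
def filter_then_rerank(candidates, small_score, llm_rerank, threshold=0.8):
    """Keep candidates the small model is confident about; defer the rest
    to the LLM so only hard cases pay the cost of an LLM call."""
    confident, uncertain = [], []
    for cand in candidates:
        (confident if small_score(cand) >= threshold else uncertain).append(cand)
    return confident + llm_rerank(uncertain)

# Hypothetical small-model confidences for three candidate entity mentions.
conf = {"aspirin": 0.95, "patient": 0.30, "nausea": 0.90}

# Stand-in reranker: pretend the LLM rejects "patient" as a non-entity.
keep_if_llm_agrees = lambda cands: [c for c in cands if c != "patient"]

print(filter_then_rerank(list(conf), conf.get, keep_if_llm_agrees))
# → ['aspirin', 'nausea']
```

The design choice is the usual one in hybrid pipelines: the small model bounds cost, the LLM bounds error on the cases the small model cannot decide.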
What are some studies which describe how large language models are being utilized?
4 answers
Large language models (LLMs) have been utilized in various studies. One study explores the extension of chain-of-thought (CoT) prompting to medical reasoning, showing that prompting LLMs with Diagnostic-Reasoning CoT exemplars improves diagnostic accuracy. Another study investigates how LLMs can be used for spear phishing, demonstrating their ability to assist with reconnaissance and message generation stages of attacks. Additionally, LLMs have been applied in the analysis of ROS 2 logs generated by autonomous robots, with GPT-4 outperforming other models in answering questions related to log files. These studies highlight the versatility of LLMs in various domains, including medical reasoning, cybersecurity, and autonomous robotics log analysis.
Can large language models be used to extract crime events from text?
5 answers
Large language models can be used to extract crime events from text. By converting video descriptions into high-quality textual descriptions, these models can detect and classify crimes with state-of-the-art performance using zero-shot reasoning. Additionally, language models can be instructed to extract a variety of structures from texts, including information related to crimes, by adding specific instructions before feeding the text into the model. Furthermore, large language models, such as GPT-3, have been shown to segment continuous narrative text into events, including crime events, with annotations that are significantly correlated with human annotations. This suggests that language models provide a feasible solution for automated event annotations, including crimes, and can contribute to the understanding of human event perception.
What are the current limitations of Large Language Models?
5 answers
Large Language Models (LLMs) have several limitations. One limitation is the potential for biases in their output, which can introduce inaccuracies and reinforce societal biases. Another limitation is the vulnerability of LLMs to adversarial prompting attacks, where prompts can trigger the model to output undesired behaviors. Additionally, LLMs may struggle with aligning their behavior to be useful and unharmful for human users, as the alignment process may not completely remove undesired behaviors. Furthermore, the performance of LLMs in diagnostic tasks can vary depending on the type of input, with feature-based approaches yielding worse results compared to narrative-based approaches. These limitations highlight the need for further research and algorithmic development to ensure the safety, accuracy, and ethical use of LLMs in various applications.

See what other people are reading

Dos Santos C, Gatti M. Deep convolutional neural networks for sentiment analysis of short texts
5 answers
Dos Santos C, Gatti M. utilized deep convolutional neural networks for sentiment analysis of short texts. This approach is crucial in the field of natural language processing (NLP) due to the increasing importance of sentiment analysis in understanding subjective information from text data. The use of deep learning neural networks, such as convolutional neural networks (CNN) and long short-term memory (LSTM), has shown promising results in sentiment categorization. Additionally, the study by Zhan Shi and Chongjun Fan highlighted the advantages of Bayesian and deep neural networks in short text sentiment classification, emphasizing the effectiveness of these algorithms in text representation for sentiment analysis tasks. Furthermore, the work by Raed Khalid and Pardeep Singh demonstrated the potential of using S-BERT pre-trained embeddings in combination with a CNN model for sentiment analysis, outperforming traditional machine learning approaches and word embedding models.
Dos Santos C, Gatti M. Deep convolutional neural networks for sentiment analysis of short texts
5 answers
Dos Santos C, Gatti M. proposed the use of deep convolutional neural networks (CNNs) for sentiment analysis of short texts. This approach leverages the power of deep learning in natural language processing (NLP). The study by Raed Khalid and Pardeep Singh also highlighted the effectiveness of CNNs in sentiment analysis, achieving high accuracy by combining S-BERT pre-trained embeddings with a CNN model. Additionally, research by Zhan Shi and Chongjun Fan emphasized the advantages of Bayesian and deep neural networks in short text sentiment classification, showcasing high classification accuracy. These findings collectively support the notion that deep CNNs can be a valuable tool for analyzing sentiments in short texts, offering promising results for various applications in NLP.
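The core operation these CNN approaches share — sliding a filter over adjacent token embeddings and max-pooling the responses — can be illustrated in a few lines. The toy 2-d embeddings and the hand-set kernel weights below are invented for the example; real systems learn the kernel and use pretrained embeddings:

```python
from math import tanh

# Toy 2-d token embeddings (illustrative; not from any pretrained model).
EMB = {"good": [1.0, 0.2], "bad": [-1.0, 0.1], "movie": [0.0, 0.5]}

def conv_max_pool(tokens, kernel, width=2):
    """Slide one kernel over windows of `width` adjacent token embeddings
    (concatenated), apply a tanh nonlinearity, then max-pool over positions."""
    feats = []
    for i in range(len(tokens) - width + 1):
        window = [x for t in tokens[i:i + width] for x in EMB[t]]
        feats.append(tanh(sum(w * x for w, x in zip(kernel, window))))
    return max(feats)

kernel = [0.9, 0.1, 0.1, 0.1]  # hand-set weights just for illustration
score = conv_max_pool(["good", "movie"], kernel)
print("positive" if score > 0 else "negative")  # → positive
```

Max-pooling is what makes the classifier insensitive to where in a short text the sentiment-bearing n-gram occurs, which is the property the CNN papers exploit.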
What actions can be taken to improve the overall impact of the career development opportunity?
5 answers
To enhance the overall impact of career development opportunities, several actions can be implemented. Firstly, there is a need for better evaluation systems to assess the efficacy of career development services and inform public policy decisions. Secondly, educators should prioritize career development activities across all educational levels to prepare students effectively for their future transitions. Additionally, improving the quality of career services can positively influence national education, employment, and social inclusion priorities, thereby enhancing human capital development. Moreover, reframing professional development trainings as opportunities for professional growth rather than burdens can help employees perceive them more positively, potentially increasing their engagement and completion rates. Lastly, integrating career development processes that focus on individual employee development while enhancing organizational efficiency can lead to sustained improvements in overall organizational performance.
How can taxpayers' ability to pay be improved?
5 answers
To enhance taxpayers' ability to pay, several strategies can be implemented. Firstly, tax education and training, service modernization, and maintaining engagement with taxpayers can improve compliance levels. Secondly, wise government spending is crucial as it influences SMEs owners' willingness to pay taxes, ultimately boosting tax revenue. Additionally, understanding the historical development of the ability-to-pay principle is essential to create a fair tax system. Moreover, proposing innovative tax policies, such as dividing transferred assets into classes and providing options for income inclusion, can make the tax system more equitable and prevent the need to sell assets for tax payments. Lastly, factors like tax knowledge, quality fiscal services, and perception of the tax system's effectiveness influence taxpayers' awareness and willingness to fulfill tax obligations.
What are the disadvantages of technology?
5 answers
The disadvantages of technology encompass various aspects highlighted in the provided contexts. Technology in English language learning is perceived as a drawback by university students in Malaysia, along with the high cost of technology and challenges in English language teaching using technology. Rousseau's discourse emphasizes the negative impact of technology when it disrupts the natural relationship between human nature and technical economies. Communication technologies like cell phones and social networking sites can lead to misinterpretation of messages, barriers in intimacy development, and even facilitate infidelity. In a military context, over-reliance on advanced technological means led to the loss of personnel due to limitations in imagery intelligence and situational awareness. The complications of technological achievements, including societal disconnection and environmental concerns, are also highlighted as drawbacks.
What role do government policies and funding play in ensuring equitable access to maternal healthcare services in Turkana County?
5 answers
Government policies and funding are pivotal in ensuring equitable access to maternal healthcare services in Turkana County. The Kenyan government's initiatives like the 'Linda Mama' program prioritize improving reproductive, maternal, child, and adolescent health outcomes. Additionally, the adoption of free maternal healthcare policies, such as exempting maternal services from user fees, aims to promote skilled delivery and reduce pregnancy-related mortality. However, challenges persist, including limited access to public services for pastoralists due to their mobile lifestyle and marginalization, leading to poor health outcomes. To address these challenges, a proposed One Health framework in Turkana County integrates human and animal health services, aiming to improve health outcomes through increased vaccine coverage and improved service access. By addressing these issues through comprehensive policies and funding, equitable access to maternal healthcare services in Turkana County can be enhanced.
What are the current advances in DL for forecasting demand curves?
5 answers
The current advancement in Deep Learning (DL) for forecasting demand curves is significant, as evidenced by recent research. Various studies have highlighted the benefits of utilizing DL techniques for demand forecasting, showcasing improvements in accuracy and robustness. These advancements involve the integration of real-life events from news articles, historical sales data, holiday information, and even Google Trends data into multi-modal forecasting networks. Additionally, the use of Recurrent Neural Networks (RNN) with LSTM layers has shown superior forecasting performance compared to traditional regression models like SARIMA-MLR. Furthermore, the application of deep learning techniques in network slicing has led to the development of multi-model-based forecasting frameworks that enhance resource allocation efficiency and guarantee quality of experience in wireless networks.
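Whatever the architecture (LSTM-based RNNs, SARIMA-MLR hybrids, multi-modal networks), these forecasters are trained on a supervised, windowed view of the demand history. A minimal sketch of that framing, with an invented toy demand series and a naive baseline for comparison:

```python
def make_windows(series, width):
    """Turn a demand history into (input window, next value) training pairs —
    the supervised framing that windowed DL forecasters are trained on."""
    return [(series[i:i + width], series[i + width])
            for i in range(len(series) - width)]

def naive_forecast(window):
    # Seasonal-naive baseline: predict the value from one full window ago.
    return window[0]

demand = [100, 120, 90, 105, 125, 95, 110]  # toy weekly demand
pairs = make_windows(demand, width=3)
print(pairs[0])  # → ([100, 120, 90], 105)
```

A model is only worth its training cost if it beats such naive baselines on held-out windows, which is how the cited comparisons against SARIMA-MLR are typically run.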
What exists on federated learning?
4 answers
Federated Learning (FL) is a distributed machine learning approach that enables geographically distributed data silos to collaboratively learn a joint machine learning model without sharing data. Existing work primarily focuses on unstructured or consistent structured data. However, applying FL to deeper neural networks has shown a performance decline due to "divergence accumulation," where dissimilarities among client models accumulate during back-propagation, leading to decreased accuracy. Strategies like using wider models and reducing receptive fields can mitigate this issue, significantly enhancing FL accuracy, such as boosting ResNet101 performance by up to 43% on the Tiny-ImageNet dataset. Additionally, a novel FL framework called FedEx leverages mobile transporters for indirect communication, proving convergence and offering solutions for client assignment and route design, even in scenarios lacking direct communication infrastructure.
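The joint model in FL is commonly formed by federated averaging: each client trains locally, and the server combines client weights, weighted by local dataset size, without ever seeing the data. A minimal sketch with made-up two-parameter client models:

```python
def fed_avg(client_weights, client_sizes):
    """Federated averaging: combine client model parameters as a
    dataset-size-weighted mean, so larger silos contribute more."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]

# Two clients with different amounts of local data (3 vs. 1 samples).
global_model = fed_avg([[1.0, 0.0], [0.0, 1.0]], [3, 1])
print(global_model)  # → [0.75, 0.25]
```

The "divergence accumulation" issue described above arises precisely because this averaging step assumes client models stay close enough for their parameter-wise mean to be meaningful, an assumption that weakens as networks get deeper.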
What are new trends in federated learning in medtech?
5 answers
Recent trends in federated learning in medical technology focus on enhancing privacy protection and performance. One key trend involves personalized frameworks that consider the heterogeneity of medical data across different institutions, leading to improved generalization and personalization. Another trend addresses challenges like imbalanced datasets and intermittent clients in decentralized healthcare systems, utilizing data augmentation methods and scalable frameworks for better model training. Additionally, a novel approach involves using Federated Learning-based Electronic Health Record sharing schemes to preserve patient data privacy, employing decentralized models and secure data storage techniques like Private InterPlanetary File Systems (IPFS). These trends highlight the ongoing efforts to optimize federated learning for medical applications while safeguarding sensitive health data.
What applications do digital twins have in sports?
4 answers
Digital twins find various applications in sports, ranging from training optimization to performance enhancement. In the realm of sports training, digital twins are utilized for interval cycling sessions, providing real-time advice based on sophisticated prediction models, resulting in significant differences in training efficiency between professional and amateur athletes. Furthermore, the application extends to sports like Brazilian jujitsu, where a digital twin system combined with wireless sensor networks aids in developing a training system for effective confrontation training, showcasing promising results in trick jujitsu training. Additionally, digital twins are employed to create virtual replicas of athletes' bodies, enabling a deeper understanding of biomechanics and performance, as seen in the case of Australian basketballer Maddison Rocci.
What are the common challenges faced by recommender systems when used in app marketing?
5 answers
Recommender systems in app marketing encounter various challenges. These include cold start issues, data sparsity, overspecialization, lack of freshness, and unreliable metadata. Challenges in recommender systems also encompass limited resources, data validity period, cold start, long tail problem, and scalability in e-commerce settings. Moreover, specialized recommender systems for niche applications face difficulties in addressing small communities with limited content and multiple profiles, requiring tailored solutions for such scenarios. To mitigate these challenges, researchers have proposed solutions such as content-based, collaborative, demographic, hybrid filtering, knowledge-based, utility-based, and classification models in recommender systems. By addressing these obstacles through innovative approaches, recommender systems can enhance user experience and optimize marketing strategies in various applications.
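Collaborative filtering — one of the remedies listed above — can be sketched as similarity-weighted scoring of items the target user has not yet rated. The toy ratings matrix is invented for illustration, and the sketch also shows the cold-start problem directly: a user with no neighbors (or no ratings) yields no recommendation:

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two sparse rating dicts."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    return dot / (sqrt(sum(x * x for x in u.values())) *
                  sqrt(sum(x * x for x in v.values())))

def recommend(target, others, ratings):
    """Score the target user's unseen items by similarity-weighted
    ratings from other users; return the top-scoring item."""
    scores = {}
    for other in others:
        sim = cosine(ratings[target], ratings[other])
        for item, r in ratings[other].items():
            if item not in ratings[target]:
                scores[item] = scores.get(item, 0.0) + sim * r
    return max(scores, key=scores.get) if scores else None

ratings = {
    "alice": {"app1": 5, "app2": 3},
    "bob":   {"app1": 4, "app2": 2, "app3": 5},
    "carol": {"app2": 1, "app4": 4},
}
print(recommend("alice", ["bob", "carol"], ratings))  # → app3
```

Hybrid and content-based variants replace or supplement the `cosine` neighbor signal with item metadata, which is how the cited approaches address cold start and sparse data.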