scispace - formally typeset
Search or ask a question
Topic

Latent Dirichlet allocation

About: Latent Dirichlet allocation is a research topic. Over the lifetime, 5351 publications have been published within this topic receiving 212555 citations. The topic is also known as: LDA.


Papers
More filters
Journal ArticleDOI
01 Jan 2017
TL;DR: The authors proposed sentiment analysis with probabilistic topic model using Latent Dirichlet Allocation (LDA) method to be applied for reading general tendency from tourist review into certain topics that can be classified toward positive and negative sentiment.
Abstract: The tourism industry is one of foreign exchange sector, which has considerable potential development in Indonesia. Compared to other Southeast Asia countries such as Malaysia with 18 million tourists and Singapore 20 million tourists, Indonesia which is the largest Southeast Asia's country have failed to attract higher tourist numbers compared to its regional peers. Indonesia only managed to attract 8,8 million foreign tourists in 2013, with the value of foreign tourists each year which is likely to decrease. Apart from the infrastructure problems, marketing and managing also form of obstacles for tourism growth. An evaluation and self-analysis should be done by the stakeholder to respond toward this problem and capture opportunities that related to tourism satisfaction from tourists review. Recently, one of technology to answer this problem only relying on the subjective of statistical data which collected by voting or grading from user randomly. So the result is still not to be accountable. Thus, we proposed sentiment analysis with probabilistic topic model using Latent Dirichlet Allocation (LDA) method to be applied for reading general tendency from tourist review into certain topics that can be classified toward positive and negative sentiment.

39 citations

Journal ArticleDOI
TL;DR: This article collected tweets about ChatGPT, an innovative AI chatbot, in the first month after its launch and analyzed 233,914 English tweets using the latent Dirichlet allocation (LDA) topic modeling algorithm.
Abstract: In this study, the author collected tweets about ChatGPT, an innovative AI chatbot, in the first month after its launch. A total of 233,914 English tweets were analyzed using the latent Dirichlet allocation (LDA) topic modeling algorithm to answer the question “what can ChatGPT do?”. The results revealed three general topics: news, technology, and reactions. The author also identified five functional domains: creative writing, essay writing, prompt writing, code writing, and answering questions. The analysis also found that ChatGPT has the potential to impact technologies and humans in both positive and negative ways. In conclusion, the author outlines four key issues that need to be addressed as a result of this AI advancement: the evolution of jobs, a new technological landscape, the quest for artificial general intelligence, and the progress-ethics conundrum.

39 citations

Journal ArticleDOI
TL;DR: A new method is described which employs the device of over-fitting, i.e. eliciting more than the minimal number of judgements, in order to produce a more carefully considered Dirichlet distribution and ensure that the DirICHlet distribution is indeed a reasonable fit to the expert's knowledge.
Abstract: Eliciting expert knowledge about several uncertain quantities is a complex task when those quantities exhibit associations. A well-known example of such a problem is eliciting knowledge about a set of uncertain proportions which must sum to 1. The usual approach is to assume that the expert's knowledge can be adequately represented by a Dirichlet distribution, since this is by far the simplest multivariate distribution that is appropriate for such a set of proportions. It is also the most convenient, particularly when the expert's prior knowledge is to be combined with a multinomial sample since then the Dirichlet is the conjugate prior family. Several methods have been described in the literature for eliciting beliefs in the form of a Dirichlet distribution, which typically involve eliciting from the expert enough judgements to identify uniquely the Dirichlet hyperparameters. We describe here a new method which employs the device of over-fitting, i.e. eliciting more than the minimal number of judgements,...

39 citations

Journal ArticleDOI
TL;DR: To automate and expedite the analysis tasks, this study deployed natural language processing (NLP) and commonly used unsupervised learning for text classification, namely latent semantic analysis (LSA) and latent Dirichlet allocation (LDA).

39 citations

Journal ArticleDOI
TL;DR: A voxel-based method for automatically extracting the transmission lines from airborne LiDAR point cloud data that provides higher detection correctness rate and efficiency and robustness compared with other existing methods.
Abstract: The safety of the electricity infrastructure significantly affects both our daily life and industrial activities. Timely and accurate monitoring of the safety of electricity network can prevent dangerous situations effectively. Thus, we, in this paper, develop a voxel-based method for automatically extracting the transmission lines from airborne LiDAR point cloud data. The method proposed in this paper uses three-dimensional (3-D) voxels as primitives and consist of the following steps: First, skeleton structure extraction using Laplacian smoothing; second, feature construction of a 3-D voxel using Latent Dirichlet allocation topic model; and third Markov random field model-based extraction for generating locally continuous and globally optimal results. To evaluate the effectiveness and robustness of the proposed method, experiments were conducted on four different types of power line scenes with flat and complex terrains from helicopter-borne LiDAR point cloud data. Experimental results demonstrate that our proposed method is efficient and robust for automatically detecting both the single conductor and the bundled conductors, with precision, recall, and quality of over 96.78%, 98.67%, and 96.66%, respectively. Moreover, compared with other existing methods, our proposed method provides higher detection correctness rate.

39 citations


Network Information
Related Topics (5)
Cluster analysis
146.5K papers, 2.9M citations
86% related
Support vector machine
73.6K papers, 1.7M citations
86% related
Deep learning
79.8K papers, 2.1M citations
85% related
Feature extraction
111.8K papers, 2.1M citations
84% related
Convolutional neural network
74.7K papers, 2M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023323
2022842
2021418
2020429
2019473
2018446