Deep Learning Based Unsupervised POS Tagging for Sanskrit
TL;DR: A deep learning based approach to assign POS tags to words in a piece of text given to it as input and uses the untagged Sanskrit Corpus prepared by JNU for the tag assignment purpose and determining model accuracy.
Abstract: In this paper, we present a deep learning based approach to assign POS tags to words in a piece of text given to it as input. We propose an unsupervised approach owing to the lack of a large Sanskrit annotated corpora and use the untagged Sanskrit Corpus prepared by JNU for our purpose. The only tagged corpora for Sanskrit is created by JNU which has 115,000 words which are not sufficient to apply supervised deep learning approaches. For the tag assignment purpose and determining model accuracy, we utilize this tagged corpus. We explore various methods through which each Sanskrit word can be represented as a point multi-dimensional vector space whose position accurately captures its meaning and semantic information associated with it. We also explore other data sources to improve performance and robustness of the vector representations. We use these rich vector representations and explore autoencoder based approaches for dimensionality reduction to compress these into encodings which are suitable for clustering in the vector space. We experiment with different dimensions of these compressed representations and present one which was found to offer the best clustering performance. For modelling the sequence in order to preserve the semantic information we feed these embeddings to a bidirectional LSTM autoencoder. We assign a POS tag to each of the obtained clusters and produce our result by testing the model on the tagged corpus.
...read more
Citations
1 citations
Cites background from "Deep Learning Based Unsupervised PO..."
...And delegate the POS tag to the received cluster and refer the model to the label lexicon to yield the result[4]....
[...]
...Insufficient flagged corpus for languages such as Sanskrit, Kannada and other resource-poor languages is a hindrance to all of these modern supervised learning algorithms, which typically involve rich data sets [4]....
[...]
...has been shown to be beneficial in the NLP application[4]....
[...]
References
23,982 citations
"Deep Learning Based Unsupervised PO..." refers methods in this paper
...The skipgram approach as in [10] is based on a log bilinear model which is trained to predict an unordered set of context words given an center word of a training context window....
[...]
6,288 citations
247 citations
66 citations
62 citations
"Deep Learning Based Unsupervised PO..." refers methods in this paper
...In a paper [3], a Conditional Random Field Autoencoder was used to predict the latent representation of the input....
[...]