Open Access · Proceedings Article

Convolutional neural networks over tree structures for programming language processing

TLDR
In this article, a tree-based convolutional neural network (TBCNN) is proposed for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information.
Abstract
Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. However, different from a natural language sentence, a program contains rich, explicit, and complicated structural information. Hence, traditional NLP models may be inappropriate for programs. In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information. TBCNN is a generic architecture for programming language processing; our experiments show its effectiveness in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.
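To make the kernel concrete, here is a minimal sketch of a tree-based convolution in PyTorch. It is an illustrative reimplementation of the idea rather than the authors' code: a depth-1 window slides over each node and its direct children, each child's weight matrix is interpolated between a "left" and a "right" matrix according to its position, and dynamic max-pooling collapses the whole tree into one fixed-size vector. All names (Node, TreeConv) are ours.

```python
# Minimal sketch of a tree-based convolution over an AST, in the spirit
# of TBCNN. Illustrative only; not the authors' implementation.
import torch
import torch.nn as nn


class Node:
    """A toy AST node: an embedding index for its type, plus children."""
    def __init__(self, type_id, children=()):
        self.type_id = type_id
        self.children = list(children)


class TreeConv(nn.Module):
    """One tree-convolution layer: a depth-1 window over each node and
    its direct children, with position-dependent weights, followed by
    dynamic max-pooling over all nodes in the tree."""

    def __init__(self, vocab_size, dim):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.w_top = nn.Linear(dim, dim, bias=False)    # parent weight
        self.w_left = nn.Linear(dim, dim, bias=False)   # leftmost-child weight
        self.w_right = nn.Linear(dim, dim, bias=False)  # rightmost-child weight
        self.bias = nn.Parameter(torch.zeros(dim))

    def node_feature(self, node):
        x = self.embed(torch.tensor(node.type_id))
        y = self.w_top(x)
        n = len(node.children)
        for i, child in enumerate(node.children):
            cx = self.embed(torch.tensor(child.type_id))
            # Interpolate between "left" and "right" matrices by position.
            eta_r = i / (n - 1) if n > 1 else 0.5
            y = y + (1 - eta_r) * self.w_left(cx) + eta_r * self.w_right(cx)
        return torch.tanh(y + self.bias)

    def forward(self, root):
        # Compute a feature for every node, then max-pool over the tree.
        feats, stack = [], [root]
        while stack:
            node = stack.pop()
            feats.append(self.node_feature(node))
            stack.extend(node.children)
        return torch.stack(feats).max(dim=0).values


# Usage: a tiny AST for `x = a + b`, pooled into a fixed-size vector.
tree = Node(0, [Node(1), Node(2, [Node(3), Node(4)])])
conv = TreeConv(vocab_size=10, dim=8)
print(conv(tree).shape)  # torch.Size([8])
```

The max-pooling step is what lets trees of arbitrary shape and size map to one fixed-size feature vector that a downstream classifier can consume.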



Citations
Proceedings Article (DOI)

Deep code comment generation

TL;DR: DeepCom applies Natural Language Processing (NLP) techniques to learn from a large code corpus and generates comments for Java methods from the learned features.
Posted Content

A Survey of Machine Learning for Big Code and Naturalness

TL;DR: This article presents a taxonomy based on the underlying design principles of each model and uses it to navigate the literature and discuss cross-cutting and application-specific challenges and opportunities.
Journal Article (DOI)

A Survey of Machine Learning for Big Code and Naturalness

TL;DR: Research at the intersection of machine learning, programming languages, and software engineering has recently taken important steps in proposing learnable probabilistic models of source code that exploit the abundance of patterns in code.
Proceedings Article (DOI)

Deep code search

TL;DR: A novel deep neural network named CODEnn (Code-Description Embedding Neural Network) is proposed, which jointly embeds code snippets and natural language descriptions into a high-dimensional vector space, in such a way that a code snippet and its corresponding description have similar vectors.
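The joint-embedding idea can be sketched in a few lines, assuming a simple bag-of-tokens encoder in place of CODEnn's actual encoders; the encoder choice, names, and margin value here are ours, not the paper's.

```python
# Illustrative sketch of joint code/description embedding: two encoders
# map token sequences into one vector space, and a margin ranking loss
# pushes a snippet toward its own description and away from a mismatch.
import torch
import torch.nn as nn
import torch.nn.functional as F


class BagEncoder(nn.Module):
    """Embed a token sequence and mean-pool it into one vector."""
    def __init__(self, vocab_size, dim):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)

    def forward(self, token_ids):                 # (batch, seq_len)
        return self.embed(token_ids).mean(dim=1)  # (batch, dim)


code_enc, desc_enc = BagEncoder(1000, 64), BagEncoder(1000, 64)

def ranking_loss(code, pos_desc, neg_desc, margin=0.05):
    c = code_enc(code)
    good = F.cosine_similarity(c, desc_enc(pos_desc))  # matching pairs
    bad = F.cosine_similarity(c, desc_enc(neg_desc))   # mismatched pairs
    return F.relu(margin - good + bad).mean()

# Usage with random token ids; at search time, snippets are ranked by
# cosine similarity to the embedded query.
code = torch.randint(0, 1000, (4, 20))   # 4 snippets, 20 tokens each
pos = torch.randint(0, 1000, (4, 12))    # their true descriptions
neg = torch.randint(0, 1000, (4, 12))    # mismatched descriptions
print(ranking_loss(code, pos, neg))
```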
Proceedings Article

Discriminative embeddings of latent variable models for structured data

TL;DR: The paper proposes structure2vec, an effective and scalable approach for representing structured data, based on embedding latent variable models into feature spaces and learning those feature spaces using discriminative information.
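A hedged sketch of what "embedding latent variable models into feature spaces" looks like in practice: each node's embedding is refined from its own features and its neighbors' embeddings for a few rounds, mimicking unrolled mean-field inference. The names and dimensions here are ours.

```python
# structure2vec-flavored embedding update, much simplified.
import torch
import torch.nn as nn


class Structure2VecLayer(nn.Module):
    def __init__(self, feat_dim, dim):
        super().__init__()
        self.w_feat = nn.Linear(feat_dim, dim)  # node-feature term
        self.w_msg = nn.Linear(dim, dim)        # neighbor-message term

    def forward(self, x, adj, rounds=3):
        # x: (n, feat_dim) node features; adj: (n, n) adjacency matrix.
        mu = torch.zeros(x.size(0), self.w_msg.out_features)
        for _ in range(rounds):
            # Refine each embedding from features and summed neighbor
            # embeddings, like one round of mean-field inference.
            mu = torch.relu(self.w_feat(x) + self.w_msg(adj @ mu))
        return mu.sum(dim=0)  # pooled graph-level embedding


layer = Structure2VecLayer(feat_dim=4, dim=16)
x = torch.randn(5, 4)                     # 5 nodes with 4 features each
adj = (torch.rand(5, 5) > 0.5).float()    # a random toy graph
print(layer(x, adj).shape)                # torch.Size([16])
```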
References
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieves state-of-the-art performance on ImageNet classification.
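The described architecture, sketched approximately in PyTorch. Layer sizes follow the paper only roughly, and the normalization layers and two-GPU split are omitted; this is illustrative, not a faithful reproduction.

```python
# Approximate AlexNet-style layer stack: 5 conv layers (some with
# max-pooling), then 3 fully-connected layers ending in 1000 logits.
import torch.nn as nn

features = nn.Sequential(
    nn.Conv2d(3, 96, kernel_size=11, stride=4), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(kernel_size=3, stride=2),
)
classifier = nn.Sequential(
    nn.Flatten(),
    # For 227x227 inputs the final feature map is 256 x 6 x 6.
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(), nn.Dropout(),
    nn.Linear(4096, 4096), nn.ReLU(), nn.Dropout(),
    nn.Linear(4096, 1000),  # logits for the 1000-way softmax
)
```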
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.
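The negative-sampling objective can be written down directly: score the observed (center, context) pair high and k sampled noise words low. A minimal sketch, with variable names of our choosing:

```python
# Negative-sampling loss for one (center, context) pair plus k noise words.
import torch
import torch.nn.functional as F

dim, vocab, k = 64, 10000, 5
in_vec = torch.randn(vocab, dim, requires_grad=True)   # center-word vectors
out_vec = torch.randn(vocab, dim, requires_grad=True)  # context-word vectors

def neg_sampling_loss(center, context, noise):
    """center, context: word ids; noise: (k,) tensor of sampled noise ids."""
    v = in_vec[center]
    pos = F.logsigmoid(out_vec[context] @ v)         # true pair: push score up
    neg = F.logsigmoid(-(out_vec[noise] @ v)).sum()  # noise pairs: push down
    return -(pos + neg)

loss = neg_sampling_loss(3, 17, torch.randint(0, vocab, (k,)))
loss.backward()  # gradients flow only to the few vectors touched
```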
Journal Article (DOI)

A fast learning algorithm for deep belief nets

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
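A much-simplified sketch of the layer-wise idea, using one-step contrastive divergence (CD-1) to train each layer's RBM; biases and the final fine-tuning pass are omitted, and the names are ours, not Hinton's code.

```python
# Greedy layer-wise pretraining with RBMs, trained by CD-1.
import torch

def cd1_step(v, W, lr=0.01):
    """One CD-1 update of an RBM weight matrix W for a visible batch v."""
    ph = torch.sigmoid(v @ W)              # hidden probabilities from data
    h = torch.bernoulli(ph)                # sample binary hidden states
    pv = torch.sigmoid(h @ W.t())          # reconstruct the visibles
    ph2 = torch.sigmoid(pv @ W)            # hidden probabilities from recon
    W += lr * (v.t() @ ph - pv.t() @ ph2) / v.size(0)

# Greedy stacking: train the first RBM on data, then feed its hidden
# activations as the "data" for the next RBM, one layer at a time.
v = torch.bernoulli(torch.rand(32, 784))   # a batch of binary inputs
W1 = 0.01 * torch.randn(784, 128)
for _ in range(10):
    cd1_step(v, W1)
h1 = torch.sigmoid(v @ W1)                 # input for the next layer's RBM
```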
Journal Article (DOI)

Representation Learning: A Review and New Perspectives

TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.
Book

Learning Deep Architectures for AI

TL;DR: The motivations and principles behind learning algorithms for deep architectures are discussed, in particular those that use unsupervised learning of single-layer models, such as Restricted Boltzmann Machines, as building blocks to construct deeper models such as Deep Belief Networks.