Learning Program Embeddings to Propagate Feedback on Student Code

Home
/
Papers
/
Learning Program Embeddings to Propagate Feedback on Student Code

Proceedings Article•

Learning Program Embeddings to Propagate Feedback on Student Code

Chris Piech¹, Jonathan Huang², Andy Nguyen¹, Mike Phulsuksombati¹, Mehran Sahami¹, Leonidas J. Guibas¹ - Show less +2 more•Institutions (2)

Stanford University¹, Google²

06 Jul 2015-pp 1093-1102

TL;DR: A neural network method is introduced to encode programs as a linear mapping from an embedded precondition space to an embedded postcondition space and an algorithm for feedback at scale is proposed using these linear maps as features.

read less

Abstract: Providing feedback, both assessing final work and giving hints to stuck students, is difficult for open-ended assignments in massive online classes which can range from thousands to millions of students. We introduce a neural network method to encode programs as a linear mapping from an embedded precondition space to an embedded postcondition space and propose an algorithm for feedback at scale using these linear maps as features. We apply our algorithm to assessments from the Code.org Hour of Code and Stanford University's CS1 course, where we propagate human comments on student assignments to orders of magnitude more submissions.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Proceedings Article•

Convolutional neural networks over tree structures for programming language processing

[...]

Lili Mou¹, Ge Li¹, Lu Zhang¹, Tao Wang², Zhi Jin¹ - Show less +1 more•Institutions (2)

Peking University¹, Stanford University²

12 Feb 2016

TL;DR: In this article, a tree-based convolutional neural network (TBCNN) is proposed for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information.

...read moreread less

Abstract: Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. However, different from a natural language sentence, a program contains rich, explicit, and complicated structural information. Hence, traditional NLP models may be inappropriate for programs. In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information. TBCNN is a generic architecture for programming language processing; our experiments show its effectiveness in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.

...read moreread less

551 citations

Posted Content•

A Survey of Machine Learning for Big Code and Naturalness

[...]

Miltiadis Allamanis¹, Earl T. Barr², Premkumar Devanbu³, Charles Sutton⁴•Institutions (4)

Microsoft¹, University College London², University of California, Davis³, University of Edinburgh⁴

18 Sep 2017-arXiv: Software Engineering

TL;DR: This article presents a taxonomy based on the underlying design principles of each model and uses it to navigate the literature and discuss cross-cutting and application-specific challenges and opportunities.

...read moreread less

Abstract: Research at the intersection of machine learning, programming languages, and software engineering has recently taken important steps in proposing learnable probabilistic models of source code that exploit code's abundance of patterns. In this article, we survey this work. We contrast programming languages against natural languages and discuss how these similarities and differences drive the design of probabilistic models. We present a taxonomy based on the underlying design principles of each model and use it to navigate the literature. Then, we review how researchers have adapted these models to application areas and discuss cross-cutting and application-specific challenges and opportunities.

...read moreread less

503 citations

Cites background from "Learning Program Embeddings to Prop..."

...[155] Syntax + State Student Feedback Distributed Student Feedback...
[...]

Journal Article•DOI•

A Survey of Machine Learning for Big Code and Naturalness

[...]

Miltiadis Allamanis¹, Earl T. Barr², Premkumar Devanbu³, Charles Sutton⁴•Institutions (4)

Microsoft¹, University College London², University of California, Davis³, University of Edinburgh⁴

31 Jul 2018-ACM Computing Surveys

TL;DR: A survey of machine learning, programming languages, and software engineering has recently taken important steps in proposing learnable probabilistic models of source code that exploit the abundance of patterns of code.

...read moreread less

Abstract: Research at the intersection of machine learning, programming languages, and software engineering has recently taken important steps in proposing learnable probabilistic models of source code that exploit the abundance of patterns of code. In this article, we survey this work. We contrast programming languages against natural languages and discuss how these similarities and differences drive the design of probabilistic models. We present a taxonomy based on the underlying design principles of each model and use it to navigate the literature. Then, we review how researchers have adapted these models to application areas and discuss cross-cutting and application-specific challenges and opportunities.

...read moreread less

502 citations

Proceedings Article•

code2seq: Generating Sequences from Structured Representations of Code

[...]

Uri Alon¹, Shaked Brody², Omer Levy³, Eran Yahav²•Institutions (3)

University of Missouri–Kansas City¹, Technion – Israel Institute of Technology², Facebook³

27 Sep 2018

TL;DR: This model represents a code snippet as the set of compositional paths in its abstract syntax tree and uses attention to select the relevant paths while decoding and significantly outperforms previous models that were specifically designed for programming languages, as well as state-of-the-art NMT models.

...read moreread less

Abstract: The ability to generate natural language sequences from source code snippets has a variety of applications such as code summarization, documentation, and retrieval. Sequence-to-sequence (seq2seq) models, adopted from neural machine translation (NMT), have achieved state-of-the-art performance on these tasks by treating source code as a sequence of tokens. We present ${\rm {\scriptsize CODE2SEQ}}$: an alternative approach that leverages the syntactic structure of programming languages to better encode source code. Our model represents a code snippet as the set of compositional paths in its abstract syntax tree (AST) and uses attention to select the relevant paths while decoding. We demonstrate the effectiveness of our approach for two tasks, two programming languages, and four datasets of up to $16$M examples. Our model significantly outperforms previous models that were specifically designed for programming languages, as well as state-of-the-art NMT models. An interactive online demo of our model is available at this http URL. Our code, data and trained models are available at this http URL.

...read moreread less

486 citations

Proceedings Article•

DeepFix: Fixing Common C Language Errors by Deep Learning

[...]

Rahul Gupta¹, Soham Pal¹, Aditya Kanade¹, Shirish Shevade¹•Institutions (1)

Indian Institute of Science¹

12 Feb 2017

TL;DR: DeepFix is a multi-layered sequence-to-sequence neural network with attention which is trained to predict erroneous program locations along with the required correct statements and could fix 1881 programs completely and 1338 programs partially.

...read moreread less

Abstract: The problem of automatically fixing programming errors is a very active research topic in software engineering. This is a challenging problem as fixing even a single error may require analysis of the entire program. In practice, a number of errors arise due to programmer's inexperience with the programming language or lack of attention to detail. We call these common programming errors. These are analogous to grammatical errors in natural languages. Compilers detect such errors, but their error messages are usually inaccurate. In this work, we present an end-to-end solution, called DeepFix, that can fix multiple such errors in a program without relying on any external tool to locate or fix them. At the heart of DeepFix is a multi-layered sequence-to-sequence neural network with attention which is trained to predict erroneous program locations along with the required correct statements. On a set of 6971 erroneous C programs written by students for 93 programming tasks, DeepFix could fix 1881 (27%) programs completely and 1338 (19%) programs partially.

...read moreread less

415 citations

Cites methods from "Learning Program Embeddings to Prop..."

...Piech et al. (2015) proposed a neural network based approach to find program representations and used them for automatically propagating instructor feedback to students in a massive course....
[...]

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

A Complexity Measure

[...]

Thomas J. McCabe¹•Institutions (1)

National Security Agency¹

01 Jul 1976-IEEE Transactions on Software Engineering

TL;DR: Several properties of the graph-theoretic complexity are proved which show, for example, that complexity is independent of physical size and complexity depends only on the decision structure of a program.

...read moreread less

Abstract: This paper describes a graph-theoretic complexity measure and illustrates how it can be used to manage and control program complexity. The paper first explains how the graph-theory concepts apply and gives an intuitive explanation of the graph concepts in programming terms. The control graphs of several actual Fortran programs are then presented to illustrate the correlation between intuitive complexity and the graph-theoretic complexity. Several properties of the graph-theoretic complexity are then proved which show, for example, that complexity is independent of physical size (adding or subtracting functional statements leaves complexity unchanged) and complexity depends only on the decision structure of a program.

...read moreread less

5,097 citations

"Learning Program Embeddings to Prop..." refers background in this paper

...The distribution of cyclomatic complexity (McCabe, 1976), a measure of code structure, reflects this wide range (shown in gray in Figures 4(a),(b))....
[...]

Journal Article•

An Axiomatic Basis for Computer Programming

[...]

Tony Hoare

01 Jan 1969-Communications of The ACM

4,463 citations

Neural Turing Machines

[...]

Alex Graves¹, Greg Wayne¹, Ivo Danihelka¹•Institutions (1)

Google¹

20 Oct 2014

TL;DR: A combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-toend, allowing it to be efficiently trained with gradient descent.

...read moreread less

Abstract: We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-toend, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.

...read moreread less

1,471 citations

"Learning Program Embeddings to Prop..." refers background in this paper

...However, there has been recent progress in the deep learning community towards models capable of simulating Turing machines (Graves et al., 2014)....
[...]

Proceedings Article•

Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions

[...]

Richard Socher¹, Jeffrey Pennington¹, Eric H. Huang¹, Andrew Y. Ng¹, Christopher D. Manning¹ - Show less +1 more•Institutions (1)

Stanford University¹

27 Jul 2011

TL;DR: A novel machine learning framework based on recursive autoencoders for sentence-level prediction of sentiment label distributions that outperform other state-of-the-art approaches on commonly used datasets, without using any pre-defined sentiment lexica or polarity shifting rules.

...read moreread less

Abstract: We introduce a novel machine learning framework based on recursive autoencoders for sentence-level prediction of sentiment label distributions. Our method learns vector space representations for multi-word phrases. In sentiment prediction tasks these representations outperform other state-of-the-art approaches on commonly used datasets, such as movie reviews, without using any pre-defined sentiment lexica or polarity shifting rules. We also evaluate the model's ability to predict sentiment distributions on a new dataset based on confessions from the experience project. The dataset consists of personal user stories annotated with multiple labels which, when aggregated, form a multinomial distribution that captures emotional reactions. Our algorithm can more accurately predict distributions over such labels compared to several competitive baselines.

...read moreread less

1,315 citations

"Learning Program Embeddings to Prop..." refers background or methods in this paper

...We discuss the process by which such a dataset can be obtained in Section 5....
[...]
...Our models are related to recent work from the NLP and deep learning communities on recursive neural networks, particularly for modeling semantics in sentences or symbolic expressions (Socher et al., 2013; 2011; Zaremba et al., 2014; Bowman, 2013)....
[...]

Journal Article•DOI•

Axiomatic Basis for Computer Programming

[...]

Lauretta O. Osho, Francisca Ogwueleka, Oluwafemi Osho¹•Institutions (1)

Federal University of Technology Minna¹

01 Oct 2013

TL;DR: Using the axiomatic basis the completeness of two variants of integer multiplication program is proved and results show that computer programs can actually be verified sufficiently for correctness without necessarily testing them, or more practically put, to complement their testing.

...read moreread less

Abstract: This paper considers a formal method, known as axiomatic semantics, used to prove the correctness of a computer program. This formal method extracts, using some proof rules, the mathematical verification conditions from a computer program. The axioms of program flow, including, sequential flow, iteration, and alternation flows are presented. Using the axiomatic basis the completeness of two variants of integer multiplication program is proved. Results show that computer programs can actually be verified sufficiently for correctness without necessarily testing them, or more practically put, to complement their testing.

...read moreread less

907 citations