Learning Program Embeddings to Propagate Feedback on Student Code

Open Access Proceedings Article

Chris Piech, +5 more

- pp 1093-1102

TLDR

A neural network method is introduced to encode programs as a linear mapping from an embedded precondition space to an embedded postcondition space and an algorithm for feedback at scale is proposed using these linear maps as features.

Abstract:

Providing feedback, both assessing final work and giving hints to stuck students, is difficult for open-ended assignments in massive online classes, which can range from thousands to millions of students. We introduce a neural network method to encode programs as a linear mapping from an embedded precondition space to an embedded postcondition space and propose an algorithm for feedback at scale using these linear maps as features. We apply our algorithm to assessments from the Code.org Hour of Code and Stanford University's CS1 course, where we propagate human comments on student assignments to orders of magnitude more submissions.
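
The core idea in the abstract, a program represented as a linear map from embedded preconditions to embedded postconditions, can be illustrated with a small sketch. This is not the authors' training procedure: the paper learns the state embeddings with a neural network, whereas the sketch below assumes the embeddings are already given (here, random vectors) and fits the matrix by ridge-regularized least squares. The names `fit_program_matrix`, `pre`, `post`, and `ridge` are hypothetical and chosen only for illustration.

```python
import numpy as np


def fit_program_matrix(pre, post, ridge=1e-3):
    """Fit a matrix M such that post[i] is approximately M @ pre[i].

    pre, post: arrays of shape (n_examples, d) holding embedded
    precondition and postcondition vectors (assumed given here).
    ridge: small regularization weight (an illustrative choice).
    """
    d = pre.shape[1]
    # Closed-form ridge solution: M^T = (X^T X + ridge * I)^{-1} X^T Y
    gram = pre.T @ pre + ridge * np.eye(d)
    m_transposed = np.linalg.solve(gram, pre.T @ post)
    return m_transposed.T


# Synthetic stand-in for one student program: a hidden linear map
# applied to random "embedded preconditions".
rng = np.random.default_rng(0)
d = 4
true_M = rng.normal(size=(d, d))
pre = rng.normal(size=(50, d))
post = pre @ true_M.T

M = fit_program_matrix(pre, post, ridge=1e-8)

# Flatten the matrix to use it as a feature vector for the program,
# e.g. as input to a classifier that predicts which comment applies.
features = M.ravel()
```

In this toy setting the fitted matrix recovers the hidden map almost exactly, and the flattened matrix serves as a fixed-length feature vector. Under the same assumptions, one matrix would be fitted per submission and the resulting vectors passed to any standard classifier to propagate instructor comments.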

Citations

Posted Content

Neural Program Synthesis with a Differentiable Fixer.

Matej Balog, +3 more

- 19 Jun 2020 -

arXiv: Machine Learning

TL;DR: This work presents a new program synthesis approach that combines an encoder-decoder based synthesis architecture with a differentiable program fixer, and shows that the addition of the fixer module leads to a significant improvement on synthesis accuracy compared to using beam search.

Proceedings ArticleDOI

Source Code Summarization Using Attention-Based Keyword Memory Networks

Yun Seok Choi, +2 more

TL;DR: This work proposes a two-phase model, consisting of a keyword predictor and a description generator, that can effectively reduce the semantic gap and generate more accurate descriptions of source code.

Proceedings Article

Synthesizing Tasks for Block-based Programming

Umair Z. Ahmed, +6 more

TL;DR: This paper formalizes the problem of synthesizing visual programming tasks and proposes a novel methodology to automatically generate a set of new tasks along with solution codes such that tasks T^{in} and T^{out} are conceptually similar but visually dissimilar.

Journal ArticleDOI

Hyperbolic Function Embedding: Learning Hierarchical Representation for Functions of Source Code in Hyperbolic Space

Lu Mingming, +6 more

- 18 Feb 2019 -

Symmetry

TL;DR: A novel hyperbolic function embedding (HFE) method is proposed that learns a distributed, hierarchical representation for each function via the Poincaré ball model and is more compact, requiring lower dimensionality, than existing graph embedding methods.

Proceedings ArticleDOI

Unleashing the Power of Compiler Intermediate Representation to Enhance Neural Program Embeddings

Zongjie Li, +6 more

TL;DR: This paper introduces IRGEN, a framework based on genetic algorithms (GA), to identify (near-)optimal sequences of optimization flags that can significantly improve embedding quality, and uses IRGEN to find optimal sequences of LLVM optimization flags by performing GA on source code datasets.

References

Proceedings Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

John C. Duchi, +2 more

TL;DR: Adaptive subgradient methods dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning, which makes it possible to find needles in haystacks in the form of very predictive but rarely seen features.

Journal Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

John C. Duchi, +2 more

- 01 Feb 2011 -

Journal of Machine Learning Research

TL;DR: This work describes and analyzes an apparatus for adaptively modifying the proximal function, which significantly simplifies setting a learning rate and results in regret guarantees that are provably as good as the best proximal function that can be chosen in hindsight.

Journal Article

Random search for hyper-parameter optimization

James Bergstra, +1 more

- 01 Mar 2012 -

Journal of Machine Learning Research

TL;DR: This paper shows empirically and theoretically that randomly chosen trials are more efficient for hyper-parameter optimization than trials on a grid, and shows that random search is a natural baseline against which to judge progress in the development of adaptive (sequential) hyper-parameter optimization algorithms.

Proceedings Article

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank

Richard Socher, +6 more

TL;DR: This work introduces a Sentiment Treebank with fine-grained sentiment labels for 215,154 phrases in the parse trees of 11,855 sentences, which presents new challenges for sentiment compositionality, and the Recursive Neural Tensor Network to address them.

Book

A complexity measure

Thomas J. McCabe

TL;DR: This paper presents a graph-theoretic complexity measure for managing and controlling program complexity. The measure is independent of a program's physical size and depends only on its decision structure.

Related Papers (5)

Convolutional neural networks over tree structures for programming language processing

On the naturalness of software

Learning to Represent Programs with Graphs

DeepFix: Fixing Common C Language Errors by Deep Learning

Long short-term memory