
Showing papers by "Christian Szegedy published in 2020"


Journal ArticleDOI
Aditya Paliwal1, Sarah M. Loos1, Markus N. Rabe1, Kshitij Bansal1, Christian Szegedy1 
03 Apr 2020
TL;DR: This paper presents the first use of graph neural networks (GNNs) for higher-order proof search and demonstrates that GNNs can improve upon state-of-the-art results in this domain.
Abstract: This paper presents the first use of graph neural networks (GNNs) for higher-order proof search and demonstrates that GNNs can improve upon state-of-the-art results in this domain. Interactive, higher-order theorem provers allow for the formalization of most mathematical theories and have been shown to pose a significant challenge for deep learning. Higher-order logic is highly expressive and, even though it is well-structured with a clearly defined grammar and semantics, there still remains no well-established method to convert formulas into graph-based representations. In this paper, we consider several graphical representations of higher-order logic and evaluate them against the HOList benchmark for higher-order theorem proving.
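A minimal sketch of one such graphical representation, assuming formulas arrive as S-expressions (an illustrative input format and sharing scheme, not the paper's exact HOList encoding): each node is labeled by a head symbol, and identical subterms are merged so the formula becomes a DAG rather than a tree.

```python
def parse_sexp(text):
    """Parse an S-expression string into nested Python lists."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    def walk(i):
        if tokens[i] == "(":
            node, i = [], i + 1
            while tokens[i] != ")":
                child, i = walk(i)
                node.append(child)
            return node, i + 1
        return tokens[i], i + 1
    tree, _ = walk(0)
    return tree

def term_to_graph(term):
    """Return (node_labels, edges); shared subterms get a single node."""
    labels, edges, cache = [], [], {}
    def visit(t):
        key = repr(t)
        if key in cache:                 # subterm sharing -> DAG, not tree
            return cache[key]
        idx = len(labels)
        labels.append(t[0] if isinstance(t, list) else t)
        cache[key] = idx
        if isinstance(t, list):
            for child in t[1:]:
                edges.append((idx, visit(child)))
        return idx
    visit(term)
    return labels, edges

labels, edges = term_to_graph(parse_sexp("(= (+ x y) (+ y x))"))
```

Note that the variables `x` and `y` each appear as a single node, so the two sides of the equality share structure; a GNN encoder can then pass messages along `edges` over the `labels`.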

53 citations


Posted Content
TL;DR: It is found that models trained on the skip-tree task show surprisingly strong mathematical reasoning abilities, and outperform models trained on standard skip-sequence tasks.
Abstract: We examine whether self-supervised language modeling applied to mathematical formulas enables logical reasoning. We suggest several logical reasoning tasks that can be used to evaluate language models trained on formal mathematical statements, such as type inference, suggesting missing assumptions and completing equalities. To train language models for formal mathematics, we propose a novel skip-tree task. We find that models trained on the skip-tree task show surprisingly strong mathematical reasoning abilities, and outperform models trained on standard skip-sequence tasks. We also analyze the models' ability to formulate new conjectures by measuring how often the predictions are provable and useful in other proofs.
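A minimal sketch of how a skip-tree training example might be constructed, under the assumption that formulas are tokenized S-expressions (the `<PREDICT>` token name is illustrative): one complete subtree is cut out of the input and becomes the prediction target, in contrast to skip-sequence tasks that mask arbitrary token spans.

```python
def skip_tree_example(tokens, start):
    """Mask the complete subtree starting at position `start`.

    Returns (input_sequence, target_subtree).
    """
    depth, end = 0, start
    while True:                      # scan to the matching close paren
        if tokens[end] == "(":
            depth += 1
        elif tokens[end] == ")":
            depth -= 1
        end += 1
        if depth == 0:
            break
    masked = tokens[:start] + ["<PREDICT>"] + tokens[end:]
    return masked, tokens[start:end]

formula = "( = ( + x y ) ( + y x ) )".split()
inp, target = skip_tree_example(formula, start=2)
```

Here the masked span is always a syntactically complete subterm, which is what distinguishes the task from span masking over a flat token sequence.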

27 citations


Book ChapterDOI
Christian Szegedy1
26 Jul 2020
TL;DR: This paper argues that autoformalization is a promising path for systems to learn sophisticated, general purpose reasoning in all domains of mathematics and computer science and provides the outline for a realistic path towards those goals.
Abstract: An autoformalization system is an AI that learns to read natural language content and to turn it into an abstract, machine verifiable formalization, ideally by bootstrapping from unlabeled training data with minimum human interaction. This is a difficult task in general, one that would require strong automated reasoning and automated natural language processing capabilities. In this paper, it is argued that autoformalization is a promising path for systems to learn sophisticated, general purpose reasoning in all domains of mathematics and computer science. This could have far reaching implications not just for mathematical research, but also for software synthesis. Here I provide the outline for a realistic path towards those goals and give a survey of recent results that support the feasibility of this direction.

18 citations


Proceedings Article
Dennis Lee1, Christian Szegedy1, Markus N. Rabe1, Sarah M. Loos1, Kshitij Bansal1 
30 Apr 2020
TL;DR: In this article, graph neural networks are shown to make non-trivial predictions about the rewrite success of formal statements, even when they propagate predicted latent representations for several steps, and the resulting embeddings are used to recover the semantic features of the corresponding formal statements.
Abstract: We design and conduct a simple experiment to study whether neural networks can perform several steps of approximate reasoning in a fixed dimensional latent space. The set of rewrites (i.e. transformations) that can be successfully performed on a statement represents essential semantic features of the statement. We can compress this information by embedding the formula in a vector space, such that the vector associated with a statement can be used to predict whether a statement can be rewritten by other theorems. Predicting the embedding of a formula generated by some rewrite rule is naturally viewed as approximate reasoning in the latent space. In order to measure the effectiveness of this reasoning, we perform approximate deduction sequences in the latent space and use the resulting embedding to inform the semantic features of the corresponding formal statement (which is obtained by performing the corresponding rewrite sequence using real formulas). Our experiments show that graph neural networks can make non-trivial predictions about the rewrite-success of statements, even when they propagate predicted latent representations for several steps. Since our corpus of mathematical formulas includes a wide variety of mathematical disciplines, this experiment is a strong indicator for the feasibility of deduction in latent space in general.
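An illustrative sketch of the latent-space setup described above, with random matrices standing in for trained networks (the linear rewrite map and bilinear success scorer are placeholder assumptions, not the paper's architecture): a statement embedding is propagated through several predicted rewrite steps without ever decoding back to a concrete formula, and rewrite success is then queried from the final latent vector.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 16

# Placeholder "trained" parameters (random stand-ins).
W_rewrite = rng.normal(size=(DIM, DIM)) / np.sqrt(DIM)  # latent rewrite step
W_score = rng.normal(size=(DIM, DIM)) / np.sqrt(DIM)    # success scorer

def apply_rewrite(latent):
    """Predict the embedding of the rewritten statement, in latent space."""
    return np.tanh(W_rewrite @ latent)

def rewrite_success_prob(latent, theorem_latent):
    """Predicted probability that the theorem rewrites the statement."""
    return 1.0 / (1.0 + np.exp(-(latent @ W_score @ theorem_latent)))

statement = rng.normal(size=DIM)   # stand-in for an encoder's output
theorem = rng.normal(size=DIM)

latent = statement
for _ in range(3):                 # several approximate deduction steps
    latent = apply_rewrite(latent)
p = rewrite_success_prob(latent, theorem)
```

The point of the experiment is that predictions like `p` remain informative even after `latent` has been produced by chained predicted embeddings rather than by re-encoding real formulas.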

16 citations


Posted Content
08 Jun 2020
TL;DR: To train language models for formal mathematics, a novel skip-tree task is proposed, which outperforms standard language modeling tasks on reasoning benchmarks; the models' ability to formulate new conjectures is also analyzed.
Abstract: We examine whether language modeling applied to mathematical formulas enables logical reasoning. We suggest several logical reasoning tasks that can be used to evaluate language models trained on formal mathematical statements, such as type inference, suggesting missing assumptions and completing equalities. To train language models for formal mathematics, we propose a novel skip-tree task, which outperforms standard language modeling tasks on our reasoning benchmarks. We also analyze the models' ability to formulate new conjectures by measuring how often the predictions that do not fit the ground truth or any training data turn out to be true and useful statements.

4 citations


Book ChapterDOI
23 Sep 2020
TL;DR: The main results are that training deep residual neural models specifically for retrieval purposes can yield significant gains when they are used to augment existing embeddings, and that deeper models are superior for this task.
Abstract: Text embeddings representing natural language documents in a semantic vector space can be used for document retrieval using nearest neighbor lookup. In order to study the feasibility of neural models specialized for retrieval in a semantically meaningful way, we suggest the use of the Stanford Question Answering Dataset (SQuAD) in an open-domain question answering context, where the first task is to find paragraphs useful for answering a given question. First, we compare the quality of various text-embedding methods on the performance of retrieval and give an extensive empirical comparison of the performance of various non-augmented base embeddings with and without IDF weighting. Our main results are that training deep residual neural models specifically for retrieval purposes can yield significant gains when they are used to augment existing embeddings, and that deeper models are superior for this task. The best baseline embeddings augmented by our learned neural approach improve the top-1 paragraph recall of the system by 14%.
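A minimal sketch of the baseline retrieval setup described above, with a toy corpus and one-hot word vectors standing in for SQuAD paragraphs and learned text embeddings (both are illustrative assumptions that keep the sketch deterministic): paragraphs are embedded as IDF-weighted bags of words, and a question retrieves its top-1 paragraph by cosine similarity.

```python
import math
import numpy as np

paragraphs = [
    "the eiffel tower is in paris",
    "the great wall is in china",
    "python is a programming language",
]
question = "where is the eiffel tower"

vocab = sorted({w for text in paragraphs + [question] for w in text.split()})
index = {w: i for i, w in enumerate(vocab)}
n_docs = len(paragraphs)
df = {w: sum(w in p.split() for p in paragraphs) for w in vocab}
idf = {w: math.log((1 + n_docs) / (1 + df[w])) + 1.0 for w in vocab}

def embed(text):
    """IDF-weighted bag-of-words vector, normalized to unit length."""
    v = np.zeros(len(vocab))
    for w in text.split():
        v[index[w]] += idf[w]
    return v / np.linalg.norm(v)

doc_matrix = np.stack([embed(p) for p in paragraphs])
scores = doc_matrix @ embed(question)   # cosine similarities (unit vectors)
best = int(np.argmax(scores))           # index of the top-1 paragraph
```

The IDF weighting down-weights common words such as "the" and "is", so the rare terms "eiffel" and "tower" dominate the match; the paper's augmentation then trains a deep residual model on top of such base embeddings specifically for the retrieval objective.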

3 citations