Principal type-schemes for functional programs∗

Luis Damas† and Robin Milner

First published in POPL '82: Proceedings of the 9th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, ACM, pp. 207–212

Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the ACM copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Association for Computing Machinery. To copy otherwise, or to republish, requires a fee and/or specific permission.

∗ Re-keyed 12 October 2010 by Ian Grant, iang@pobox.com

† The work of this author is supported by the Portuguese Instituto Nacional de Investigacao Cientifica

1 Introduction

This paper is concerned with the polymorphic type discipline of ML, which is a general purpose functional programming language, although it was first introduced as a metalanguage (whence its name) for constructing proofs in the LCF proof system [4]. The type discipline was studied in [5] where it was shown to be semantically sound, in a sense made precise below, but where one important question was left open: does the type-checking algorithm, or more precisely the type assignment algorithm (since types are assigned by the compiler, and need not be mentioned by the programmer), find the most general type possible for every expression and declaration? Here we answer the question in the affirmative, for the purely applicative part of ML. It follows immediately that it is decidable whether a program is well-typed, in contrast with the elegant and slightly more permissive type discipline of Coppo [1].

After several years of successful use of the language, both in LCF and other research, and in teaching to undergraduates, it has become important to answer these questions, particularly because the combination of flexibility (due to polymorphism), robustness (due to semantic soundness) and detection of errors at compile time has proved to be one of the strongest aspects of ML.

The discipline can be well illustrated by a small example. Let us define in ML the function map, which maps a given function over a given list, that is

map f [x1; ...; xn] = [f(x1),...,f(xn)]

The required declaration is

letrec map f s = if null s then nil
                 else cons(f(hd s)) (map f (tl s))

The type checker will deduce a type-scheme for map from existing type-schemes for null, nil, cons, hd and tl; the term type-scheme is appropriate since all these objects are polymorphic. In fact from

null : ∀α(α list → bool)
nil : ∀α(α list)
cons : ∀α(α → (α list → α list))
hd : ∀α(α list → α)
tl : ∀α(α list → α list)

will be deduced

map : ∀α∀β((α → β) → (α list → β list)).
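As an informal illustration only, the declaration of map can be transliterated into Python. This is a sketch under stated assumptions: lists are modelled as immutable tuples, the name map_list and the five helper definitions are ours rather than the paper's, and the annotations are written by hand to mirror the deduced type-scheme. Python itself does not infer them.

```python
from typing import Callable, Tuple, TypeVar

A = TypeVar("A")  # plays the role of the generic variable α
B = TypeVar("B")  # plays the role of the generic variable β

# Hypothetical stand-ins for the ML primitives named in the text.
nil: Tuple = ()

def null(s: Tuple[A, ...]) -> bool:
    return len(s) == 0

def cons(x: A, s: Tuple[A, ...]) -> Tuple[A, ...]:
    return (x,) + s

def hd(s: Tuple[A, ...]) -> A:
    return s[0]

def tl(s: Tuple[A, ...]) -> Tuple[A, ...]:
    return s[1:]

def map_list(f: Callable[[A], B], s: Tuple[A, ...]) -> Tuple[B, ...]:
    # letrec map f s = if null s then nil else cons (f (hd s)) (map f (tl s))
    return nil if null(s) else cons(f(hd(s)), map_list(f, tl(s)))
```

The same definition can then be used at different instances of its type-scheme, for example with a function on integers or with a function from strings to integers.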

Types are built from type constants (bool, ...) and type variables (α, β, ...) using type operators (such as infixed → for functions and postfixed list for lists); a type-scheme is a type with (possibly) quantification of type variables at the outermost.

Thus, the main result of this paper is that the type-scheme deduced for such a declaration (and more generally, for any ML expression) is a principal type-scheme, i.e. that any other type-scheme for the declaration is a generic instance of it. This is a generalisation of Hindley's result for Combinatory Logic [3].

ML may be contrasted with Algol 68, in which there is no polymorphism, and with Russell [2], in which parametric types appear explicitly as arguments to polymorphic functions. The generic types of Ada may be compared with type-schemes. For simplicity, our definitions and results here are formulated for a skeletal language, since their extension to ML is a routine matter. For example recursion is omitted since it can be introduced by simply adding the polymorphic fixed-point operator

fix : ∀α((α → α) → α)

and likewise for conditional expressions.

2 The language

Assuming a set Id of identifiers x the language Exp of expressions e is given by the syntax

e ::= x | e e′ | λx.e | let x = e in e′

(where parentheses may be used to avoid ambiguity). Only the last clause extends the λ-calculus. Indeed for type checking purposes every let expression could be eliminated (by replacing x by e everywhere in e′), except for the important consideration that in on-line use of ML declarations

let x = e

are allowed, whose scope (e′) is the remainder of the on-line session. As illustrated in the introduction, it must be possible to assign type-schemes to the identifiers thus declared.
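The remark that every let expression could be eliminated by replacing x by e in e′ can be sketched in Python. The tuple encoding of Exp and the names replace and expand_lets are hypothetical, chosen for illustration. The sketch also assumes that bound identifiers are distinct from the free identifiers of the expression being substituted, so that no variable capture can occur.

```python
# Expressions as nested tuples (an assumed encoding, not the paper's):
#   ("id", x) | ("app", e, e2) | ("lam", x, e) | ("let", x, e, e2)

def replace(x, e, body):
    """Replace the free occurrences of identifier x in body by e."""
    tag = body[0]
    if tag == "id":
        return e if body[1] == x else body
    if tag == "app":
        return ("app", replace(x, e, body[1]), replace(x, e, body[2]))
    if tag == "lam":
        # An inner binding of x shadows the outer one.
        if body[1] == x:
            return body
        return ("lam", body[1], replace(x, e, body[2]))
    if tag == "let":
        bound = replace(x, e, body[2])
        scope = body[3] if body[1] == x else replace(x, e, body[3])
        return ("let", body[1], bound, scope)
    raise ValueError("unknown expression: %r" % (body,))

def expand_lets(e):
    """Eliminate every let by substituting the bound expression into its scope."""
    tag = e[0]
    if tag == "id":
        return e
    if tag == "app":
        return ("app", expand_lets(e[1]), expand_lets(e[2]))
    if tag == "lam":
        return ("lam", e[1], expand_lets(e[2]))
    if tag == "let":
        return replace(e[1], expand_lets(e[2]), expand_lets(e[3]))
    raise ValueError("unknown expression: %r" % (e,))
```

For instance, let i = λx.x in i i expands to (λx.x)(λx.x). The expansion duplicates the bound expression at every use, which is one practical reason to type let directly rather than to expand it.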

Note that types are absent from the language Exp. Assuming a set of type variables α and of primitive types ι, the syntax of types τ and of type-schemes σ is given by

τ ::= α | ι | τ → τ
σ ::= τ | ∀ασ

A type-scheme ∀α₁...∀αₙ τ (which we may write ∀α₁...αₙ τ) has generic type variables α₁...αₙ. A monotype µ is a type containing no type variables.
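One possible machine representation of this syntax is sketched below in Python. The tuple encoding and the helper names (var, con, fun, forall, free_vars, is_monotype) are assumptions made for illustration and do not come from the paper.

```python
# Types and type-schemes as nested tuples (an assumed encoding):
#   ("var", name)            type variable α
#   ("con", name)            primitive type ι
#   ("fun", arg, res)        function type τ → τ
#   ("forall", names, body)  type-scheme ∀α1...αn τ

def var(name):
    return ("var", name)

def con(name):
    return ("con", name)

def fun(arg, res):
    return ("fun", arg, res)

def forall(names, body):
    return ("forall", tuple(names), body)

def free_vars(t):
    """Names of the type variables occurring free in a type or type-scheme."""
    tag = t[0]
    if tag == "var":
        return {t[1]}
    if tag == "con":
        return set()
    if tag == "fun":
        return free_vars(t[1]) | free_vars(t[2])
    if tag == "forall":
        # Generic (quantified) variables are bound, hence not free.
        return free_vars(t[2]) - set(t[1])
    raise ValueError("unknown type: %r" % (t,))

def is_monotype(t):
    """A monotype is a type (not a quantified scheme) with no type variables."""
    return t[0] != "forall" and not free_vars(t)

# The type-scheme ∀α(α → α) of the identity function.
identity_scheme = forall(["a"], fun(var("a"), var("a")))
```

With this encoding the generic variables of a scheme are exactly the names stored in its "forall" node, and every other variable in the body is free.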

3 Type instantiation

If S is a substitution of types for type variables, often written [τ₁/α₁, ..., τₙ/αₙ] or [τᵢ/αᵢ], and σ is a type-scheme, then Sσ is the type-scheme obtained by replacing each free occurrence of αᵢ in σ by τᵢ, renaming the generic variables of σ if necessary. Then Sσ is called an instance of σ; the notions of substitution and instance extend naturally to larger syntactic constructs containing type-schemes.

By contrast a type-scheme σ = ∀α₁...αₘ τ has a generic instance σ′ = ∀β₁...βₙ τ′ if τ′ = [τᵢ/αᵢ]τ for some types τ₁, ..., τₘ and the βⱼ are not free in σ. In this case we shall write σ > σ′. Note that instantiation acts on free variables, while generic instantiation acts on bound variables. It follows that σ > σ′ implies Sσ > Sσ′.
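The two notions can be contrasted in a small Python sketch. The encoding of types as tuples, ("var", name), ("con", name), ("fun", arg, res) and ("forall", names, body), and all function names are assumptions for illustration. The substitution shown does not rename generic variables; it assumes they do not occur in S.

```python
def free_vars(t):
    """Names of the type variables occurring free in a type or type-scheme."""
    tag = t[0]
    if tag == "var":
        return {t[1]}
    if tag == "con":
        return set()
    if tag == "fun":
        return free_vars(t[1]) | free_vars(t[2])
    return free_vars(t[2]) - set(t[1])  # ("forall", names, body)

def split(sigma):
    """Return (generic variables, body); a plain type has no generic variables."""
    return (sigma[1], sigma[2]) if sigma[0] == "forall" else ((), sigma)

def subst(S, t):
    """Instance St: replace free variables using the dict S (name -> type).
    Assumption: generic variables of t do not occur in S (no renaming needed)."""
    tag = t[0]
    if tag == "var":
        return S.get(t[1], t)
    if tag == "con":
        return t
    if tag == "fun":
        return ("fun", subst(S, t[1]), subst(S, t[2]))
    inner = {a: ty for a, ty in S.items() if a not in t[1]}
    return ("forall", t[1], subst(inner, t[2]))

def match(pattern, target, generic, binding):
    """Extend binding so that [binding]pattern == target, binding only
    the generic variables; free variables and primitives must coincide."""
    if pattern[0] == "var" and pattern[1] in generic:
        if pattern[1] in binding:
            return binding[pattern[1]] == target
        binding[pattern[1]] = target
        return True
    if pattern[0] == "fun":
        return (target[0] == "fun"
                and match(pattern[1], target[1], generic, binding)
                and match(pattern[2], target[2], generic, binding))
    return pattern == target

def generic_instance(sigma, sigma2):
    """True iff sigma > sigma2, i.e. sigma2 is a generic instance of sigma."""
    alphas, tau = split(sigma)
    betas, tau2 = split(sigma2)
    if set(betas) & free_vars(sigma):
        return False  # the new generic variables must not be free in sigma
    return match(tau, tau2, set(alphas), {})
```

Here subst touches only free variables, while generic_instance binds only the quantified ones, which mirrors the distinction drawn in the text.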

4 Semantics

The semantic domain V for Exp is a complete partial order satisfying the following equations up to isomorphism, where Bᵢ is a cpo corresponding to primitive type ιᵢ:

V = B₀ + B₁ + ··· + F + W (disjoint sum)
F = V → V (function space)
W = {·} (error element)

To each monotype µ corresponds a subset of V, as detailed in [5]; if v ∈ V is in the subset for µ we write v : µ. Further we write v : τ if v : µ for every monotype instance µ of τ, and we write v : σ if v : τ for every τ which is a generic instance of σ.

Now let Env = Id → V be the domain of environments η. The semantic function ℰ : Exp → Env → V is given in [5]. Using it, we wish to attach meaning to assertions of the form

A ⊨ e : σ

where e ∈ Exp and A is a set of assumptions of the form x : σ, x ∈ Id. If the assertion is closed, i.e. if A and σ contain no free type variables, then the sentence is said to hold iff, for every environment η, whenever η[[x]] : σ′ for each member x : σ′ of A, it follows that ℰ[[e]]η : σ. Further, an assertion holds iff all its closed instances hold.

Thus, to verify the assertion

x : α, f : ∀β(β → β) ⊨ (f x) : α

it is enough to verify it for every monotype µ in place of α. This example illustrates that free type variables in an assertion are implicitly quantified over the whole assertion, while explicit quantification in a type scheme has restricted scope.

The remainder of this paper proceeds as follows. First we present an inference system for inferring valid assertions. Next we present an algorithm W for computing a type-scheme for any expression, under assumptions A. We then show that W is sound, in the sense that any type-scheme it derives is derivable in the inference system. Finally we show that W is complete, in the sense that [any] derivable type-scheme is an instance of that computed by W.

5 Type inference

From now on we shall assume that A contains at most one assumption about each identifier x. Aₓ stands for removing any assumption about x from A.

For assumptions A, expressions e and type-scheme σ we write

A ⊢ e : σ

if this instance may be derived from the following inference rules:

TAUT:
A ⊢ x : σ    (x : σ ∈ A)

INST:
A ⊢ e : σ
----------    (σ > σ′)
A ⊢ e : σ′

GEN:
A ⊢ e : σ
-------------    (α not free in A)
A ⊢ e : ∀ασ

COMB:
A ⊢ e : τ′ → τ    A ⊢ e′ : τ′
------------------------------
A ⊢ (e e′) : τ

ABS:
Aₓ ∪ {x : τ′} ⊢ e : τ
-----------------------
A ⊢ (λx.e) : τ′ → τ

LET:
A ⊢ e : σ    Aₓ ∪ {x : σ} ⊢ e′ : τ
------------------------------------
A ⊢ (let x = e in e′) : τ

The following example of a derivation is organised as a tree, in which each node follows from those immediately above it by an inference rule.

TAUT: x : α ⊢ x : α
ABS: ⊢ (λx.x) : α → α
GEN: ⊢ (λx.x) : ∀α(α → α)    (†)

TAUT: i : ∀α(α → α) ⊢ i : ∀α(α → α)
INST: i : ∀α(α → α) ⊢ i : (α → α) → (α → α)

TAUT: i : ∀α(α → α) ⊢ i : ∀α(α → α)
INST: i : ∀α(α → α) ⊢ i : α → α

COMB: i : ∀α(α → α) ⊢ i i : α → α    (‡)

LET, from (†) and (‡): ⊢ (let i = (λx.x) in i i) : α → α

The following proposition, stating the semantic soundness of inference, can be proved by induction on e.

Proposition 1 (Soundness of inference). If A ⊢ e : σ then A ⊨ e : σ.

We will also require later the following two properties of the inference system.

Proposition 2. If S is a substitution and A ⊢ e : σ then SA ⊢ e : Sσ. Moreover if there is a derivation of A ⊢ e : σ of height n then there is also a derivation of SA ⊢ e : Sσ of height less [than] or equal to n.

Proof. By induction on n.

Lemma 1. If σ > σ′ and Aₓ ∪ {x : σ′} ⊢ e : σ₀ then also Aₓ ∪ {x : σ} ⊢ e : σ₀.

Proof. We construct a derivation of Aₓ ∪ {x : σ} ⊢ e : σ₀ from that of Aₓ ∪ {x : σ′} ⊢ e : σ₀ by replacing each use of TAUT for x : σ′ with a use of TAUT for x : σ, followed by an INST step to derive x : σ′. Note that GEN steps remain valid since if α occurs free in σ then it also occurs free in σ′.

6 The type assignment algorithm W

The type inference system itself does not provide an easy method for finding, given A and e, a type-scheme σ such that A ⊢ e : σ. We now present an algorithm W for this purpose. In fact, W goes a step further. Given A and e, if W succeeds it finds a substitution S and a type τ, which are most general in a sense to be made precise below, such that

SA ⊢ e : τ.

To define W we require the unification algorithm of Robinson [6].

Proposition 3 (Robinson). There is an algorithm U which, given a pair of types, either returns a substitution V or fails; further

(i) If U(τ, τ′) returns V, then V unifies τ and τ′, i.e. Vτ = Vτ′.

(ii) If S unifies τ and τ′ then U(τ, τ′) returns some V and there is another substitution R such that S = RV.

Moreover, V involves only variables in τ and τ′.
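A unifier with the behaviour described in Proposition 3 can be sketched in Python for the skeletal type language τ ::= α | ι | τ → τ. This is a textbook-style recursive formulation with an occurs check, not a transcription of Robinson's original presentation. The tuple encoding ("var", name), ("con", name), ("fun", arg, res) and the names subst, compose, occurs, unify and UnifyError are assumptions for illustration.

```python
class UnifyError(Exception):
    """Raised when two types have no unifier."""

def subst(S, t):
    """Apply substitution S (a dict name -> type) to type t."""
    if t[0] == "var":
        return S.get(t[1], t)
    if t[0] == "fun":
        return ("fun", subst(S, t[1]), subst(S, t[2]))
    return t  # primitive type

def compose(S2, S1):
    """Substitution S2 S1, so that subst(compose(S2, S1), t) == subst(S2, subst(S1, t))."""
    out = {a: subst(S2, t) for a, t in S1.items()}
    for a, t in S2.items():
        out.setdefault(a, t)
    return out

def occurs(name, t):
    """True iff the variable called name occurs in type t."""
    if t[0] == "var":
        return t[1] == name
    if t[0] == "fun":
        return occurs(name, t[1]) or occurs(name, t[2])
    return False

def unify(t1, t2):
    """Return a most general V with subst(V, t1) == subst(V, t2), or raise UnifyError."""
    if t1 == t2:
        return {}
    if t1[0] == "var":
        if occurs(t1[1], t2):
            raise UnifyError("occurs check failed")
        return {t1[1]: t2}
    if t2[0] == "var":
        return unify(t2, t1)
    if t1[0] == "fun" and t2[0] == "fun":
        V1 = unify(t1[1], t2[1])
        V2 = unify(subst(V1, t1[2]), subst(V1, t2[2]))
        return compose(V2, V1)
    raise UnifyError("cannot unify %r with %r" % (t1, t2))
```

The returned substitution mentions only variables of the two input types, in line with the final clause of the proposition. Failure is signalled by an exception rather than a special return value, which is a design choice of this sketch.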

We also need to define the closure of a type τ with respect to assumptions A;

Ā(τ) = ∀α₁, ..., αₙ τ

where α₁, ..., αₙ are the type variables occurring free in τ but not in A.
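The closure operation can be sketched in Python as follows. The assumptions A are modelled as a dict from identifiers to type-schemes, and types use an assumed tuple encoding, ("var", name), ("con", name), ("fun", arg, res) and ("forall", names, body); the names closure and free_vars are ours. Generic variables are emitted in sorted order purely to make the result deterministic.

```python
def free_vars(t):
    """Names of the type variables occurring free in a type or type-scheme."""
    tag = t[0]
    if tag == "var":
        return {t[1]}
    if tag == "con":
        return set()
    if tag == "fun":
        return free_vars(t[1]) | free_vars(t[2])
    return free_vars(t[2]) - set(t[1])  # ("forall", names, body)

def closure(A, tau):
    """Quantify the variables that are free in tau but not free in A."""
    free_in_A = set()
    for sigma in A.values():
        free_in_A |= free_vars(sigma)
    generic = tuple(sorted(free_vars(tau) - free_in_A))
    # A type with nothing to generalise is returned unchanged.
    return ("forall", generic, tau) if generic else tau
```

A variable that is still free in some assumption is deliberately left unquantified, which corresponds to the side condition of the GEN rule (α not free in A).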


References

A Machine-Oriented Logic Based on the Resolution Principle

A theory of type polymorphism in programming

The Principal Type-Scheme of an Object in Combinatory Logic

An Extended Polymorphic Type System for Applicative Languages
