6 September 2017: Introduction to Comp 105

Announcements

Recitations start this week (tomorrow!)
First HW due Wednesday, September 13.

Overview

Why so many languages?

Question: What languages have you programmed in?

There are thousands of programming languages, each unique.

Question: Why do you suppose there are so many?

The right language for the job makes it easier to write programs that really work

An invaluable skill for software practitioners
Your language can make you or break you.
Example
- Writing a compiler requires a language that facilitiates maniuplating tree data structures
- Relevant features: algebraic data types, pattern matching, efficient recursion, higher-order functions

The Blub paradox

Essay by Paul Graham on the relative power of programming languages.

What this course isn’t

Simula in September
Objective-C in Ocobter
Visual Basic in November
C in December

Why not?

Because you’d spend most of your time learning shallow details like program syntax, tool chains, and library interfaces.

What this course is:

Reusable Principles

Why?

Once you know the principles, you’ll be able to teach yourself new languages quickly and improve your coding in many different languages.

What are reusable principles?

What if the course were called “Cooking”?

You’d need to know something about how cooking works (THEORY)
- Want to make bread? How does yeast work?
- Want to avoid getting sick? Under what conditions do bacteria thrive?
- Want to develop flavor? What triggers the Maillard reaction?
You’d need to know something about how to cook effectively (PRACTICE)
- French cuisine: mirepoix (onions, carrots, celery cooked in butter)
- Base sauces (the 5 “mother sauces” of Western cooking); Know what they are and when to use them.

The same division for programming languages:

How programming languages work (THEORY)
- MATH: logic, semantics, type theory, induction
How to make them effective (PRACTICE)
- What features enable compact, efficient, maintainable CODE?

What Programming Languages are, technically

The marriage of math and code
Principal tools: Induction and recursion

What can you get out of Comp 105?

Discover new ways think about programming (in many languages)
- For example: master using recursive, higher-order functions
Double your productivity
- By choosing the right language/feature for the job
Become a sophisticated consumer, recognizing familiar features in new languages
Learn new languages quickly
Bonus: Prepare for advanced study
(Course serves everyone from recent 15/61 grads to grad students)

Students who get the most out of 105 report

They enjoy programming (a la 15)
They also like math (a la 61, sort of—induction and proofs)
They work hard

Great languages begin with great features

Language shapes your thinking
There aren’t that many great features, and you will see them over and over
You’ll choose features, and therefore languages, to fit your needs
Some are more powerful than others
Examples: first-class functions, continuations, pattern matching, type inference

In Comp 105,

We explode your brain so you can think differently
You’ll know you’re doing it right if at first your head hurts terribly, then you have a breakthrough moment and it all seems pleasant afterwards

How will we study language features?

Write (lots of!) small programs exercising those features
- High power-to-weight ratio (lots of thought per line)
Learn formal tools to describe language features precisely
- Operational Semantics (What do programs mean?)
- Typing Rules (What can we prove about all programs without running them?)
Extend language implementations so you understand what is under the hood.
Prove properties about various language features

Common Framework

Sequence of Scheme-based pedagogical languages with increasing power
- Simplest language: ImpCore (IMPerative CORE)
Implementation language:
- Start in C
- Shift to ML once we have learned that language
Foundation of operational semantics and typing rules

Course logistics and administration

Books

You must get Norman’s book (Both Volumes!!!)
ML book is optional, but very useful. You won’t need it until October 11.

Homework

Homework will be frequent and challenging:

Many small programming problems
Some theory problems, more like a math problem set
The occasional larger project, like a type checker or a game solver
Submit everything electronically
First homework is due a week from today; designed to get you moving quickly.
The course is relentlessly cumulative.

Both individual and pair work:

All problems should be discussed with others
(Essential to your success)
Discussions must be acknowleged
Most problems must be completed individually
Do not allow anyone else to see your code.
For some problems larger in scope, you can work in pairs
Be very careful to separate your pair work and your individual work. (A mistake could be major violation of academic integrity, with severe penalties.)

Arc of the homework looks something like this:

Assignment	Difficulty
impcore	one star
opsem	two stars
scheme	three stars
hofs	four stars

And it’s more or less four-star homeworks from there on out.

Lesson: Don’t make decisions based on the first couple of homeworks!

Just as intellectually challenging as COMP 40, but in an entirely different direction.

Not “How long until this huge pile of code works?”
Instead “How long until I get the Aha! Moment that makes these 10 lines work?”

Everyone who takes this class has the ability to master the material; Succeeding just requires digging in.

We provide lots of resources to help:

Lectures
Readings
Recitations
Office hours
Piazza

We encourage you to form study groups so you have thought partners.

Two two bad habits to avoid:

Working on your own.
Trying to cram the assignments at the last minute.

The role of lectures

We don’t cover everything in lecture
Lecture is for just the hard parts
We’ll talk very little about the code (just the interesting bits)

In a 100-level course, you are responsible for your own learning

Course evaluations from previous years: a few students want everything gone over in lecture. That’s not how things work in real life, and that’s not how things work here. We point you in the right direction and identify traps and pitfalls, and we find good problems for you to work on.

If you’re expecting to see everything in lecture, you have a couple of choices: change your expectations, or take the course next year when you will have more experience and will be more prepared to manage your own learning.

Recitations

Class goes very fast; recitations provide chance to dig in to key topics with classmates.
Start this Thursday and Friday.
Location information available in SIS.
Designed to be interactive, so bring your thinking cap.
Count towards class participation.

Questions and answers on Piazza

Don’t just ask questions; answer them too.
Both activities count toward class participation.
Be super careful that any question containing your code must be private. (This is an issue of academic integrity.)

Other policies and procedures on the web

You are responsible!
Treasure Hunt for class participation points

What am I called?

Call me “Kathleen,” “Professor Fisher”, or “Profesor.”

ImpCore: The first language in our common framework

Exercise: all-fours?

Write a function that takes a natural number n and returns true (1) iff all the digits in n are 4’s.

Code

Impcore interpretor: > impcore
Command (use homework/fours.imp); will load file into interpretor
Syntax: parentheses with keyword or function name to start
An Impcore program is a sequence of definitions (and expresions)

Impcore variable definition

Example

(val n 99)

Compare

int n = 99;

Also, expressions at top level (definition of it)

Impcore expressions

No statements means expression-oriented:

(if e1 e2 e3)
(while e1 e2)
(set x e)
(begin e1 ... en)
(f e1 ... en)

Each one has a value and may have side effects!

Functions are primitive (+ - * / = < > print)
or defined with (define f ...).

The only type of data is “machine integer” (deliberate oversimplification)

Scoping rules for Impcore

Scopes also called “name spaces”; we will call them “environments” because that’s the pointy-headed theory term—and if you want to read some of the exciting papers, pointy-headed theory has to be second nature.

Names known in ``environments"

Ways to talk about meanings of names:

Scope rules
Name spaces
Environments (aka symbol tables)

Impcore vars in 2 environments: globals, formals

There are no local variables

Just like awk; if you need temps, use extra formal parameters
For homework, you’ll add local variables

Functions live in their own environment (not shared with variables)

Environmental abuse

Abuse of separate name spaces:

-> (val f 33)
33
-> (define f (x) (+ x x))
f
-> (f f)
66

Recursion: a review

Ways a recursive function could decompose a natural number n.

Peel back one (Peano numbers):

n = 0
n = m + 1,    m is also a natural number

Split into two pieces:

n = 0
n = k + (n - k)    0 < k < n   (everything gets smaller)

Sequence of decimal digits (see study problems on digits)

n = d,               where 0 <= d < 10
n = 10 * m + d,      where 0 <= d < 10 and m > 0

To do your homework problems, which I recommend starting today, you’ll need to invent at least one more.

11 September 2017: Introduction to Semantics

There are PDF slides for 9/12/2017.

Handout: 105 Impcore Semantics, Part 1

Today: Abstract Syntax and Operational Semantics

Discussion: Two things you learned last class.

Programming-language semantics

Semantics means meaning.

What problem are we trying to solve?

Know what’s supposed to happen when you run the code

Ways of knowing:

People learn from examples
You can build intuition from words
(Book is full of examples and words)
To know exactly, unambiguously, you need more precision

Q: Does anyone know the beginner exercise “make a peanut butter and jelly sandwich”? (Videos on YouTube)

You can watch and learn, but a computer can’t.
“Put the peanut butter on the bread”

Why bother with precise semantics?

Same reason as other forms of math:

Distill understanding
Express it in sharable way
Prove useful properties. For example:
- private information doesn’t leak
- device driver can’t crash the OS kernel
- compiler optimizations prserve program meaning
- Most important for you: things that look different are actually the same

Plus, needed to build language implementation and tests

The programming languages you encounter after 105 will certainly look different from what we study this term. But most of them will actually be the same. Studying semantics helps you identify that.

The idea: The skills you learn in this class will apply

Behavior decomposes

We want a computational notion of meaning.

What happens when we run (* y 3)?

We must know something about *, y, 3, and function application.

Knowledge is expressed inductively

Atomic forms: Describe behavior directly (e.g., constants, variables)
Compound forms: Behavior specified by composing behaviors of parts

(Non)-Example of compositionality: Spelling/pronunciation in English

fish vs ghoti
Both composed from letters, but no rules of composition for pronunciation.

By design, programming languages more orderly than natural language.

Review: Concrete syntax for Impcore

Definitions and expressions:

def ::= (define f (x1 ... xn) exp)
     |  (val x exp)                
     |  exp
     |  (use filename)            
     |  (check-expect exp1 exp2)
     |  (check-error exp)

exp ::= integer-literal      ;; atomic forms
     |  variable-name
     |  (set x exp)          ;; compound forms
     |  (if exp1 exp2 exp3)
     |  (while exp1 exp2)
     |  (begin exp1 ... expn)
     |  (function-name exp1 ... expn)

How to define behaviors inductively

Expressions only

Base cases (plural): numerals, names

Inductive steps: compound forms

To determine behavior of a compound form, look at behaviors of its parts

First, simplify the task of definition

What’s different? What’s the same?

 x = 3;               (set x 3)

 while (i * i < n)    (while (< (* i i) n)
   i = i + 1;            (set i (+ i 1)))

Abstract away gratuitous differences

(See the bones beneath the flesh)

Abstract syntax

Same inductive structure as BNF

More uniform notation

Good representation in computer

Concrete syntax: sequence of symbols

Abstract syntax: ???

The abstraction is a tree

The abstract-syntax tree (AST):

Exp = LITERAL (Value)
    | VAR     (Name)
    | SET     (Name name, Exp exp)
    | IFX     (Exp cond, Exp true, Exp false)
    | WHILEX  (Exp cond, Exp exp)
    | BEGIN   (Explist)
    | APPLY   (Name name, Explist actuals)

One kind of “application” for both user-defined and primitive functions.

ASTs

Question: What do we assign behavior to?

Answer: The Abstract Syntax Tree (AST) of the program.

An AST is a data structure that represents a program.
A parser converts program text into an AST.

Question: How can we represent all while loops?

while (i < n && a[i] < x) { i++ }

Answer:

Tag code as a while loop
Identify the condition, which can be any expression
Identify the body, which can be any expression

As a data structure:

WHILEX(exp1, exp2), where
exp1 is the representation of (i < n && a[i] < x), and
exp2 is the representation of i++

In C, trees are a bit fiddly

typedef struct Exp *Exp;
typedef enum {
  LITERAL, VAR, SET, IFX, WHILEX, BEGIN, APPLY
} Expalt;        /* which alternative is it? */

struct Exp {  // only two fields: 'alt' and 'u'!
    Expalt alt;
    union {
        Value literal;
        Name var;
        struct { Name name; Exp exp; } set;
        struct { Exp cond; Exp true; Exp false; } ifx;
        struct { Exp cond; Exp exp; } whilex;
        Explist begin;
        struct { Name name; Explist actuals; } apply;
    } u;
};

Let’s picture some trees

An expression:

  (f x (* y 3))

(Representation uses Explist)

A definition:

  (define abs (n)
    (if (< n 0) (- 0 n) n))

Behaviors of ASTs, part I: Atomic forms

Numeral: stands for a value

Name: stands for what?

``Environment’’ is pointy-headed theory

You may also hear:

Symbol table
Name space

Influence of environment is “scope rules”

In what part of code does environment govern?

Find behavior using environment

Recall

  (* y 3)   ;; what does it mean?

Your thoughts?

Impcore uses three environments

Global variables ξ

Functions ϕ

Formal parameters ρ

There are no local variables

Just like awk; if you need temps, use extra formal parameters
For homework, you’ll add local variables

Function environment ϕ not shared with variables—just like Perl

Syntax and Environments determine behavior

Behavior is called evaluation

Expression is evaluated in environment to produce value
``The environment’’ has three parts: globals, formals, functions

Evaluation is

Specified using inference rules (math)
Implemented using interpreter (code)

You know code. You will learn math.

Key ideas apply to any language

Expressions

Values

Rules

Rules written using operational semantics

Evaluation on an abstract machine

Concise, precise definition
Guide to build interpreter
Prove “evaluation deterministic” or “environments can be on a stack”

Idea: “mathematical interpreter”

formal rules for interpretation

With that as background, we can now dive in to the semantics for Impcore!

13 September 2017: Semantics, Syntactic Proofs, Metatheory

There are PDF slides for 9/14/2017.

Handout: Impcore expression rules

Announcements

Impcore homework due tonight
Opsem homework now available

Today

Operational semantics of function application
How we know what the code is supposed to do at run time: valid derivations
What we know about valid derivations: metatheory

Last Time

Compositionality
Abstract Syntax Trees
Environments: Globals (ξ), Functions (ϕ), Locals (ρ)
Abstract machines
Evaluation judgement
Operational Semantics
Correspondance between code and inference rules

Both math and code on homework

You’re good with code—lecture and recitation will focus on math

Questions:

In what order are the actual parameters evaluated?
- How can you tell?
What happens if the formal parameter names are duplicated?
How many formal parameters can the body of f access?
- What are their names?
Can changes to formal parameters in the body of f be seen by the code calling f?
Can changes to globals in the body of f be seen by the code calling f?

Using Operational Semantics

The big idea:

Every terminating computation is described by a data structure—we’re going to turn computation into a data structure. Proofs about computations are hard (see: COMP 170), but proofs about data structures are lots easier (see: COMP 61).

Valid derivations, or “How do I know what this program should evaluate to?”

Code example

  (define and (p q)
    (if p q 0))

  (define digit? (n)
    (and (<= 0 n) (< n 10)))

Suppose we evaluate (digit? 7)

Exercise:

In the body of digit?, what expressions are evaluated in what order?
As a function application, the body matches template (f e₁ e₂). In this example,
- What is f?
- What is e₁?
- What is e₂?

What is the result of (digit? 7)?

How do we know it’s right?

From rules to proofs

What can a proof tell us?

Judgment is valid when ``derivable’’

Special kind of proof: derivation

It’s a data structure (derivation tree)
Made inductively, by composing rules
Valid derivation matches rules (by substitution)
Spacelike representation of timelike behavior (think flip-book animation)

A form of “syntactic proof”

Recursive evaluator travels inductive proof

Root of derivation at the bottom (surprise!)

Build

Start on the left, go up
Cross the ⇓
Finish on the right, go down

First let’s see a movie

Example derivation (rules in handout)

Building derivations

At this point, we’ve now covered derivations and how a single derivation corresponds to evaluating a particular program.

Proofs about all derivations: Metatheory

Derivations (aka syntactic proofs) enable meta-reasoning

Derivation D is a data structure

Got a fact about all derivations?

It’s a fact about all terminating evaluations
They are in 1 to 1 correspondance.

Prove facts by structural induction over derivations

(Or “induction on height of derivation tree”)

Example: Evaluating an expression doesn’t change the set of global variables

Metatheorems often help implementors

More example metatheorems:

OK to mutate environments if you use a stack
Interactive browser doesn’t leak space (POPL 2012)
Device driver can’t harm kernel (Microsoft Singularity)

Metatheorems are proved by induction

Induction over structure (or height) of derivation trees $\mathcal D$

These are “math-class proofs” (not derivations)

Proof

Has one case for each rule
Has multiple cases for some syntactic forms
Assumes the induction hypothesis for any proper sub-derivation (derivation of a premise)

Let’s try it!

Cases to try:

Literal
GlobalVar
SetGlobal
IfTrue
ApplyUser2

For your homework, “Theory Impcore” leaves out While and Begin rules.

18 September 2017: Metatheory wrapup. Intro to functional programming

There are PDF slides for 9/19/2017.

Announcements

Impcore homework returned via email

Today

More induction on derivations (metatheory)
Introduction to Scheme

Last Time

Operational semantics of function application
A valid derivation defines the execution of a single program.
Metatheory allows us to prove things about all programs in the language.

Where are we going?

Recursion and composition:

Recursive functions in depth
Two recursive data structures: the list and the S-expression
More powerful ways of putting functions together (compositionality again, and it leads to reuse)

Recursion comes from inductive structure of input

Structure of the input drives the structure of the code.

You’ll learn to use a three-step design process:

Inductive structure
Equations (“algebraic laws”)
Code

To discover recursive functions, write algebraic laws:

sum 0 = 0
sum n = n + sum (n - 1)

Which direction gets smaller?

Code:

(define sum (n)
   (if (= n 0) 0 (+ n (sum (- n 1)))))

Another example:

exp x 0 = 1
exp x (n + 1) = x * (exp x n)

Can you find a direction in which something gets smaller?

Code:

(define exp (x m) 
  (if (= m 0) 
      1
      (* x (exp x (- m 1)))))

For a new language, five powerful questions

As a lens for understanding, you can ask these questions about any language:

What is the abstract syntax? What are the syntactic categories, and what are the terms in each category?
What are the values? What do expressions/terms evaluate to?
What environments are there? That is, what can names stand for?
How are terms evaluated? What are the judgments? What are the evaluation rules?
What’s in the initial basis? Primitives and otherwise, what is built in?

(Initial basis for μScheme on page 157)

Introduction to Scheme

Question 2: What are the values?

Two new kinds of data:

The function closure: the key to “first-class” functions
Pointer to automatically managed cons cell (mother of civilization)

Graphically:

Picture of two cons cells

(cons 3 (cons 2 ’()))

Scheme Values

Values are S-expressions.

An S-expression is either

a symbol 'Halligan 'tufts
a literal integer 0 77
a literal Boolean #t #f
(cons v₁ v₂), where v₁ and v₂ are S-expressions

Many predefined functions work with a list of S-expressions

A list of S-expressions is either

the empty list '()
(cons v₁ v₂), where v₁ is an S-expression and v₂ is a list of S-expressions

We say “an S-expression followed by a list of S-expressions”

S-Expression operators

Like any other abstract data type, S-Expresions have:

creators that create new values of the type '()
producers that make new values from existing values (cons s s')
mutators that change values of the type (not in uScheme)
observers that examine values of the type
number? symbol? boolean? null? pair? car cdr

N.B. creators + producers = constructors

Examples of S-Expression operators


 (cons 'a '())         also written '(a)

 (cons 'b '(a))        equals '(b a)

 (cons 'c '(b a))      equals '(c b a)

 (null? '(c b a))      equals #f

 (cdr '(c b a)         equals '(b a)

 (car '(c b a)         equals 'c

The symbol ’ is pronounced “tick.”
It indicates that what follows is a literal.

Picture of (cons c (cons b (cons a '())))

Your turn!

What is the representation of

'((a b) (c d))

which can be alternatively written

cons( (cons a (cons b '()))

 `(cons (cons c (cons d '())) '()))`

What is the representation of

cons('a 'b)

Contrast this representation with the one for

cons('a '())

Both of these expressions are S-expressions, but only cons('a '()) is a list.

Picture of '((a b) (c d))

Picture of cons('a 'b)

20 September 2017: More Scheme

There are PDF slides for 9/21/2017.

Announcements

OpSem Homework due tonight
Scheme I: Recursive Programming with Lists Homework now available.

Today

Lists
Algebraic Laws for writing functions
The cons cost model
The method of accumulating parameters

Last Time

Wrapped up induction on derivations
Introduction to Scheme
S-expressions and cons cells

Lists

Subset of S-Expressions.

Can be defined via a recursion equation or by inference rules:

Constructors: '(),cons`

Observers: null?, pair?, car, cdr (also known as first and rest, head and tail, and many other names)

Why are lists useful?

Sequences a frequently used abstraction
Can easily approximate a set
Can implement finite maps with association lists (aka dictionaries)
You don’t have to manage memory

These “cheap and cheerful” representations are less efficient than balanced search trees, but are very easy to implement and work with—see the book.

The only thing new here is automatic memory management. Everything else you could do in C. (You can have automatic memory management in C as well.)

Immutable data structures

Key idea of functional programming. Instead of mutating, build a new one. Supports composition, backtracking, parallelism, shared state.

Review: Algebraic laws of lists

You fill in these right-hand sides:

(null? '()) == 
(null? (cons v vs)) == 
(car (cons v vs)) == 
(cdr (cons v vs)) == 

(length '()) ==
(length (cons v vs)) ==

Combine creators/producers with observers to create laws.

Can use laws to prove properties of code and to write better code.

Recursive functions for recursive types

Any list is therefore constructed with '() or with cons applied to an atom and a smaller list.

How can you tell the difference between these types of lists?
What, therefore, is the structure of a function that consumes a list?

Example: length

Algebraic Laws for length

Code:

;; you fill in this part

Algebraic laws to design list functions

Using informal math notation with .. for “followed by” and e for the empty sequence, we have these laws:

xs .. e         = xs
e .. ys         = ys
(z .. zs) .. ys = z .. (zs .. ys)
xs .. (y .. ys) = (xs .. y) .. ys

The underlying operations are append, cons, and snoc. Which ..’s are which?

But we have no snoc
If we cross out the snoc law, we are left with three cases… but case analysis on the first argument is complete.

So cross out the law xs .. e == xs.

Example: Append

Which rules look useful for writing append?

You fill in these right-hand sides:

(append '()         ys) == 

(append (cons z zs) ys) ==

Equations and function for append

(append '()         ys) == ys

(append (cons z zs) ys) == (cons z (append zs ys))


(define append (xs ys)

  (if (null? xs)

      ys

      (cons (car xs) (append (cdr xs) ys))))

Why does it terminate?

Cost model

The major cost center is cons because it corresponds to allocation.

How many cons cells are allocated?

Let’s rigorously explore the cost of append.

Induction Principle for List(Z)

Suppose I can prove two things:

IH (’())
Whenever z in Z and also IH(zs), then IH (cons z zs)

then I can conclude

Forall zs in List(Z), IH(zs)

Example: The cost of `append`

Claim: Cost (append xs ys) = (length xs)

Proof: By induction on the structure of xs.

Base case: xs = ’()

I am not allowed to make any assumptions.
```
(append '() ys)
= { because xs is null }
ys
```
Nothing has been allocated, so the cost is zero.

(length xs) is also zero.

Therefore, cost = (length xs).

Inductive case: xs = (cons z zs)

I am allowed to assume the inductive hypothesis for zs.

Therefore, I may assume the number of cons cells allocated by (append zs ys) equals (length zs)

Now, the code:

(append (cons z zs) ys)
  = { because first argument is not null }
  = { because (car xs) = z }
  = { because (cdr xs) = zs }
(cons z (append zs ys))

The number of cons cells allocated is 1 + the number of cells allocated by (append zs ys).

cost of (append xs ys)
 = { reading the code }
1 + cost of (append zs ys)
 = { induction hypothesis }
1 + (length zs)
 = { algebraic law for length }
(length (cons z zs))
 = { definition of xs }
(length xs)

Conclusion: Cost of append is linear in length of first argument.

Example: list reversal

Algebraic laws for list reversal:

reverse '() = '()
reverse (x .. xs) = reverse xs .. reverse '(x) = reverse xs .. '(x)

And the code?

Naive list reversal

(define reverse (xs)
   (if (null? xs)
       '()
       (append (reverse (cdr xs))
               (list1 (car xs)))))

The list1 function maps an atom x to the singleton list containing x.

How many cons cells are allocated? Let’s let n = |xs|.

Q: How many calls to reverse? A: n
Q: How many calls to append? A: n
Q: How long a list is passed to reverse? A: n-1, n-2, … , 0
Q: How long a list is passed as first argument to append? A: n-1, n-2, … , 0
Q: How many cons cells are allocated by call to list1? A: one per call to reverse.
Conclusion: O(n²) cons cells allocated. (We could prove it by induction.)

The method of accumulating parameters

The function revapp takes two list arguments xs and ys.
It reverses xs and appends the result to ys:

(revapp xs ys) = (append (reverse xs) ys)

Write algebraic laws for revapp involving different possible forms for xs.

Who could write the code?

Reversal by accumulating parameters

(define revapp (xs ys)
   (if (null? xs)
       ys
       (revapp (cdr xs) 
               (cons (car xs) ys))))

(define reverse (xs) (revapp xs '()))

The cost of this version is linear in the length of the list being reversed.

Parameter ys is the accumulating parameter.
(A powerful, general technique.)

Linear reverse, graphically

We call reverse on the list '(1 2 3):

Function reverse calls the helper function revapp with '() as the ys argument:

The xs parameter isn’t '(), so we recursively call revapp with the cdr of xs and the result of consing the car of xs onto ys:

The xs parameter still isn’t '(), so we again call revapp recursively:

Still not '(), so we recurse again:

This time xs is '(), so now we just return ys, which now contains the original list, reversed!

PDF slides of revapp

25 September 2017: Let and Lambda

There are PDF slides for 9/26/2017.

Code

Announcements

Scheme I HW due 9/27

Last Time

Inductive definitions: List of Z
List functions ('(), cons, car, cdr, null?)
Cost model: number of cons allocations
Accumulating parameters: revapp

Today

Association lists [Not covered in class]
Let construct
Anonymous functions

Association lists represent finite maps [Not covered in class]

Implementation: List of key-value pairs

'((k1 v1) (k2 v2) ... (kn vn))

Picture with spine of cons cells

Functions car, cdar, caar, cadar can help navigate.

car: Contents of the address register
caar: Contents of the address then address register
cdar: Contents of the address then data register
cadar: Contents of the address then data then address registers

Recall that the left box in a cons cell is the address and the right box is the data. Read the a as “address” and the d as “data” from right to left.

In association lists, these operations correspond to

car: First key value pair, e.g., '(k1 v1)
caar: Key of first key value pair, e.g. 'k1
cdar: List of values of first key value pair, e.g. '(v1)
cadar: Contents of the address then data then address registers, e.g. 'v1

A-list example

    -> (find 'Building 
             '((Course 105) (Building Robinson) 
               (Instructor Fisher)))
    Robinson
    -> (val ksf (bind 'Office 'Halligan-242
                (bind 'Courses '(105)
                (bind 'Email 'comp105-staff '()))))
    ((Email comp105-staff) 
     (Courses (105)) 
     (Office Halligan-242))
    -> (find 'Office ksf) 
    Halligan-242
    -> (find 'Favorite-food ksf)
    ()

Notes:

An attribute can be a list or any other value.
'() stands for ‘not found’

Algebraic laws of association lists

Laws of assocation lists

(find k (bind k v l)) = v
(find k (bind k' v l)) = (find k l), provided k != k'
(find k '()) =  '() --- bogus!

Handy new feature of Scheme: `let` binding

Introduce local names into environment

    (let ((x1 e1)
          ...
          (xn en))
        e)

Evaluate e1 through en, bind answers to x1, … xn

Name intermediate results (simpler code, less error prone)
Creates new environment for local use only:
```
rho {x1 |-> v1, ..., xn |-> vn}
```

Also let* (one at a time) and letrec (local recursive functions)

Note that we have definititions in the language and it might be easier to read if McCarthy had actually used definition syntax, which you’ll see in ML, Haskell, and other functional languages:

Syntax McCarthy should have used

(let ((val x1 e1)
      ...
      (val xn en))
   e)

From Impcore to uScheme

Things that should offend you about Impcore:

Looking up a function and looking up a variable require different interfaces! (isvalbound and isfunbound)
To get a variable, must check 2 or 3 environments (ξ, ϕ, ρ),
Can’t create a function without giving it a name:
- High cognitive overhead
- A sign of second-class citizenship

All these problems have one solution: Lambda! (λ)

Anonymous, first-class functions

From Church’s lambda-calculus:

(lambda (x) (+ x x))

“The function that maps x to x plus x”

At top level, like define. (Or more accurately, define is a synonym for lambda that also gives the lambda a name.)

In general, \x.E or (lambda (x) E)

x is bound in E
other variables are free in E

The ability to “capture” free variables is what makes it interesting.

Functions become just like any other value.

First-class, nested functions

(lambda (x) (+ x y))  ; means what??

What matters is that y can be a parameter or a let-bound variable of an enclosing function.

Can tell at compile time what is captured.
To understand why anyone cares, you’ll need examples

First example: Finding roots. Given n and k, find an x such that x^n = k.

Step 1: Write a function that computes x^n - k.

Step 2: Write a function that finds a zero between lo and hi bounds.

Picture of zero-finding function.

Algorithm uses binary search over integer interval between lo and hi. Finds point in that interval in which function is closest to zero.

Code that computes the function x^n - k given n and k:

Function escapes!

-> (define to-the-n-minus-k (n k)
      (let
        ((x-to-the-n-minus-k (lambda (x) 
                                (- (exp x n) k))))
        x-to-the-n-minus-k))
-> (val x-cubed-minus-27 (to-the-n-minus-k 3 27))
-> (x-cubed-minus-27 2)
-19

The function to-the-n-minus-k is a higher-order function because it returns another (escaping) function as a result.

No need to name the escaping function

-> (define to-the-n-minus-k (n k)
      (lambda (x) (- (exp x n) k)))

-> (val x-cubed-minus-27 (to-the-n-minus-k 3 27))
-> (x-cubed-minus-27 2)
-19

General purpose zero-finder that works for any function f:

The zero-finder

(define findzero-between (f lo hi)
   ; binary search
   (if (>= (+ lo 1) hi)
       hi
       (let ((mid (/ (+ lo hi) 2)))
          (if (< (f mid) 0)
              (findzero-between f mid hi)
              (findzero-between f lo mid)))))
(define findzero (f) (findzero-between f 0 100))

findzero-between is also a higher-order function because it takes another function as an argument. But nothing escapes; you can do this in C.

Example uses:

Cube root of 27 and square root of 16

-> (findzero (to-the-n-minus-k 3 27))                                    
3
-> (findzero (to-the-n-minus-k 2 16))
4

Your turn!!

Lambda questions

(define combine (p? q?)
   (lambda (x) (if (p? x) (q? x) #f)))

(define divvy (p? q?)
   (lambda (x) (if (p? x) #t (q? x))))

(val c-p-e (combine prime? even?))
(val d-p-o (divvy   prime? odd?))

(c-p-e 9) == ?            (d-p-o 9) == ?
(c-p-e 8) == ?            (d-p-o 8) == ?
(c-p-e 7) == ?            (d-p-o 7) == ?

Lambda answers

(define combine (p? q?)
   (lambda (x) (if (p? x) (q? x) #f)))

(define divvy (p? q?)
   (lambda (x) (if (p? x) #t (q? x))))

(val c-p-e (combine prime? even?))
(val d-p-o (divvy   prime? odd?))

(c-p-e 9) == #f           (d-p-o 9) == #t
(c-p-e 8) == #f           (d-p-o 8) == #f
(c-p-e 7) == #f           (d-p-o 7) == #t

Escaping functions

“Escape” means “outlive the function in which the lambda was evaluated.”

Typically returned
More rarely, stored in a global variable or a heap-allocated data structure

We have already seen an example:

An ``escaping’’ function

-> (define to-the-n-minus-k (n k)
      (lambda (x) (- (exp x n) k)))

Where are n and k stored???

Values that escape have to be allocated on the heap
- C programmers use malloc to explicitly manage such values.
- In a language with first-class, nested functions, storage of escaping values is part of the semantics of lambda.

Picture of activation record for to-the-n-minus-k with n and k being popped.

An example:

What’s the closure for conjunction?

(define combine (p? q?)
   (lambda (x) (if (p? x) (q? x) #f)))

Higher-order functions!

Preview: in math, what is the following equal to?

(f o g)(x) == ???

Another algebraic law, another function:

(f o g) (x) = f(g(x))
(f o g) = \x. (f (g (x)))

Functions create new functions

-> (define o (f g) (lambda (x) (f (g x))))
-> (define even? (n) (= 0 (mod n 2)))
-> (val odd? (o not even?))
-> (odd? 3)
#t
-> (odd? 4)
#f

Another example: (o not null?)

Currying

Currying converts a binary function f(x,y) to a function f' that takes x and returns a function f'' that takes y and returns the value f(x,y).

As we study higher-order functions in more detail, you will see why currying is useful.

Classic functional technique: Currying

-> (val positive? (lambda (y) (< 0 y)))
-> (positive? 3)
#t
-> (val <-c (lambda (x) (lambda (y) (< x y))))
-> (val positive? (<-c 0)) ; "partial application"
-> (positive? 0)
#f

Curried functions take their arguments “one-at-a-time.”

What’s the algebraic law for curry?


     ...   (curry f) ...    =  ... f ...

     Keep in mind: 
     All you can do with a function is apply it!


     (((curry f) x) y) = f (x, y)

Your turn!!

Exercises

-> (map     ((curry +) 3) '(1 2 3 4 5))
???
-> (exists? ((curry =) 3) '(1 2 3 4 5))
???
-> (filter  ((curry >) 3) '(1 2 3 4 5))
???                        ; tricky

Answers

-> (map     ((curry +) 3) '(1 2 3 4 5))
(4 5 6 7 8)
-> (exists? ((curry =) 3) '(1 2 3 4 5))
#t
-> (filter  ((curry >) 3) '(1 2 3 4 5)) 
(1 2)

Bonus content: Lambda as an abstraction barrier

Bonus content: vulnerable variables?

-> (val seed 1)
-> (val rand (lambda ()
      (set seed (mod (+ (* seed 9) 5) 1024)))))
-> (rand)
14
-> (rand)
131
-> (set seed 1)
1
-> (rand)
14

Q: What’s the problem with this approach?

A: The seed is exposed to the end user, who can break the abstraction of the rand function.

Bonus: Lambda as abstraction barrier!

-> (val mk-rand (lambda (seed)
     (lambda ()
       (set seed (mod (+ (* seed 9) 5) 1024))))))
-> (val rand (mk-rand 1))
-> (rand)
14
-> (rand)
131
-> (set seed 1)
error: set unbound variable seed
-> (rand)
160

27 September 2017: Higher-order functions

There are PDF slides for 9/28/2017.

Code

Announcements

Instructor Office Visits on Saturday (see Piazza)
Scheme I HW due tonight
Scheme II HW (HOFS) due 10/4

Last Time

Association lists [bonus]
Let construct
Anonymous functions

Exercises

-> (map     ((curry +) 3) '(1 2 3 4 5))
???
-> (exists? ((curry =) 3) '(1 2 3 4 5))
???
-> (filter  ((curry >) 3) '(1 2 3 4 5))
???                        ; tricky

Answers

-> (map     ((curry +) 3) '(1 2 3 4 5))
(4 5 6 7 8)
-> (exists? ((curry =) 3) '(1 2 3 4 5))
#t
-> (filter  ((curry >) 3) '(1 2 3 4 5)) 
(1 2)

Today

Reasoning about Functions
Useful Higher-Order Functions
Tail Calls

Reasoning about code

Reasoning principle for lists

Recursive function that consumes A has the same structure as a proof about A

Q: How to prove two lists are equal?
A: Prove they are both '() or that they are both cons cells cons-ing equal car’s to equal cdr’s

Reasoning principle for functions

Q: Can you do case analysis on a function?

A: No!

Q: So what can you do then?

A: Apply it!

Q: How to prove two functions equal?
A: Prove that when applied to equal arguments they produce equal results.

Higher-Order Functions

Goal: Start with functions on elements, end up with functions on lists

Generalizes to sets, arrays, search trees, hash tables, …

Goal: Capture common patterns of computation or algorithms

exists? (Ex: Is there a number?)
all? (Ex: Is everything a number?)
filter (Ex: Take only the numbers)
map (Ex: Add 1 to every element)
foldr (General: can do all of the above.)

Fold also called reduce, accum, or a “catamorphism”

List search: `exists?`

Algorithm encapsulated: linear search

Example: Is there a even element in the list?

Algebraic laws:

(exists? p? '())          == ???
(exixts? p? '(cons a as)) == ???


(exists? p? '())          == #f
(exixts? p? '(cons a as)) == p? x or exists? p? xs

Defining exists?

-> (define exists? (p? xs)
      (if (null? xs)
          #f
          (or (p? (car xs)) 
              (exists? p? (cdr xs)))))
-> (exists? even? '(1 3))
#f
-> (exists? even? '(1 2 3))
#t
-> (exists? ((curry =) 0) '(1 2 3))
#f
-> (exists? ((curry =) 0) '(0 1 2 3))
#t

Your turn: Does everything match: `all?`

Example: Is every element in a list even?

Algebraic laws:

(all? p? '())          == ???
(all? p? '(cons a as)) == ???


(all? p? '())          == #t
(all? p? '(cons a as)) == p? x and all? p? xs

Defining all?

-> (define all? (p? xs)
      (if (null? xs)
          #t
          (and (p? (car xs)) 
               (all? p? (cdr xs)))))

-> (all? even? '(1 3)) 
#f
-> (all? even? '(2)) 
#t
-> (all? ((curry =) 0) '(1 2 3))
#f
-> (all? ((curry =) 0) '(0 0 0))
#t

List selection: `filter`

Algorithm encapsulated: Linear filtering

Example: Given a list of numbers, return only the even ones.

Algebraic laws:

(filter p? '())          == ???
(filter p? '(cons m ms)) == ???

(filter p? '())          == '()
(filter p? '(cons m ms)) == if (p? m)
                               (cons m (filter p? ms)) 
                               (filter p? ms)

Defining filter

-> (define filter (p? xs)
     (if (null? xs)
       '()
       (if (p? (car xs))
         (cons (car xs) (filter p? (cdr xs)))
         (filter p? (cdr xs)))))
-> (filter (lambda (n) (>  n 0)) '(1 2 -3 -4 5 6))
(1 2 5 6)
-> (filter (lambda (n) (<= n 0)) '(1 2 -3 -4 5 6))
(-3 -4)
-> (filter ((curry <)  0) '(1 2 -3 -4 5 6))
(1 2 5 6)
-> (filter ((curry >=) 0) '(1 2 -3 -4 5 6))
(-3 -4)

Composition Revisited: List Filtering

-> (val positive? ((curry <) 0))

-> (filter positive?         '(1 2 -3 -4 5 6))
(1 2 5 6)
-> (filter (o not positive?) '(1 2 -3 -4 5 6))
(-3 -4)

“Lifting” functions to lists: `map`

Algorithm encapsulated: Transform every element

Example: Square every element of a list.

Algebraic laws:

(map f '())         ==  ???
(map f (cons n ns)) ==  ???

(map f '())         ==  '()
(map f (cons n ns)) ==  cons (f n) (map f ns)

Defining map

-> (define map (f xs)
     (if (null? xs)
       '()
       (cons (f (car xs)) (map f (cdr xs)))))
-> (map number? '(3 a b (5 6)))
(#t #f #f #f)
-> (map ((curry *) 100) '(5 6 7))
(500 600 700)
-> (val square* ((curry map) (lambda (n) (* n n))))

-> (square* '(1 2 3 4 5))
(1 4 9 16 25)

The universal list function: `fold`

foldr takes two arguments:

zero: What to do with the empty list.
plus: How to combine next element with running results.

Example: foldr plus zero '(a b)

cons a (cons b '())
 |       |      |
 v       v      v
plus a (plus b zero)

In-class exercise: Folding `combine?`

Tail calls

Intuition: In a function, a call is in tail position if it is the last thing the function will do.

A tail call is a call in tail position.

Important for optimizations: Can change complexity class.

What is tail position?

Tail position is defined inductively:

The body of a function is in tail position
When (if e1 e2 e3) is in tail position, so are e2 and e3
When (let (...) e) is in tail position, so is e, and similary for letrec and let*.
When (begin e1 ... en) is in tail position, so is en.

Idea: The last thing that happens

Anything in tail position is the last thing executed!

Key idea is tail-call optimization!

Example: reverse '(1 2)

Question: How much stack space is used by the call?

Call stack:

reverse '() 
append
reverse '(2)
append
reverse '(1 2)

Answer: Linear in the length of the list

Example: revapp '(1 2) '()

Question: How much stack space is used by the call?

Call stack: (each line replaces previous one)

revapp '(1 2) '() –>

revapp '(2) '(1) –>

revapp '() '(2 1)

Answer: Constant

Question: Why can’t we do this same optimization on reverse?

Answer: reverse has to do further computation with results of recursive calls, so can’t eliminate the stack frame until later.

Answer: a goto!!

Think of “tail call” as “goto with arguments”

2 October 2017: Continuations

There are PDF slides for 10/3/2017.

Last Time

Reasoning about functions
Higher-order functions
exists? all?
filter
map
fold
Tail calls

Announcements

Scheme II HW (HOFs) due Wednesday 10/4

Today

Continuations

Continuations

A continuation is code that represents “the rest of the computation.”

Not a normal function call because continuations never return
Think “goto with arguments”

Different coding styles

Direct style: Last action of a function is to return a value. (This style is what you are used to.)

Continuation-passing style (CPS): Last action of a function is to “throw” a value to a continuation.

Uses of continuations

Call-backs in GUI frameworks
A style of coding that can mimic exceptions
Some languages provide a construct for capturing the current continuation and giving it a name k. Control can be resumed at captured continuation by throwing to k.
Compiler representation: Compilers for functional languages often convert direct-style user code to CPS because CPS matches control-flow of assembly.

Implementation

First-class continuations require compiler support.
We’re going to simulate continuations with function calls in tail position.

Motivating Example: From existence to witness

Ideas?

Bad choices:

nil
special symbol 'fail
run-time error

Good choice:

exception (not in uScheme)

Your turn: Refine the laws

(witness-cps p? xs succ fail) = (succ x)
     ; where x is in xs and (p? x)
(witness-cps p? xs succ fail) = (fail)
     ; where (not (exists? p? xs))

(witness-cps p? '() succ fail) = ?

(witness-cps p? (cons z zs) succ fail) = ?
    ; when (p? z)

(witness-cps p? (cons z zs) succ fail) = ?
    ; when (not (p? z))

Refine the laws

(witness-cps p? xs succ fail) = (succ x)
     ; where x is in xs and (p? x)
(witness-cps p? xs succ fail) = (fail)
     ; where (not (exists? p? xs))

(witness-cps p? '() succ fail) = (fail)

(witness-cps p? (cons z zs) succ fail) = (succ z)
    ; when (p? z)

(witness-cps p? (cons z zs) succ fail) = 
     (witness-cps p? zs succ fail)  
    ; when (not (p? z))

Coding with continuations

(define witness-cps (p? xs succ fail)
   (if (null? xs)
       (fail)
       (let ((x (car xs)))
         (if (p? x)
             (succ x)
             (witness-cps p? (cdr xs) succ fail)))))

Question: How much stack space is used by the call?

Answer: Constant

Example Use: Instructor Lookup

-> (val 2017f '((Fisher 105)(Cowen 170)(Chow 116)))
-> (instructor-info 'Fisher 2017f)
(Fisher teaches 105)
-> (instructor-info 'Chow 2017f)
(Chow teaches 116)
-> (instructor-info 'Souvaine 2017f)
(Souvaine is-not-on-the-list)

Extended Example: A SAT Solver

Exercise: Find a satisfying assignment if one exists

(val f1 '(and x y z w p q (not x)))

(val f2 '(not (or x y)))

(val f3 '(not (and x y z)))

(val f4 '(and (or x y z) 
              (or (not x) (not y) (not z))))

Satisfying assignments

(val f1 '(and x y z w p q (not x))) ; NONE

(val f2 '(not (or x y))) 
                  ; { x |-> #f, y |-> #f }

(val f3 '(not (and x y z))) 
                  ; { x |-> #f, ... }
(val f4 '(and (or x y z) 
              (or (not x) (not y) (not z))))
              ; { x |-> #f, y |-> #t, ... }

Continuations for Search

Solving a Literal

start carries a partial truth assignment to variables current

Box describes how to extend current to make a variable, say x, true.

Case 1: current(x) = #t

Call success continuation with current

Pass fail as resume continuation (argument to success)

Case 2: current(x) = #f

Call fail continuation

Case 3: x not in current

Call success cotinuation with current{x -> #t}

Pass fail as resume continuation

Solving a Negated Literal (Your turn)

start carries a partial truth assignment to variables current

Box describes how to extend current to make a negated variable, say not x, true.

Case 1: current(x) = #f

Call success continuation with current

Pass fail as resume continuation (argument to success)

Case 2: current(x) = #t

Call fail continuation

Case 3: x not in current

Call success cotinuation with current{x -> #f}

Pass fail as resume continuation

These diagrams (and the corresponding code) compose!

Solving A and B

Picture of A and B

Solver enters A
If A is solved, newly allocated success continuation starts B
If B succeeds, we’re done! Use success continuation from context.
If B fails, use resume continuation A passed to B as fail.
If A fails, the whole thing fails. Use fail continuation from context.

Solving A or B

Picture of A or B

Solver enters A
If A is solved, we’re good! But what if context doesn’t like solution? It can resume A using the resume continuation passed out as fail.
If A can’t be solved, don’t give up! Try a newly allocated failure continuation to start B.
If ever B is started, we’ve given up on A entirely. So B’s success and failure continuations are exactly the ones in the context.
If B succeeds, but the context doesn’t like the answer, the context can resume B.
If B fails, abject failure all around; call the original fail continuation.

4 October 2017: Scheme Semantics

There are PDF slides for 10/5/2017.

Announcements

If you can, bring your laptop to this week’s recitation.
Scheme HW II (hofs) due tonight.
Scheme HW III (continuations and semantics) due on Monday 10/16.

Today

Scheme Semantics

Stores
Lambdas evaluate to closures
Application

Last Time

Continuations
“gotos with arguments”
Example: Handling missing values (association list)
Example: Structuring a search (SAT solver)

New Syntax, Values, Environments, and Evaluation Rules

First four of five questions: Syntax, Values, Environments, Evaluation

Key changes from Impcore:

New constructs: let, lambda, application (not just named functions)

New values: cons cells and functions (closures)
A single kind of environment
- Environment maps names to mutable locations, not values.
- A store maps locations to values.
- Environments get copied (in closures).

It’s not precisely true that rho never changes.
New variables are added when they come into scope.
Old variables are deleted when they go out of scope.
But the location associated with a variable never changes.

The book includes all rules for uScheme. Here we will discuss on key rules.

Variables

Board: Picture of environment pointing to store.

Questions about Assign:

What changes are captured in σ′?
What changes are captured in σ′{ℓ↦v}?
What would happen if we used σ instead of σ′
What would happen if we used a fresh ℓ?
Some other ℓ in the range of ρ?

Lambdas

Function Application

Example


  (val even (lambda (x) (= 0 (mod x 2)))) 

  (val f    (lambda (y) (if (even y) 5 15)))

  (val even 3)

  (f 10)

Question: Which even is referenced when f is called?
Answer: With static scoping, it’s the predicate. With dynamic scoping it’s the one bound to 3.

Questions about ApplyClosure:

What if we used σ instead of σ₀ in evaluation of e₁?
What if we used σ instead of σ₀ in evaluation of arguments?
What if we used ρ_c instead of ρ in evaluation of arguments?
What if we did not require ℓ₁, …, ℓ_n ∉ dom(σ)?
What is the relationship between ρ and σ?

Picture of environment and store that results from executing above program.

Closure Optimizations

Major issue in making functional programs efficient
- Keep closures on the stack
- Share closures
- Eliminate closures (when functions don’t escape)

11 October 2017: Scheme Wrap-up; ML Intro

There are PDF slides for 10/12/2017.

Handout: Which let is which?

Announcements

Today

Scheme Wrap-up
Intro to ML

Last Time

Scheme semantics
Single kind of environment that maps names to locations.
A store maps locations to values.
Functions & Closures
Application

Lets

Which let is which and why?

Three versions of let:

let puts the new bindings in scope only for the body expression.
let* adds each binding one at a time, so each binding is in scope for the later ones.
letrec considers all the bindings to be mutually recursive.

Handout: Which let is which?

Lisp and Scheme Retrospective

Common Lisp, Scheme

Advantages:

High-level data structures
Cheap, easy recursion
Automatic memory management (garbage collection!)
Programs as data!
Hygenic macros for extending the language
Big environments, tiny interpreters, everything between
Sophisticated Interactive Development Environments
Used in AI applications; ITA; Paul Graham’s company Viaweb

Down sides:

Hard to talk about data
Hard to detect errors at compile time

Bottom line: it’s all about lambda

Major win
Real implementation cost (heap allocation)

Bonus content: Scheme as it really is

Macros!
Cond expressions (solve nesting problem)
Mutation
…

Macros!

Real Scheme: Macros

A Scheme program is just another S-expression

Function define-syntax manipulates syntax at compile time
Macros are hygienic—name clashes impossible
let, and, many others implemented as macros

Conditional expressions

Real Scheme: Conditionals

(cond (c1 e1)    ; if c1 then e1
      (c2 e2)    ; else if c2 then e2
       ...            ...
      (cn en))   ; else if cn then en

; Syntactic sugar---'if' is a macro:
(if e1 e2 e3) == (cond (e1 e2)
                       (#t e3))

Mutation

Real Scheme: Mutation

Not only variables can be mutated.

Mutate heap-allocated cons cell:

(set-car! '(a b c) 'd)  => (d b c)

Circular lists, sharing, avoids allocation

still for specialists only

Introduction to ML

Apply your new knowledge in Standard ML:

You’ve already learned (most of) the ideas
There will be a lot of new detail
Good language for implementing language features
Good language for studying type systems

Lectures on ML:

Algebraic types and pattern matching
Exceptions
An introduction to types

Meta: Not your typical introduction to a new language

Not definition before use, as in a manual
Not tutorial, as in Ullman
Instead, the most important ideas that are most connected to your work up to now

ML Overview

Designed for programs, logic, symbolic data

Theme: Precise ways to describe data

ML = uScheme + pattern matching + exceptions + static types

uScheme -> ML Rosetta Stone

uScheme                    SML


 (cons x xs)             x :: xs

 '()                     []
 '()                     nil

 (lambda (x) e)          fn x => e

 (lambda (x y z) e)      fn (x, y, z) => e

 ||  &&                  andalso    orelse


 (let* ([x e1]) e2)      let val x = e1 in e2 end

 (let* ([x1 e1]          let val x1 = e1
        [x2 e2]              val x2 = e2
        [x3 e3]) e)          val x3 = e3
                         in  e
                         end

Three new ideas

Pattern matching is big and important. You will like it.
Exceptions are easy
Static types get two to three weeks in their own right.

Pattern matching makes code look more like algebraic laws: one pattern for each case.

Static types tell us at compile time what the cases are.

And lots of new concrete syntax!

Examples

The length function.

Algebraic laws:

length []      = 0
length (x::xs) = 1 + length xs

The code:

fun length [] = 0
|   length (x::xs) = 1 + length xs

Things to notice:
- No brackets! (I hate the damn parentheses)
- Function application by juxtaposition
- Function application has higher precedence than any infix operator
- Compiler checks all the cases (try in the interpreter)
Let’s try another! map, filter, exists, all, take, drop, takewhile, dropwhile

Length

    fun length [] = 0
      | length (x::xs) = 1 + length xs

    val res = length [1,2,3]

Map

fun map f [] = []
  | map f (x::xs) = (f x) :: (map f xs)

val res1 = 
  map length [[], [1], [1,2], [1,2,3]]

Map, without redundant parentheses

fun map f []      = []
  | map f (x::xs) =  f x  ::  map f xs

val res1 =
  map length [[], [1], [1,2], [1,2,3]]

Filter

fun filter pred [] = [] 
  | filter pred (x::xs) =   (* pred? not legal *)
      let val rest = filter pred xs 
      in if pred x then
           (x::rest) 
         else rest
      end

val res2 = 
  filter (fn x => (x mod 2) = 0) [1,2,3,4]

(* Note fn x => e is syntax for lambda in SML *)

Filter, without redundant parentheses

fun filter pred []      = []
  | filter pred (x::xs) =  (* no 'pred?' *)
      let val rest = filter pred xs
      in  if pred x then
             x :: rest
          else
            rest
      end

val res2 =
  filter (fn x => (x mod 2) = 0) [1,2,3,4]

Exists

fun exists pred [] = false
  | exists pred (x::xs) = 
      (pred x) orelse (exists pred xs)

val res3 = 
  exists (fn x => (x mod 2) = 1) [1,2,3,4]

Exists, without redundant parentheses

fun exists pred []      = false
  | exists pred (x::xs) =
       pred x  orelse  exists pred xs

val res3 =
  exists (fn x => (x mod 2) = 1) [1,2,3,4]

All

fun all pred [] = true
  | all pred (x::xs) =
      (pred x) andalso (all pred xs)

val res4 = all (fn x => (x >= 0)) [1,2,3,4]

All, without redundant parentheses

fun all pred []      = true
  | all pred (x::xs) =
      pred x andalso all pred xs

val res4 = all (fn x => (x >= 0)) [1,2,3,4]

Take

exception ListTooShort
fun take 0     l   = []
  | take n    []   = raise ListTooShort
  | take n (x::xs) = x::(take (n-1) xs)

val res5 = take 2 [1,2,3,4]
val res6 = take 3 [1] 
           handle ListTooShort => 
             (print "List too short!"; [])

(* Note use of exceptions. *)

Take, without redundant parentheses

exception TooShort
fun take 0 _       = []  (* wildcard! *)
  | take n []      = raise TooShort
  | take n (x::xs) = x ::  take (n-1) xs

val res5 = take 2 [1,2,3,4]
val res6 = take 3 [1]
           handle TooShort =>
             (print "List too short!"; [])

(* Note use of exceptions. *)

Drop

fun drop 0     l   = l
  | drop n    []   = raise ListTooShort
  | drop n (x::xs) = (drop (n-1) xs)

val res7 = drop 2 [1,2,3,4]
val res8 = drop 3 [1] 
           handle ListTooShort => 
              (print "List too short!"; [])

Takewhile

fun takewhile p [] = []
  | takewhile p (x::xs) = 
      if p x then (x::(takewhile p xs)) 
             else []

fun even x = (x mod 2 = 0)
val res8 = takewhile even [2,4,5,7]
val res9 = takewhile even [3,4,6,8]

Takewhile, without redundant parentheses

fun takewhile p [] = []
  | takewhile p (x::xs) =
      if p x then  x ::  takewhile p xs
      else []

fun even x = (x mod 2 = 0)
val res8 = takewhile even [2,4,5,7]
val res9 = takewhile even [3,4,6,8]

Drop while

fun dropwhile p [] = []
  | dropwhile p (zs as (x::xs)) = 
      if p x then (dropwhile p xs) else zs
val res10 = dropwhile even [2,4,5,7]
val res11 = dropwhile even [3,4,6,8]

(* fancy pattern form: zs as (x::xs) *

Dropwhile, without redundant parentheses

fun dropwhile p []              = []
  | dropwhile p (zs as (x::xs)) =
      if p x then  dropwhile p xs  else zs
val res10 = dropwhile even [2,4,5,7]
val res11 = dropwhile even [3,4,6,8]

(* fancy pattern form: zs as (x::xs) *)

Folds

fun foldr p zero [] = zero
  | foldr p zero (x::xs) = p (x, (foldr p zero xs))
 
fun foldl p zero [] = zero
  | foldl p zero (x::xs) = foldl p (p (x, zero)) xs


val res12 = foldr (op +)  0 [1,2,3,4] 
val res13 = foldl (op * ) 1 [1,2,3,4] 

(* Note 'op' to use an infix operator as a value. *)

Folds, without redundant parentheses

fun foldr p zero []      = zero
  | foldr p zero (x::xs) = p (x,  foldr p zero xs )

fun foldl p zero []      = zero
  | foldl p zero (x::xs) = foldl p (p (x, zero)) xs


val res12 = foldr (op +)  0 [1,2,3,4]
val res13 = foldl (op * ) 1 [1,2,3,4]

(* Note 'op' to use infix operator as a value *)

ML—The Five Questions

Syntax: definitions, expressions, patterns, types

Values: num/string/bool, record/tuple, algebraic data

Environments: names stand for values (and types)

Evaluation: uScheme + case and pattern matching

Initial Basis: medium size; emphasizes lists

(Question Six: type system—a coming attraction)

16 Oct 2017: Programming with constructed data and types

There are PDF slides for 10/17/2017.

Announcements

Continuations HW due tonight
ML homework is now available. Due 10/25.

Today

Datatypes
Types, Patterns, Exceptions
ML Traps & Pitfalls

Last Time

Semantics of let
Scheme wrap up
Introduction to ML: functions and patterns

A note about books

Ullman is easy to digest

Ullman costs money but saves time

Ullman is clueless about good style

Suggestion:

Learn the syntax from Ullman
Learn style from Ramsey, Harper, and Tofte

Details in course guide Learning Standard ML

Foundation: Data

Syntax is always the presenting complaint, but data is what’s always important

Base types: int, real, bool, char, string
Functions
Constructed data:
- Tuples: pairs, triples, etc
- (Records with named fields)
- Lists and other algebraic data types

“Distinguish one cons cell (or one record) from another”

Algebraic Datatypes

Enumerated types

Datatypes can define an enumerated type and associated values.

datatype suit = heart | diamond | spade | club

Here suit is the name of a new type.

The data constructors heart, dimaond, spade, and club are the values of type suit.

Data constructors are separated by vertical bars.

Pattern matching

Datatypes are deconstructed using pattern matching.

fun toString heart = "heart"
  | toString diamond = "diamond"
  | toString spade = "spade"
  | toString club = "club"

val suitName = toString heart

But wait, there’s more: Data constructors can take arguments!

datatype IntTree = Leaf | Node of int * IntTree * IntTree

IntTree is the name of a new type.

There are two data constructors: Leaf and Node.

Nodes take a tuple of three arguments: a value at the node, and left and right subtrees.

The keyword of separates the name of the data constructor and the type of its argument.

When fully applied, data constructors have the type of the defining datatype (ie, IntTree).

Building values with constructors

We build values of type IntTree using the associated constructors: (Draw on board)

 val tempty = Leaf
 val t1 = Node (1, tempty, tempty)
 val t2 = Node (2, t1, t1)
 val t3 = Node (3, t2, t2)

What is the in-order traversal of t3?

 [1,2,1,3,1,2,1]

What is the pre-order traversal of t3?

 [3,2,1,1,2,1,1]

Deconstruct values with pattern matching

(The @ symbol denotes append in ML)

fun inOrder Leaf = []
  | inOrder (Node (v, left, right)) = 
       (inOrder left) @ [v] @ (inOrder right)

val il3 = inOrder t3

fun preOrder Leaf = []
  | preOrder (Node (v, left, right)) = 
       v :: (preOrder left) @ (preOrder right)

val pl3 = preOrder t3

IntTree is monomorphic because it has a single type.

Note though that the inOrder and preOrder functions only cared about the structure of the tree, not the payload value at each node.

But wait, there’s still more: Polymorphic datatypes!

Polymorphic datatypes are written using type variables that can be instantiated with any type.

datatype 'a tree = Child | Parent of 'a * 'a tree * 'a tree

tree is a type constructor (written in post-fix notation), which means it produces a type when applied to a type argument.

Examples:

int tree is a tree of integers
bool tree is a tree of booleans
char tree is a tree of characters
int list tree is a tree of a list of integers.

'a is a type variable: it can represent any type.

It is introduced on the left-hand of the = sign. References on the right-hand side are types.

Child and Parent are data constructors.

Child takes no arguments, and so has type 'a tree

When given a value of type 'a and two 'a trees, Parent produces a 'a tree

Constructors build `tree` values

val empty = Child
val tint1 = Parent (1, empty, empty)
val tint2 = Parent (2, tint1, tint1)
val tint3 = Parent (3, tint2, tint2)

val tstr1 = Parent ("a", empty, empty)
val tstr2 = Parent ("b", tstr1, tstr1)
val tstr3 = Parent ("c", tstr2, tstr2)

Pattern matching deconstructs `tree` values

fun inOrder Child = []
  | inOrder (Parent (v, left, right)) = 
       (inOrder left) @ [v] @ (inOrder right)

fun preOrder Child = []
  | preOrder (Parent (v, left, right)) = 
       v :: (preOrder left) @ (preOrder right)

Functions inOrder and preOrder are polymorphic: they work on any value of type 'a tree. 'a is a type variable and can be replaced with any type.

Things to notice about datatypes

Environments

Datatype declarations introduce names into:

the type environment: suit, IntTree, tree
the value environment: heart, Leaf, Parent

Inductive

Datatype declarations are inherently inductive:

the type IntTree appears in its own definition
the type tree appears in its own definition

Datatype Exercise

Exercise answers

datatype sx1 = ATOM1 of atom
             | LIST1 of sx1 list

datatype sx2 = ATOM2 of atom
             | PAIR2 of sx2 * sx2

Case expressions: How we use datatypes

Eliminate values of algebraic types

New language construct case (an expression)

fun length xs =
  case xs
    of []      => 0
     | (x::xs) => 1 + length xs

At top level, fun better than case

When possible, write

fun length []      = 0
  | length (x::xs) = 1 + length xs

case works for any datatype

 fun toStr t = 
     case t 
       of Leaf => "Leaf"
        | Node(v,left,right) => "Node"

But often pattern matching is better style:

 fun toStr' Leaf = "Leaf"
   | toStr' (Node (v,left,right)) = "Node"

Bonus: Talking type theory: Introduction and elimination constructs

Part of learning any new field: talk to people in their native vocabulary

Introduce means “produce”, “create”, “make”, “define”
Eliminate means “consume”, “examine”, “observe”, “use”

It’s like knowing what to say when somebody sneezes.

Tuple Pattern Matching

val (x,y) = (1,2)

val (left, pivot, right) = split xs

val (n,xs) = (3, [1,2,3])

val (x::xs) = [1,2,3]

val (_::xs) = [1,2,3]

Exceptions: Handling unusual circumstances

Syntax:

Declaration: exception EmptyQueue
Introduction: raise e where e : exn
Elimination: e1 handle pat => e2

Informal Semantics:

alternative to normal termination
can happen to any expression
tied to function call
- if evaluation of body raises exn, call raises exn
Handler uses pattern matching

e handle pat1 => e1 | pat2 => e2

Exception handling in action


    loop (evaldef (reader (), rho, echo))
    handle EOF            => finish ()
      | Div               => continue "Division by zero"
      | Overflow          => continue "Arith overflow"
      | RuntimeError msg  => continue ("error: " ^ msg)
      | IO.Io {name, ...} => continue ("I/O error: " ^
                                       name)
      | SyntaxError msg   => continue ("error: " ^ msg)
      | NotFound n        => continue (n ^ "not found")

Bonus Content: ML traps and pitfalls

Order of clauses matters


fun take n (x::xs) = x :: take (n-1) xs
  | take 0 xs      = []
  | take n []      = []

(* what goes wrong? *)

Gotcha — overloading

- fun plus x y = x + y;
> val plus = fn : int -> int -> int
- fun plus x y = x + y : real;
> val plus = fn : real -> real -> real

Gotcha — parentheses

Put parentheses around anything with |

case, handle, fn

Function application has higher precedence than any infix operator

Bonus content (seen in examples)

Syntactic sugar for lists

- 1 :: 2 :: 3 :: 4 :: nil; (* :: associates to the right *)
> val it = [1, 2, 3, 4] : int list

- "the" :: "ML" :: "follies" :: [];
> val it = ["the", "ML", "follies"] : string list

> concat it;
val it = "theMLfollies" : string

Bonus content: ML from 10,000 feet

Environments

The value environment

Names bound to immutable values

Immutable ref and array values point to mutable locations

ML has no binding-changing assignment

Definitions add new bindings (hide old ones):

val pattern = exp
val rec pattern = exp
fun ident patterns = exp
datatype … = …

Nesting environments

At top level, definitions

Definitions contain expressions:

def ::= val pattern = exp

Expressions contain definitions:

exp ::= let defs in exp end

Sequence of defs has let-star semantics

Patterns

What is a pattern?

pattern ::= variable
          | wildcard
          | value-constructor [pattern]
          | tuple-pattern
          | record-pattern
          | integer-literal
          | list-pattern

Design bug: no lexical distinction between

VALUE CONSTRUCTORS
variables

Workaround: programming convention

Functions

Function pecularities: 1 argument

Each function takes 1 argument, returns 1 result

For “multiple arguments,” use tuples!

 fun factorial n =
   let fun f (i, prod) = 
         if i > n then prod else f (i+1, i*prod)
   in  f (1, 1)
   end


 fun factorial n =  (* you can also Curry *)
   let fun f i prod = 
         if i > n then prod else f (i+1) (i*prod)
   in  f 1 1
   end

Tuples are “usual and customary.”

Types

18 October 2017: Types

There are PDF slides for 10/19/2017.

Announcements

Handout (slides from today)

Today

Type systems
Typing rules for a simple language
Type checker for a simple language
Adding environments

Type systems

What kind of value do we have?

Slogan: “Types classify terms.”

 n + 1  : int

 "hello" ^ "world"  : string

 (fn n => n * (n - 1))  : int -> int

 if p then 1 else 0  : int,  provided that p : bool

Questions type systems can answer:

What kind of value does it evaluate to (if it terminates)?
What is the contract of the function?
Is each function called with the right number of arguments? (And similar errors)
Who has the rights to look at it/change it?
Is the number miles or millimeters?

Questions type systems generally cannot answer:

Will my program contain a division by zero?
Will my program contain an array bounds error?
Will my program take the car of `’()?
Will my program terminate?

Decidability and Type Checking

Suppose L is a “Turing-Complete” Language.

TP is the set of programs in L that terminate.

Wish: a type system to statically classify terminating programs:

Expression e in L has type T (e : T) iff e terminates.

But: Undecideable!

We can prove no such type system exists.

Choices:

Allow type checker to run forever.
Don’t use type system to track termination.

Static vs. Dynamic Type Checking

Most languages use a combination of static and dynamic checks

Static: “for all inputs”

input independent
efficient at run-time
approximate : rules out some programs that won’t trigger errors example: (if false then 2 else "hi") ^ "there"

Dynamic: “for some inputs”

depends on input
run-time overhead
precise

Type System and Checker for a Simple Language

Define an AST for expressions with:

Simple integer arithmetic operations
Numeric comparisons
Conditional
Numeric literal

Language of expressions

    datatype exp = ARITH of arithop * exp * exp
                 | CMP   of relop   * exp * exp
                 | LIT   of int
                 | IF    of exp     * exp * exp
    and      arithop = PLUS | MINUS | TIMES | ...
    and      relop   = EQ | NE | LT | LE | GT | GE

    datatype ty = INTTY | BOOLTY

Examples to rule out

Can’t add an integer and a boolean:

3 + (3 < 99)

(ARITH(PLUS, LIT 3, CMP (LT, LIT 3, LIT 99)))

Can’t compare an integer and a boolean

(3 < (4 = 24))

CMP (LT, LIT 3, CMP(EQ (LIT 4, LIT 24)))

Inference rules to define a type system

Form of judgment Context |- term : type

Written |- e : tau

Contexts vary between type systems

(Right now, the empty context)
Inference rules determine how to write type checker typeof : exp -> ty:

Given e, find tau such that |- e : tau
What inference rules do you recommend for this language?

Rule for arithmetic operators

Informal example:

|- 3 : int    |- 5 : int
------------------------------------------------------------
|- 3 + 5 : int

Rules out:

|- 'a' : char    |- 5 : int
------------------------------------------------------------
|- 'a' + 5 : ???

General form:

|- e1 : int    |- e2 : int
------------------------------------------------------------
|- ARITH ( _ , e1, e2) : int

Rule for comparisons

Informal example:

|- 7 : int    |- 10 : int
------------------------------------------------------------
|- 7 < 10 : bool

General form:

|- e1 : int    |- e2 : int
------------------------------------------------------------
|- CMP ( _ , e1, e2) : bool

Rule for literals

Informal example:

|- 14 : int

General form:

-----------------------------------
|- LIT (n) : int

Rule for conditionals:

Informal example:

|- true : bool    
|- 3    : int
|- 42   : int      
------------------------------------------------------------
|- IF (true, 3, 42) : int

General form:

|- e : bool    
|- e1 : tau1   
|- e2 : tau2      tau1 equiv tau2
------------------------------------------------------------
|- IF ( e, e1, e2) : tau1

Experience shows it is better to test two types for equivalence than to write rule with same type appearing twice.

Typing rules let us read off what a type checker needs to do.

input to checker: e
output from checker: tau

What is a type?

Working definition: a set of values
Precise definition: classifier for terms!!
- The relationship to values becomes a proof obligation.
- Note: a computation can have a type even if it doesn’t terminate! (Or doesn’t produce a value)

Type checker in ML

val typeof : exp -> ty
exception IllTyped
fun typeof (ARITH (_, e1, e2)) = 
      case (typeof e1, typeof e2) 
        of (INTTY, INTTY) => INTTY
         | _              => raise IllTyped
  | typeof (CMP (_, e1, e2)) = 
      case (typeof e1, typeof e2) 
        of (INTTY, INTTY) => BOOLTY
         | _              => raise IllTyped
  | typeof (LIT _) = INTTY
  | typeof (IF (e,e1,e2)) = 
      case (typeof e, typeof e1, typeof e2) 
        of (BOOLTY, tau1, tau2) => 
           if eqType(tau1, tau2) 
           then tau1 else raise IllTyped
         | _                    => raise IllTyped

An implementor’s trick: If you see identical types in a rule,

Give each type a distinct subscript
Introduce equality constraints
Remember to be careful using primitive equality to check types—you are better off with eqType.

Typing Rules: Contexts and Term Variables

Add variables and let binding to our language, what happens?

Extended language of expressions

    datatype exp = ARITH of arithop * exp * exp
                 | CMP   of relop   * exp * exp
                 | LIT   of int
                 | IF    of exp     * exp * exp
                 | VAR   of name
                 | LET   of name    * exp * exp
    and      arithop = PLUS | MINUS | TIMES | ...
    and      relop   = EQ | NE | LT | LE | GT | GE

    datatype ty = INTTY | BOOLTY

What could go wrong with a variable?

Used inconsistently:

;; x can’t be both an integer and a list

x + x @ x

;; y can’t be both an integer and a string

let y = 10 in y ^ “hello” end
Need to track variable use to ensure consistency

Key idea: Type environment (Gamma) tracks the types of variables.

Rule for var

x in domain Gamma        tau = Gamma(x) 
------------------------------------------------------------
Gamma |- VAR x : tau

Rule for let

Gamma         |- e  : tau
Gamma{x->tau} |- e' : tau'   
------------------------------------------------------------
Gamma |- LET x = e in e' : tau'

Type Checker

Type checker needs Gamma – gives type of each “term variable”.

val typeof : ty env -> exp -> ty
fun typeof Gamma (ARITH ... ) =  <as before>
  | typeof Gamma (VAR x)      =
      case Gamma (x) 
        of Some tau => tau
         | None     => raise IllTyped
  | typeof Gamma (LET x, e1, e2) = 
           let tau1 = typeof Gamma e1
           in  typeof (extend Gamma x tau1) e2
           end

Review

I gave you syntax for simple language
You came up with typing rules
I showed you how to implement the type checker.
Then on your homework,
- You will design new syntax and typing rules for lists
- You will extend an existing type checker
- You will implement a full type checker from scratch

This is a big chunk of what language designers do.

23 October 2017: Type Checking with Type Constructors

There are PDF slides for 10/24/2017.

Announcements

Midterm: Wednesday Nov. 1 in class; 1-page self-prepared sheet of notes

Last Time

What are types?
Undecideability
Static vs. Dynamic types
Typing Rules
Type Checker
Type environment

Today

Type checking with type constructors
Formation, Introduction, and Elimination Rules

Functions

Introduction:

Gamma{x->tau1} |- e : tau2   
------------------------------------------------------------
Gamma |- fn x : tau1 => e  : tau1 -> tau2

Elimination:

Gamma |- e  : tau1 -> tau2   
Gamma |- e1 : tau1
------------------------------------------------------------
Gamma |- e e1 : tau2

Where we’ve been and where we’re going

New watershed in the homework

You’ve been developing and polishing programming skills: recursion, higher-order functions, using types to your advantage. But the problems have been mostly simple problems around simple data structures, mostly lists.
We’re now going to shift and spend the next several weeks doing real programming-languages stuff, starting with type systems.
You’ve already seen everything you need to know to implement a basic type checker, and you are almost fully equipped to add array operations and types to Typed Impcore.

What’s next is much more sophisticated type systems, with an infinite number of types. We’ll focus on two questions about type systems:

What is and is not a good type, that is, a classifier for terms?
How shall we represent types?

We’ll look at these questions in two contexts: monomorphic and polymorphic languages.

Monomorphic vs Polymorphic Types

Monomorphic types have “one shape.”

Examples: int, bool, int -> bool, int * int

Polymorphic types have “many shapes.”

Examples: 'a list, 'a list -> 'a list, ('a * int)

Design and implementation of monomorphic languages

Mechanisms:

Every new variety of type requires special syntax
Implementation is a straightforward application of what you already know.

Language designer’s process when adding new kinds of types:

What new types do I have (formation rules)?
What new syntax do I have to create new values with that type (introduction rules)?

For introduce think “produce”, “create”, “make”, “define”
What new syntax do I have to observe terms of a type (elimination rules)?

For eliminate think “consume”, “examine”, “interrogate”, “look inside”, or “take apart”, “observe”, “use”, “mutate”

Words “introduce” and “eliminate” are the native vocabulary of type-theoretic language design—it’s like knowing what to say when somebody sneezes.

Question: If I add lists to a language, how many new types am I introducing?

Managing the set of types: Type Formation

Examples: Well-formed types

These are types:

int
bool
int * bool
int * int -> int

Examples: Not yet types, or not types at all

These “types in waiting” don’t classify any terms

list (but int list is a type)
array (but char array is a type)
ref (but (int -> int) ref is a type)

These are utter nonsense

int int
bool * array

Type formation rules

We need a way to classify type expressions into:

types that classify terms
type constructors that build types
nonsense terms that don’t mean anything

Type constructors

Technical name for “types in waiting”

Given zero or more arguments, produce a type:

Nullary int, bool, char also called base types
Unary list, array, ref
Binary (infix) ->

More complex type constructors:

records/structs
function in C, uScheme, Impcore

Type judgments for monomorphic system

Two judgments:

The old typing judgment Γ ⊢ e : τ
Today’s judgment “τ is a type”

Monomorphic type rules

Notice: one rule for if!!

Classic types for data structures

(At run time, identical to cons, car, cdr)

Typical syntactic support for types

Explicit types on lambda and define:

For lambda, argument types:

(lambda ([n : int] [m : int]) (+ (* n n) (* m m)))

For define, argument and result types:

(define int max ([x : int] [y : int]) 
                (if (< x y) y x))

Abstract syntax:

datatype exp = ...
 | LAMBDA of (name * tyex) list * exp
    ...
datatype def = ...
 | DEFINE of name * tyex * ((name * tyex) list * exp)
    ...

Typing Rule Exercise

Coding the arrow-introduction rule

Type-checking LAMBDA

datatype exp = LAMBDA of (name * tyex) list * exp 
   ...
fun ty (Gamma, LAMBDA (formals, body)) = 
  let val Gamma' = (* body gets new env *)
        foldl (fn ((x, ty), g) => bind (x, ty, g))
              Gamma formals
      val bodytype = ty(Gamma', body)
      val formaltypes = 
        map (fn (x, ty) => ty) formals
  in  funtype (formaltypes, bodytype)
  end

25 October 2017: Polymorphic Type Checking

There are PDF slides for 10/26/2017.

Announcements

Continuation homework returned
ML HW due tonight.
Midterm: A week from today

Last Time

Monomorphic type systems (Typed Impcore)
Typing rules
- Formation: Is this type valid for classifying terms?
- Introduction: How to I create a value of this type?
- Elimination: How do I use a value of this type?
Examples:
- Pairs, functions, arrays, and references

Today

Polymorphic type systems (TypedUScheme)
Generic type representations
Kinds for classifying types

Limitations of monomorphic type systems

Monomorphic types are limiting

Each new type constructor requires

Special syntax
New type rules
New internal representation (type formation)
New code in type checker (intro, elim)
New or revised proof of soundness

Notes:

Implementing arrays on homework
Writing rules for lists on homework

Monomorphism hurts programmers too

Monomorphism leads to code duplication

User-defined functions are monomorphic:

(define int lengthI ([xs : (list int)])
   (if (null? xs) 0 (+ 1 (lengthI (cdr xs)))))
(define int lengthB ([xs : (list bool)])
   (if (null? xs) 0 (+ 1 (lengthB (cdr xs)))))
(define int lengthS ([xs : (list sym)])
   (if (null? xs) 0 (+ 1 (lengthS (cdr xs)))))

Quantified types

Type formation via kinds

``’’???

Back up here—what types do we have?

Type formation: Composing types

Typed Impcore:

Closed world (no new types)
Simple formation rules

Standard ML:

Open world (programmers create new types)
How are types formed (from other types)?

Can’t add new syntactic forms and new type formation rules for every new type.

Well-formed types

We still need to classify type expressions into:

types that classify terms (e.g., int)
type constructors that build types (e.g., list)
nonsense that means nothing (e.g., int int)

Idea: kinds classify types

one-off type-formation rules

Δ tracks type constructors, vars

Polymorphic Type Checking

Quantified types

Representing quantified types

Two new alternatives for tyex:

datatype tyex
  = TYCON  of name
  | CONAPP of tyex * tyex list
  | FUNTY  of tyex list * tyex
  | TYVAR  of name
  | FORALL of name list * tyex

Programming with quantified types

Substitute for quantified variables

-> length                                                                          
<procedure> : (forall ('a) ((list 'a) -> int))                                     
-> (@ length int)                                                                  
<procedure> : ((list int) -> int)                                                  
-> (length '(1 2 3))
type error: function is polymorphic; instantiate before applying
-> ((@ length int) '(1 2 3))
3 : int

Substitute what you like

-> length
 : (forall ('a) ((list 'a) -> int))
-> (@ length bool)
 : ((list bool) -> int)
-> ((@ length bool) '(#t #f))
2 : int

More ``Instantiations’’

-> (val length-int (@ length int))                                                 
length-int : ((list int) -> int)                                                   
-> (val cons-bool (@ cons bool))
cons-bool : ((bool (list bool)) ->
                                (list bool))
-> (val cdr-sym (@ cdr sym))
cdr-sym : ((list sym) -> (list sym))
-> (val empty-int (@ '() int))
() : (list int)

Bonus instantiation:

-> map
<procedure> :
  (forall ('a 'b)
    (('a -> 'b) (list 'a) -> (list 'b)))
-> (@ map int bool)
<procedure> :
  ((int -> bool) (list int) -> (list bool))

Create your own!

Abstract over unknown type using type-lambda

  -> (val id (type-lambda ['a]
                (lambda ([x : 'a]) x )))
  id : (forall ('a) ('a -> 'a))

'a is type parameter (an unknown type)

This feature is parametric polymorphism

Two forms of abstraction:

Power comes at notational cost

Function composition

-> (val o (type-lambda ['a 'b 'c]
    (lambda ([f : ('b -> 'c)]
             [g : ('a -> 'b)])
     (lambda ([x : 'a]) (f (g x))))))

o : (forall ('a 'b 'c)
       (('b -> 'c) ('a -> 'b) -> ('a -> 'c)))

Aka o :

Type rules for polymorphism

A phase distinction embodied in code


-> (val x 3)
3 : int
-> (val y (+ x x))
6 : int

fun processDef (d, (delta, gamma, rho)) =
  let val (gamma', tystring)  = elabdef (d, gamma, delta)
      val (rho',   valstring) = evaldef (d, rho)
      val _ = print (valstring ^ " : " ^ tystring)
  in  (delta, gamma', rho')
  end

Type formation through kinds

Bonus content: a definition manipulates three environments

30 October 2017: Midterm Review

There are PDF slides for 10/31/2017.

Announcements

ML HW returned on Saturday; Course summary on Sunday
Midterm: Wednesday in class; 1-page self-prepared sheet of notes
Course evaluations: At the end of class today.

Last Time

Kinds classify type constructors into:
- *: types (nullary type constructors that classify values)
- - => *: type constructors that become types when applied to some number of arguments
- nonsense type expressions
Polymorphic types
- Introduction: Type Lambda
  - Example: (type-lambda ['a] (lambda ([x : 'a] x)))
- Elimination: Explicit Instantiation (@ length int)
  - Example: length : (forall ('a) ('a list -> int)
  - Example: (@ length int) : int list -> int

Today

Midterm review
Sample Problems
Course Evaluations

Midterm Review

Plan on:

Writing some code uScheme and ML code
Reasoning about code (uScheme or ML)
Working with semantics

Recursion and Induction

Write recursive functions
- Explain why function terminates
Prove properties using induction
- Explain why induction is well-founded
Algebraic laws can lead naturally to recursive functions and inductive proofs

Understanding a language: Key Questions

What is the abstract syntax?
What are the values?
What are the environments?
How does evaluation happen?
What is the initial basis?
What are the types?

First-class functions

What they are
How to use them effectively
Lambdas create anonymous functions
Closures are run-time representation of functions; they capture the environment at closure-definition time
Continuations capture the rest of the computation

Local bindings

Various forms: let, let*, and letrec
What is let used for?
How do the various forms differ?

Data structures and associated operations

S-expressions ((), null?, cons, car, cdr, …)
Lists
Tuples

Cost Models and Optimizations

How many cons cells?
How many activation records?
Method of accumulating parameters
Tail calls

Operational semantics

Precisely describe meaning of programs
- What value does a program evaluate to?
- What side effects are caused in the process?
Written using inference rules
Judgement forms capture all relevant information
Environments track information about variables
Stores map locations to values

Datatypes
- Declarations introduce type constructor & data constructors
- Datatypes can be recursive
- Type variables allow polymorphic data structures

ML Pattern Matching

Deconstruct values: datatypes, lists, tuples, …
Bind variables
Appear in function definitions, case expressions, and let bindings

ML Exceptions

Declarating, raising, handling

6 November 2017: Type Inference

There are PDF slides for 11/7/2017.

Announcements

Midterm: Returned at the end of class today.

Final: Thursday, December 14, 8:30 to 10:30.

Send email to comp105-grades@cs.tufts.edu if you have another exam at the same time.

HW: Type Checking due Wednesday 11/8

Today

Type Inference Intuition

Key Ideas:

Fresh type variables represent unknown types.
- Example: In (lambda (x) (+ x 3)), assign x fresh type variable α
Constraints record knowledge about type variables.
- Example: α ≡ int

Why Study?

Useful in its own right (as we’ll see shortly)
Canonical example of Static Analysis, which is key tool in cybersecurity

What type inference accomplishes

-> (define     double (x)       (+ x x))
double                         ;; uScheme
-> (define int double ([x : int]) (+ x x))
double : (int -> int)          ;; Typed uSch.
-> (define     double (x)       (+ x x))
double : int -> int            ;; nML

The compiler tells you useful information and there is a lower annotation burden.

What else type inference accomplishes

-> ((@ cons bool) #t ((@ cons bool) #f (@ '() bool)))
(#t #f) : (list bool)    ;; typed uScheme
-> (   cons       #t (   cons       #f    '()      ))
(#t #f) : bool list      ;; nML

How it works

For each unknown type, introduce a fresh type variable
Every typing rule adds equality constraints
Instantiate every variable automatically
Introduce polymorphism at let/val bindings

Let’s do an example on the board

(val-rec double (lambda (x) (+ x x)))

What do we know?

double has type ′a₁
x has type ′a₂
+ has type int * int -> int
(+ x x) is an application, what does it require?
- ′a2 = int and ′a2 = int
Is this possible?

Key idea: Record the constraint in a typing judgement.

'a2 = int /\ 'a2 = int, { double : 'a1, x : 'a2 } |- (+ x x) : int

General form of typing judgement:

C, Gamma |- e : tau

which is pronounced “Assuming the constraints in C are true, in environment Gamma, expression e has type tau.”

Example: if

(if y 1 0)
y has type ′a3, 1 has type int, 0 has type int
Requires what constraints? (int = int, ′a3 = bool`)

Example:

(if z z (- 0 z))
z has type ′a3, 0 has type int, - has type int * int -> int
Requires what constraints? (′a3 = bool /\ int = int /\ ′a3 = int)
Is this possible?
Why not?

Inferring polymorphic types

(val app2 = (lambda (f x y)
               (begin
                   (f x)
                   (f y))))

Assume f : 'a_f

Assume x : 'a_x

Assume y : 'a_y

f x implies 'a_f = 'a_x -> 'a1

f y implies 'a_f = 'a_y -> 'a2

Together, these constraints imply 'a_x = 'a_y and 'a1 = 'a2

begin implies result of function is 'a2

So, app2 : ('a_x -> 'a1) * 'a_x * 'a_x -> 'a1

'a_x and 'a aren’t mentioned anywhere else in program, so

we can generalize to:

forall 'a_x, 'a1 . ('a_x -> 'a1) * 'a_x * 'a_x -> 'a1

which is the same thing as:

forall 'a, 'b . ('a -> 'b) * 'a * 'a -> 'b

Exercise: Give the type of cc


let val cc = (lambda (nss) (car (car nss)))

Assume nss : 'b

We know car : forall 'a . 'a list -> 'a

=> car_1 : 'a1 list -> 'a1

=> car_2 : 'a2 list -> 'a2

(car_1 nss) => 'b = 'a1 list

(car_2 (car_1 nss)) => 'a1 = 'a2 list

(car_2 (car_1 nss)) : 'a2

nss : 'b : 'a1 list : ('a2 list) list

So, cc : ('a2 list) list -> 'a2

Because 'a2 is unconstrained, we can generalize:

cc : forall 'a . ('a2 list) list -> 'a

Exercise: Give the type of cc


let val cc = (lambda (nss) (car (car nss)))

forall 'a . 'a list list -> 'a

8 November 2017: Formalizing Type Inference and Instantiation

There are PDF slides for 11/9/2017.

Announcements

Last Time

Type inference
Type variables represent unknown types
Type constraints record requirements on those types
Constraint judgement C, Gamma |- e : tau

Today

Formalizing type inference

Moving from type schemes to types (Instantiation)

Moving from types to type schemes (Generalization)

Formalizing Type Inference

Sad news:

Type inference for polymorphism is undecidable.

Solution:

Each formal parameter has a monomorphic type.

Consequences:

Polymorphic functions are not first class.
The argument to a higher-order function cannot be polymorphic
forall appears only outermost in types

Representing Hindley-Milner types

datatype ty
  = TYVAR  of name        
  | TYCON  of name        
  | CONAPP of ty * ty list

datatype type_scheme
  = FORALL of name list * ty

What you know and can do now

Your skills so far

You can complete typeof

Takes e and Γ, returns τ and C

(Except for let forms.)

Next up: solving constraints!

Solving Constraints

Representing Constraints

datatype con = ~   of ty  * ty
             | /\  of con * con
             | TRIVIAL
infix 4 ~
infix 3 /\

Question: What does a solution to a set of constraints look like?

Answer: A substitution/mapping from types variables to types: θ.

Examples

Which have solutions?

1. int        ~ bool
2. int list   ~ bool list
3. 'a         ~ int
4. 'a         ~ int list
5. 'a         ~ int -> int
6. 'a         ~ 'a
7. 'a * int   ~ bool * 'b
8. 'a * int   ~ bool -> 'b
9. 'a         ~ ('a, int)
10. 'a        ~ tau        (arbitrary tau)

Examples

Which have solutions?

1. int        ~ bool    No
2. int list   ~ bool list   No
3. 'a         ~ int         'a |-> int
4. 'a         ~ int list    'a |-> int list
5. 'a         ~ int -> int  'a |-> int -> int
6. 'a         ~ 'a          'a |-> 'a
7. 'a * int   ~ bool * 'b   'a |-> bool and 'b |-> int
8. 'a * int   ~ bool -> 'b  No
9. 'a         ~ ('a, int)   No
10. 'a        ~ tau         depends if 'a in free-vars(tau)

Question: in solving tau1 ~ tau2, how many potential cases are there to consider?

Question: how are you going to handle each case?

What you know and can do after this lecture

What you can do now

After this lecture, you can write solve, a function which, given a constraint C, has one of three outcomes:

Returns the identity substitution in the case where C is trivially satisfied
Returns a non-trivial substitution if C is satisfiable otherwise.
Calls unsatisfiableEquality in when C cannot be satisfied

You can also write a type inferencer ty for everything except let forms. (Coming Monday)

From Type Scheme to Type

Why the freshness requirement?

Consider

Gamma = {fst : forall 'a 'b. 'a * 'b -> 'a, y : 'ay}

??, Gamma |- if (y, fst 2 3, 4) : ??

Imagine we ignore the freshness constraint when instantiating fst:

fst : 'ay * 'b -> 'ay

From if, we get the constraints:

'ay ~ bool

'ay ~ int

which aren’t satisfiable. Which means this code would be erroneously flagged as an error.

Correct typing:

 'ay ~ bool, Gamma |- if (y, fst 2 3, 4) : int

Why the distinctness requirement?

fst : 'a * 'a -> 'a

Gamma |- fst 2 #t

Application rule yields constraints:

'a ~ int

'a ~ bool

which aren’t satisfiable. Which means this code would also be erroneously flagged as an error.

Correct typing:

Gamma |- fst 2 #t : int

13 November 2017: Generalization

There are PDF slides for 11/14/2017.

Announcements

Last Time

Formalizing type inference
- Judgement form: C,Gamma |- e1, … en : tau1, …, taun
- Representing constraints
- Solving constraints: tau1 ~ tau2 and C1 / C2
From type scheme to types: Instantiation
- Freshness and Distinctness requirements

Today

Generalization: going from types to type schemes

Inference rule for val
Inference for let
Inference for val-rec and let-rec

From Type to Type Scheme

The set A above will be useful when some variables in τ are mentioned in the environment.

We can’t generalize over those variables.

Applying idea to the type inferred for the function fst:

 generalize('a * 'b -> 'a, emptyset) = forall 'a, 'b. 'a * 'b -> 'a

Note the new judgement form above for type checking a declaration.

On the condition ΘΓ = Γ: Γ is “input”: it can’t be changed.
The condition ensures that Θ doen’t conflict with Γ.

We can’t generalize over free type variables in Γ.

Their presence indicates they can be used somewhere else, and hence they aren’t free to be instantiated with any type.

Let Examples

(lambda (ys) ; OK
   (let ((s (lambda (x) (cons x '()))))
      (pair (s 1) (s #t))))

(lambda (ys) ; Oops!
   (let ((extend (lambda (x) (cons x ys))))
      (pair (extend 1) (extend #t))))

(lambda (ys) ; OK
    (let ((extend (lambda (x) (cons x ys))))
       (extend 1)))

Let with constraints, operationally:

typesof: returns τ₁, …, τ_n and C
C-prime from map, conjoinConstraints, dom, inter, freetyvarsGamma
val theta = solve C'
freetyvarsGamma, union, freetyvarsConstraint
Map anonymous lambda using generalize, get all the σ_i
Extend the typing environment Gamma (pairfoldr)
Recursive call to type checker, gets C_b, \tau
Return (tau, C' /\ C_b)

Forall things

Managing Quantified types
`val` and `val-rec`	`let`, `letrec`, …	`lambda`
FORALL contains all variables (because none are free in the context)	FORALL contains variables not free in the context	FORALL is empty
Generalize over all variables (because none are free in the context)	Generalize over variables not free in the context	Never generalize

15 November 2017: Hiding information with abstract data types

There are PDF slides for 11/16/2017.

Announcements

Last Time

Generalization: Going from types to type schemes
Inferring types for val, val-rec, let, and let-rec

Today

Module Systems
Structures/Implementations
Signatures/Interfaces

Where have we been?

Programming in the small
Expressive power
Success stories:
- First-class functions
- Algebraic data types and pattern matching
- Polymorphic type systems

What about big programs?

An area of agreement and a great divide:

                 INFORMATION HIDING
                     /         \
 modular reasoning  /           \  code reuse
                   /             \ 
internal access   /               \  interoperability 
to rep           /                 \  between reps
                /                   \
            MODULES               OBJECTS           
        ABSTRACT TYPES

Why modules?

Unlocking the final door for building large software systems

You have all gotten good at first-class functions, algebraic data types, and polymorphic types
Modules are the last tool you need to build big systems
```
Implementation      Interface
```
| | | |

| Module | | I |

| | | |
```
        ^            ^
        |            |
```
Nitty gritty - - Stuff you want others to see

Modules overview

Functions of a true module system:

Hide representations, implementations, private names
“Firewall” separately compiled units (promote independent compilation)
Possibly reuse units

Real modules include separately compilable interfaces and implementations

Designers almost always choose static type checking, which should be “modular” (i.e., independent)
C and C++ are cheap imitations
- C doesn’t provide namespaces
- C++ doesn’t provide modular type checking for templates

Interfaces

Collect declarations

Unlike definition, provides partial information about a name (usually environment and type)

Things typically declared:

Variables or constants (values, mutable or immutable)
Types
Procedures (if different from values)
Exceptions

Key idea: a declared type can be abstract

Terminology: a module is a client of the interfaces it depends on

Roles of interfaces in programming:

The unit of sharing and reuse
Explainer of libraries
Underlie component technology

The best-proven technology for structuring large systems.

Ways of thinking about interfaces

Means of hiding information (ask “what secret does it hide?”)
A way to limit what we have to understand about a program
- Estimated force multiplier x10
A contract between programmers
- Essential for large systems
- Parties might be you and your future self
Interface is the specification or contract that a module implements
- Includes contracts for all declared functions

Two approaches to writing interfaces

Interface “projected” from implementation:

No separate interface
Compiler extracts from implementation
(CLU, Java class, Haskell)
When code changes, must extract again
Few cognitive benefits

Full interfaces:

Distinct file, separately compiled
(Caml, Java interface, Modula, Ada)
Implementations can change independently
Full cognitive benefits

Module Implementations

Holds all dynamically executed code (expressions/statements)
May include private names
May depend only on interfaces, or on interfaces and implementations both (max cognitive benefits when all dependency is mediated by interfaces)
Dependencies may be implicit or explicit (import, require, use)

Standard ML Modules

The Perl of module languages?

Has all known features
Supports all known styles
Rejoice at the expressive power
Weep at the terminology, the redundancy, the bad design decisions

What we’ve been using so far is the core language

Modules are a separate language layered on top.

ML module terminology

Interface is a signature

Implementation is a structure

Generic module is a functor

A compile-time function over structures
The point: reuse without violating abstraction

Structures and functors match signature

Analogy: Signatures are the ``types’’ of structures.

Signature basics

Signature says what’s in a structure

Specify types (w/kind), values (w/type), exceptions.

Ordinary type examples:

    type t        // abstract type, kind *
    eqtype t
    type t = ...  // 'manifest' type
    datatype t = ...

Type constructors work too

    type 'a t     // abstract, kind * => *   
    eqtype 'a t
    type 'a t = ...
    datatype 'a t = ...

ML Modules examples, part I

Signature example: Ordering


signature ORDERED = sig
  type t
  val lt : t * t -> bool
  val eq : t * t -> bool
end

Signature example: Integers

signature INTEGER = sig
  eqtype int             (* <-- ABSTRACT type *)
  val ~   : int -> int
  val +   : int * int -> int
  val -   : int * int -> int
  val *   : int * int -> int
  val div : int * int -> int
  val mod : int * int -> int
  val >   : int * int -> bool
  val >=  : int * int -> bool
  val <   : int * int -> bool
  val <=  : int * int -> bool
  val compare : int * int -> order
  val toString   : int    -> string
  val fromString : string -> int option
end

Implementations of integers

A selection…

structure Int    :> INTEGER
structure Int31  :> INTEGER  (* optional *)
structure Int32  :> INTEGER  (* optional *)
structure Int64  :> INTEGER  (* optional *)
structure IntInf :> INTEGER  (* optional *)

What about natural numbers?

signature NATURAL = sig
   type nat   (* abstract, NOT 'eqtype' *)
   exception Negative
   exception BadDivisor

   val of_int   : int -> nat 
   val /+/      : nat * nat -> nat
   val /-/      : nat * nat -> nat
   val /*/      : nat * nat -> nat
   val sdiv     : nat * int -> 
                  { quotient : nat, remainder : int }
   val compare  : nat * nat -> order
   val decimals : nat -> int list
end

Signatures collect

signature QUEUE = sig
  type 'a queue    (* another abstract type *)
  exception Empty

  val empty : 'a queue
  val put : 'a * 'a queue -> 'a queue
  val get : 'a queue -> 'a * 'a queue   (* raises Empty *)

  (* LAWS:  get(put(a, empty))     ==  (a, empty)
            ...
   *)
end

Structures collect definitions

structure Queue :> QUEUE = struct   (* opaque seal *)
  type 'a queue = 'a list
  exception Empty

  val empty = []
  fun put (x,q) = q @ [x]
  fun get [] = raise Empty
    | get (x :: xs) = (x, xs)


  (* LAWS:  get(put(a, empty))     ==  (a, empty)
            ...
   *)
end

Your turn! Signature for a stack

  structure Stack = struct
     type 'a stack = 'a list
     exception Empty
     val empty = []
     val push  = op ::
     fun pop []            = raise Empty
       | pop (top :: rest) = (top, rest)
  end

Your turn! Signature for a stack

  signature STACK = sig
     type 'a stack 
     exception Empty
     val empty : 'a stack
     val push  : 'a * 'a stack -> 'a stack
     val pop   : 'a stack -> 'a * 'a stack
  end

Dot notation to access elements

structure Queue :> QUEUE = struct
  type 'a queue = 'a list
  exception Empty

  val empty = []
  fun put (q, x) = q @ [x]
  fun get [] = raise Empty
    | get (x :: xs) = (x, xs)
end

fun single (x:'a) : 'a Queue.queue = 
   Queue.put(Queue.empty, x)

What interface with what implementation?

Maybe mixed together, extracted by compiler!

CLU, Haskell

Maybe matched by name:

Modula-3, Modula-3, Ada

Best: any interface with any implementation:

Java, Standard ML

But: not “any”—only some matches are OK

Signature Matching

Well-formed

 structure Queue :> QUEUE = QueueImpl

if principal signature of QueueImpl matches ascribed signature QUEUE:

Every type in QUEUE is in QueueImpl
Every exception in QUEUE is in QueueImpl
Every value in QUEUE is in QueueImp
(type could be more polymorphic)
Every substructure matches, too (none here)

Signature Ascription

Ascription attaches signature to structure

Transparent Ascription: types are revealed
```
structure strid : sig_exp = struct_exp
```
This method is stupid and broken (legacy)
(But it’s awfully convenient)
Opaque Ascription: types are hidden (“sealing”)
```
structure strid :> sig_exp = struct_exp
```
This method respects abstraction
(And when you need to expose, can be tiresome)

Slogan: “use the beak”

Transparent Ascription

Not recommended!

Example:

  structure IntLT : ORDERED = struct
    type t = int
    val le = (op <)
    val eq = (op =)
 end

Exposed: IntLT.t = int

Violates abstraction

Opaque Ascription

Recommended

Example:

 structure Queue :> QUEUE = struct
   type 'a queue = 'a list
   exception Empty

   val empty = []
   fun put (x, q) = q @ [x]
   fun get [] = raise Empty
    | get (x :: xs) = (x, xs)
 end

Not exposed: 'a Queue.queue = 'a list

Respects abstraction

Abstract data types

How opaque ascription works

Outside module, no access to representation

Protects invariants
Allows software to evolve
Type system limits interoperability

Inside module, complete access to representation

Every function sees rep of every argument
Key distinction abstract type vs object

Data abstraction for reuse

Abstract data types and your homework

Two-player games:

Abstraction not as crisp as “number” or “queue”

Problems abstraction must solve:

Interact with human player via strings
(accept moves, display progress)
Know whose turn it is
Handle special features like “extra moves”
Provide API for computer player

Result: a very wide interface

Abstraction design: Computer player

Computer player should work with any game, provided

Up to two players
Complete information
Always terminates

Brute force: exhaustive search

Your turn! What does computer player need?

Types?
Exceptions?
Functions?

Our computer player: AGS

Any game has two key types:

  type config
  structure Move : sig
     type move
     ...  (* string conversion, etc *)
  end

Key functions use both types:

  val possmoves : config -> Move.move list
  val makemove  : config -> Move.move -> config

Multiple games with different config, move?

Yes! Using key feature of ML: functor

20 November 2017: Functors and an Extended SML Example

There are PDF slides for 11/21/2017.

Announcements

type inference homework due tonight
Module HW due Sunday 12/3

Last Time

Modules/structures
Interfaces/signatures
Ascription

Today

Functors
Extended SML example
Computation abstraction

Game interoperability with functors

functor AgsFun (structure Game : GAME) :> sig
  structure Game : GAME
  val bestmove : Game.config -> Game.Move.move option
  val forecast : Game.config -> Player.outcome
end
   where type Game.Move.move = Game.Move.move
   and   type Game.config    = Game.config
= struct
    structure Game = Game
    ... definitions of `bestmove`, `forecast` ...
  end

Functors

A functor is a function that operates on modules.

Formal parameters are declarations:

functor AddSingle(structure Q:QUEUE) = 
   struct
     structure Queue = Q
     fun single x = Q.put (Q.empty, x)
   end

Combines familiar ideas:

Higher-order functions
type-lambda

Using Functors

Functor applications are evaluated at compile time.

functor AddSingle(structure Q:QUEUE) = 
   struct
     structure Queue = Q
     fun single x = Q.put (Q.empty, x)
   end

Actual parameters are definitions

structure QueueS  = AddSingle(structure Q = Queue)
structure EQueueS = AddSingle(structure Q = EQueue)

where EQueue is a more efficient implementation

Functors on your homework

Separate compilation:

Heap sort without a heap

Code reuse with type abstraction

Abstract Game Solver
(any representation of game config, move)

ML module summary

New syntactic category: declaration

Of type, value, exception, or module

Signature groups declarations: interface

Structure groups definitions: implementation

Functor enables reuse:

Formal parameter: declarations
Actual parameter: definitions

Opaque ascription hides information

Enforces abstraction

Extended Example: Error-tracking Interpreter

An Extended Example

Error-tracking interpreter for a toy language

Why this example?

Lots of interfaces using ML signatures
Idea of how to compose large systems
Some ambitious, very abstract abstractions—it’s the end of class, and you should see something ambitious.
Practice implementing functors

Error modules: Three common implementations

Collect all errors
Keep just the first error
Keep the most severe error

Your obligations: two types, three functions, algebraic laws

Classic ``accumulator’’ for errors

signature ERROR = sig
  type error   (* a single error *)
  type summary (* summary of what errors occurred *)

  val nothing : summary                  (* no errors *)
  val <+> : summary * summary -> summary (* combine *)

  val oneError : error -> summary

  (* laws:   nothing <+> s == s
             s <+> nothing == s
             s1 <+> (s2 <+> s3) == (s1 <+> s2) <+> s3    
                                        // associativity
   *)
end

First Error Implementation

structure FirstError :> 
    ERROR where type error = string
          and type summary = string option =
  struct
    type error   = string
    type summary = string option

    val nothing = NONE
    fun <+> (NONE,   s) = s
      | <+> (SOME e, _) = SOME e

    val oneError = SOME
  end

All Error Implementation

structure AllErrors :>
    ERROR where type error   = string
            and type summary = string list =
  struct
    type error   = string
    type summary = error list

    val nothing = []
    val <+> = op @
    fun oneError e = [e]
  end

Computations Abstraction

Ambitious! (will make your head hurt a bit)

Computations either:

return a value
produce an error

Errors must be threaded through everything :-(

Exercise: Simple arithmetic interpreter

(* Given: *)
datatype 'a comp = OK of 'a | ERR of AllErrors.summary 

datatype exp = LIT  of int
             | PLUS of exp * exp
             | DIV  of exp * exp


(* Write an evaluation function that tracks errors. *)

val eval : exp -> int comp = ...

Exercise: LIT and PLUS cases

fun eval (LIT n) = OK n
  | eval (PLUS (e1, e2)) = 
     (case eval e1
        of OK v1 => 
          (case eval e2
             of OK  v2 => OK (v1 + v2)
              | ERR s2 => ERR s2)
       | ERR s1 =>
          (case eval e2
             of OK  _  => ERR s1
              | ERR s2 => ERR (AllErrors.<+> (s1, s2))))

Exercise: DIV case

  | eval (DIV (e1, e2)) = 
     (case eval e1
        of OK v1 =>
          (case eval e2
             of OK   0 => ERR (AllErrors.oneError "Div 0")
              | OK  v2 => OK  (v1 div v2)
              | ERR s2 => ERR s2)
       | ERR s1 =>
           (case eval e2
              of OK  v2 => ERR s1
               | ERR s2 => ERR (AllErrors.<+> (s1, s2)))

That’s really painful!

We can extend the computation abstraction with sequencing operations to help.

Combining generic computations

signature COMPUTATION = sig
  type 'a comp    (* Computation! When run, results in
                     value of type 'a or error summary. *)

  (* A computation without errors always succeeds. *)
  val succeeds : 'a -> 'a comp   

  (* Apply a pure function to a computation. *)
  val <$> : ('a -> 'b) * 'a comp -> 'b comp

  (* Application inside computations. *)
  val <*> : ('a -> 'b) comp * 'a comp -> 'b comp
                      
  (* Computation followed by continuation. *)
  val >>= : 'a comp * ('a -> 'b comp) -> 'b comp
end

Example:

eval e1 + eval e2

(op +) (eval e1, eval e2)

curry (op +) (eval e1) (eval e2)

curry (op +) <$> eval e1 <*> eval e2

The first three versions are not well typed. Why?

The last version will thread errors through the compuation behind the scenes.

Note:

eval e1, eval e2 : int comp

curry (op +) : int -> (int -> int)

<$> : (int -> (int -> int)) * (int comp) -> (int -> int) comp

<*> : (int -> int) comp * int comp -> int comp

curry (op +) <$> eval e1 : (int -> int) comp

let pa = curry (op +) <$> eval e1

note by above,  pa : (int -> int) comp 

pa <*> eval e2  : int comp

{Buckets of algebraic laws}

  succeeds a >>= k  == k a                  // identity
  comp >>= succeeds == comp                 // identity
  comp >>= (fn x => k x >>= h) == (comp >>= k) >>= h  
                                          // associativity
  succeeds f <*> succeeds x == succeeds (f x)  // success
  ...

Environments using ``computation’’

signature COMPENV = sig
  type 'a env   (* environment mapping strings
                   to values of type 'a *)
  type 'a comp  (* computation of 'a or
                   an error summary *)

  val lookup : string * 'a env -> 'a comp
end

Payoff!

functor InterpFn(structure Error : ERROR
                 structure Comp  : COMPUTATION
                 structure Env   : COMPENV
                 val zerodivide  : Error.error
                 val error       : Error.error -> 'a Comp.comp
                 sharing type Comp.comp = Env.comp) =
struct
  val (<*>, <$>, >>=) = (Comp.<*>, Comp.<$>, Comp.>>=)
  
  (* Definition of Interpreter *)

end

Definition of intepreter, continued

datatype exp = LIT of int
             | VAR of string
             | PLUS of exp * exp
             | DIV  of exp * exp
fun eval (e, rho) =
 let fun ev(LIT n) = Comp.succeeds n
       | ev(VAR x) = Env.lookup (x, rho)
       | ev(PLUS (e1, e2)) = curry op + <$> ev e1 <*> ev e2
       | ev(DIV (e1, e2))  = ev e1 >>= (fn n1 =>
                             ev e2 >>= (fn n2 =>
                             case n2
                               of 0 => error zerodivide
                                | _ => Comp.succeeds
                                              (n1 div n2)))
 in  ev e
 end

{Extend a signature with }

signature ERRORCOMP = sig
  include COMPUTATION
  structure Error : ERROR             
  datatype 'a result = OK  of 'a
                     | ERR of Error.summary
  val run : 'a comp -> 'a result      
  val error : Error.error -> 'a comp
end

{Let’s build }

functor ErrorCompFn(structure Error : ERROR) :> 
  ERRORCOMP where type Error.error   = Error.error
              and type Error.summary = Error.summary =
struct
  structure Error = Error
  datatype 'a result = OK  of 'a
                     | ERR of Error.summary

  type 'a comp = 'a result
  fun run comp = comp
 
  fun error e = ERR (Error.oneError e)
  fun succeeds = OK
  ... 
end

27 November 2017: Object-orientation

There are PDF slides for 11/28/2017.

Announcements

SML HW due Sunday, December 3

Last Time

Functors
Extended example: Error-tracking interpretor
“Computation” abstraction

Today

Objects
Message passing

Demo: circle, square, and triangle objects

Circle, Square, and Triangle Objects

Methods:

position: cardinal-point
set-position:to: cardinal-point coordinate
draw

Instructions to student volunteers

You have one instance variable, which represents the coordinate position at the ``center’’ of the object.

Note: Mutable state is back!

Messages

Object 1, adjust your coordinate to place your South control point at (0, 0).

set-position:to: Object1 South (0,0)
Object 1, what is the coordinate position of your North control point?

position: Object1 North
Object 2, adjust your coordinate to place your South control point at (0, 2).

set-position:to: Object2 South (0,2)
Object 2, what is the coordinate position of your North control point?

position: Object2 North

More Messages

Object 3, adjust your coordinate to place your Southwest control point at (0, 4). set-position:to: Object3 Southwest (0,4)
Object 1, draw yourself on the board

draw Object1
Object 2, draw yourself on the board

draw Object2
Object 3, draw yourself on the board

draw Object3

Key concepts of object-orientation

Key mechanisms

Private instance variables

Only object knows its instance variables and can see them
C++ calls these “members”
Like the coordinate of the geometric figure

Code attached to objects and classes

Code needed to draw the object is associated with the object in one of its ``methods.’’

Dynamic dispatch

Methods invoked via message sends (like draw and position:).
Message sender doesn’t know what code will be called.
Object receiving the message send is called the ``receiver’’.
In a method, special variable self is bound to the receiver.

Key idea

Protocol determines behavioral subtyping

The protocol of an object is the set of messages it understands.
Object A is a behavioral subtype of object B if A understands all the messages that B does in a compatible way.
Intuition: If A is a behavioral subtype of B, then A can be used in any context where B can be used.

Class-based object-orientation

Object implementations determined by its class definition.

So, each class implicitly defines the protocol for its objects, and, dynamic dispatch is determined by object’s class.

Code reuse by sending messages around like crazy.

Example: list filter

-> (val ns (new List))
List( )
-> (addAll: ns #(1 2 3 4 5 6))
( 1 2 3 4 5 6 )
-> ns
List( 1 2 3 4 5 6 )
-> (select: ns [block (n) (= 0 (mod: n 2))])
List( 2 4 6 )

Blocks and Booleans

[block (formals) expressions]
For parameterless blocks (normally continuations), {expressions}

Blocks are objects

You don’t “apply” a block; you “send it the value message”

Block Examples

-> (val twice [block (n) (+ n n)])

-> (value twice 3)
6
-> (val delayed {(println #hello) 42})

-> delayed

-> (value delayed)
hello
42

Booleans use continuation-passing style

Blocks delay evaluation

Boolean example: minimum

-> (val x 10)
-> (val y 20)
-> (ifTrue:ifFalse: (<= x y) {x} {y})
10

Booleans implemented with two classes True and False

one value apiece

Method dispatch in the Booleans

Board - Method dispatch

To answer a message:

Consider the class of the receiver
Is the method with that name defined?
If so, use it
If not, repeat with the superclass

Run out of superclasses?

“Message not understood”

Dispatching to

(class True Boolean ()
  (method ifTrue:ifFalse: (trueBlock falseBlock) 
      (value trueBlock))
  ; all other methods are inherited
)

29 November 2017: Inheritance

There are PDF slides for 11/30/2017.

Announcements

Solutions for ty-inf available at the end of class.
Course evaluations!

Send confirmation to comp105-grades@eecs.tufts.edu and we’ll give you class participation credit.

Last Time

Objects
Dynamic Dispatch
Protocols and behavioral subtyping
Blocks

Today

Inheritance
Abstract classes and methods
Object-oriented design
Object initialization

Your turn: Short-circuit

(class Boolean Object
  ()
  ...
  (method not ()          
    (ifTrue:ifFalse: self {false} {true}))
  (method and: (aBlock)
    ...))

Your turn: Short-circuit

(class Boolean Object
  ()
  ...
  (method not ()          
    (ifTrue:ifFalse: self {false} {true}))
  (method and: (aBlock)
    (ifTrue:ifFalse: self aBlock {self})))

History and overview of objects

History of objects

Pioneers were Nygaard and Dahl, who added objects to Algol 60, producing SIMULA-67, the first object-oriented language

Bjarne Stroustrup liked Simula but wanted complete control of costs, so he created C++
James Gosling wanted something a little cleaner and a little more like Simula, created Java
Microsoft funded C#
Objects are everywhere

What’s an object?

We know that mixing code and data can create powerful abstractions (function closures)

Objects are another way to mix code and data

Agglutination containing

Some mutable state (instance variables)
Code that can respond to messages (code is called methods)

A lot like a closure

What are objects good at?

Not really useful for building small programs.

If you build a big, full-featured abstraction, you can use inheritance to build another, similar abstraction.

Very good at adding new kinds of things that behave similarly to existing things.

Programs that are evolving
A particular kind of evolution: operations stay the same, but we add new kinds of things
- Example: GUIs (operations are paint and respond-to-mouse-click)
- Example: numbers

For your homework, you’ll take a Smalltalk system that has three kinds of numbers, and you’ll add a fourth kind of number.

What’s hard about objects?

If you do anything at all interesting, your control flow becomes smeared out over half a dozen classes, and your algorithms are nearly impossible to undrstand.

Smalltalk objects

Why Smalltalk?

Another Turing Award
Small, simple, pure objects
Almost the complete language can be done in a relatively small interpreter
Alive and well today
- At the core of Ruby
- As an extension to C (“Objective C”) for Apple products

The six questions:

Values are objects (even true, 3, "hello")

Even classes are objects!

There are no functions—only methods on objects
Syntax:
- mutable variables
- message send
- sequential composition of mutations and message sends (side effects)
- “blocks” (really closures, objects and closures in one, used as continuations)
- No if or while. These are implemented by passing continuations to Boolean objects.
  (Smalltalk programmers have been indoctrinated and don’t even notice)

Environments
- Name stands for a mutable cell containing an object:
  - Global variables
  - “Instance variables” (new idea, not yet defined)
Types

There is no compile-time type system.

At run time, Smalltalk uses behavioral subtyping, known to Rubyists as “duck typing”
Dynamic semantics
- Main rule is method dispatch (complicated)
- The rest is familiar
The initial basis is enormous
- Why? To demonstrate the benefits of reuse, you need something big enough to reuse.

Message passing

Look at SEND

Message identified by name (messages are not values)
Always sent to a receiver
Optional arguments must match arity of message name
(no other static checking)

N.B. BLOCK and LITERAL are special objects.

Magnitudes and numbers

Key problems on homework

Natural is a Magnitude
“Large integer” is a Number

Implementation of

(class Magnitude Object 
  () ; abstract class
  (method =  (x) (subclassResponsibility self)) 
                    ; may not inherit = from Object
  (method <  (x) (subclassResponsibility self))
  (method >  (y) (< y self))
  (method <= (x) (not (> self x)))
  (method >= (x) (not (< self x)))
  (method min: (aMagnitude)
     (if (< self aMagnitude) {self} {aMagnitude}))
  (method max: (aMagnitude)
     (if (> self aMagnitude) {self} {aMagnitude}))
)

Example class : initialization

(class Fraction Number
    (num den) ;; representation (concrete!)
    (class-method num:den: (a b)
        (initNum:den: (new self) a b))
    (method initNum:den: (a b) ; private
        (setNum:den: self a b)
        (signReduce self)
        (divReduce self))
    (method setNum:den: (a b)
        (set num a) (set den b) self) ; private
    .. other methods of class Fraction ...
)

4 December 2017: Double dispatch, collections

There are PDF slides for 12/5/2017.

Announcements

Smalltalk HW available, due 12/10
Course evaluations!

Send confirmation to comp105-grades@eecs.tufts.edu and we’ll give you class participation credit.

Last Time

Inheritance
Abstract/Concrete classes
Object-oriented design
Object initialization

Today

Information hidden and revealed; three layers
Double dispatch
Subtyping

Making open system extensible

Subtyping

Key strategy for reuse in object-oriented languages: subtype polymorphism

A value of the subtype can be used wherever a value of the supertype is expected.

Board:
- SUBTYPE != SUBCLASS
- SUPERTYPE != SUPERCLASS
Some languages like C++ identify subtype with subclass, but conceptually they are different.
Subtyping relationship can be checked statically (e.g. Java, C++, Scala) or dynamically (e.g. Smalltalk, Ruby)

Bonus content not covered in class: Collections

Why collections?

Goal of objects is reuse

Key to successful reuse is a well-designed class hierarchy

Killer app: toolkits for building user interfaces
Smalltalk blue book is 90 pages on language, 300 pages on library
Lots of abstract classes
- Define protocols
- Build reusable stuff, just like Boolean, Magnitude, Number

Implementing Collections

Question: what’s the most efficient way to find the size of a list?

Question: what’s the most efficient way to find the size of an array?

Example collection - Sets

Most subclass methods work by delegating all or part of work to list members

N.B. Set is a client of List, not a subclass!

Next example highlight: class method and super!

6 December 2017: Lambda Calculus

There are PDF slides for 12/7/2017.

Announcements

Last Time

Information hiding in Smalltalk
“Private methods”
Double Dispatch
Subtyping vs. Inheritance

Today

Lambda Calculus Overview
Programming in the Lambda Calculus
Operational Semantics

Lambda Calculus

Why study lambda calculus?

Theoretical underpinnings for most programming langauges (all in this class).
Church-Turing Thesis: Any computable operator can be expressed as an encoding in lambda calculus
Test bench for new language features

The world’s simplest reasonable programming language

Only three syntactic forms:

M ::= x | \x.M | M M'

Everything is programming with functions

Everything is Curried
Application associates to the left

First example:

(\x.\y.x) M N --> (\y.M) N --> M

Crucial: argument N is never evaluated (could have an infinite loop)

Programming in Lambda Calculus

Everything is continuation-passing style

Q: Who remembers the boolean equation solver?

Q: What classes of results could it produce?

A: success, failure

Q: How were the results delivered?

A: calling continuations

Q: How shall we do Booleans?

A: true continuation, false continuation

Coding Booleans

Booleans take two continuations:

true  = \t.\f.t
false = \t.\f.f

if M then N else P = ???   (* M N P *)

if = \b.\t.\e.b t e

Coding Pairs

How many ways can pairs be created? (A: one, pair constructor)
How many continuations? (A: one, corresponding to the pair)
What information does it expect? (A: the two elements of the pair)
What are the algebraic laws?
- fst (pair x y) = x
- snd (pair x y) = y

Code pair, fst, and snd

 pair x y f = f x y
 fst p = p (\x.\y.x)
 snd p = p (\x.\y.y)

 pair = \x.\y.\f.f x y
 fst  = \p.p (\x.\y.x)
 snd  = \p.p (\x.\y.y)

Coding Lists

How many ways can lists be created? (A: two, nil and cons)
How many continuations? (A: two, one for each)
What does each continuation expect? (A: nil - nothing; cons - hd, tl)

For each creator, what are the laws regarding its continuations?

cons y ys n c = c y ys
nil       n c = n

car xs = xs error (\y.\ys.y)
cdr xs = xs error (\y.\ys.ys)

null? xs = xs true (\y.\ys.false)


cons = \y.\ys.\n.\c.c y ys
nil  = \n.\c.n

car = \xs.xs error (\y.\ys.y)
cdr = \xs.xs error (\y.\ys.ys)

null? = \xs.xs true (\y.\ys.false)

Coding numbers: Church Numerals

Wikipedia good: “Church numerals”

Key Idea: The value of a numeral is the number of times it applies its argument function.

Church Numerals in λ

zero  = \f.\x.x;
succ  = \n.\f.\x.f (n f x);
plus  = \n.\m.n succ m;
times = \n.\m.n (plus m) zero;
 ...
-> four;
\f.\x.f (f (f (f x)))
-> three;
\f.\x.f (f (f x))
-> times four three;
\f.\x.f (f (f (f (f (f (f (f (f (f (f (f x)))))))))))

Taking stock:

bools
pairs
lists
numbers

Question: What’s missing from this picture?

Answer: Recursive functions.

Astonishing fact: we don’t need letrec or val-rec

The Y-combinator = \f.(\x.f (x x))(\x.f (x x)) can encode recursion.

Operational semantics of lambda calculus

New kind of semantics: small-step

New judgment form

M --> N   ("M reduces to N in one step")

No context!! No turnstile!!

Just pushing terms around == calculus

Beta-reduction

The substitution in the beta rule is the heart of the lambda calculus

It’s hard to get right
It’s a stupid design for real programming (shell, tex, tcl)
It’s even hard for theorists!
But it’s the simplest known thing

Board examples:

Are these functions the same?
```
 \x.\y.x
 \w.\z.w
```
Are these functions the same?
```
 \x.\y.z
 \w.\z.z
```

Examples of free variables:

\x. + x y        (* y is free *)

\x.\y. x         (* nothing is free *)

Exercise: Free Variables


What are the free variables in:

  \x.\y. y z
  \x.x (\y.x)
  \x.\y.\x.x y
  \x.\y.x (\z.y w)
  y (\x.z)
  (\x.\y.x y) y

Exercise: Free Variables

What are the free variables in:

  \x.\y. y z           - z
  \x.x (\y.x)          - nothing
  \x.\y.\x.x y         - nothing
  \x.\y.x (\z.y w)     - w
  y (\x.z)             - y z
  (\x.\y.x y) y        - y

Example:

(\yes.\no.yes)(\time.no) ->
\z.\time.no

and never

\no.\time.no    // WRONG!!!!!!

Must rename the bound variable:

(\yes.\no.yes) (\time.no) tuesday
   ->   
(\yes.\z.yes)  (\time.no) tuesday
   ->  
(\z.\time.no)  tuesday
   ->
\time.no

Summary

Lambda calculus is Turing Complete
Essence of most programming languages
Evaluation proceeds by substituting arguments for formal variables (called beta reduction)
- Definition of free variables
- Alpha-conversion allows bound variables to be renamed.

11 December 2017: Comp 105 Conclusion

There are PDF slides for 12/12/2017.

Announce

Final: Thursday, December 14, 8:30 - 10:30 in Barnum 08 (class)
May bring one double-sided page of notes
Online evaluations: Send screen-shot to comp105-grades for participation credit.

Last Time

Lambda calculus
Bools, pairs, lists, numbers
Y-combinator supports recursion
Church-Turing thesis
β-reduction
α- conversion
definitions of free and bound variables.

Today

What have we done (mostly) since the midterm?
Where might you go from here?
Your questions!
Class Feedback

What have we done?

Type Systems

Type Systems: Big Idea

Static vs. Dynamic Typing

Expressiveness (+ Dynamic)
Don’t have to worry about types (+ Dynamic)
Dependent on input (- Dynamic)
Runtime overhead (- Dynamic)
Serve as documentation (+ Static)
Catch errors at compile time (+ Static)
Used in optimization (+ Static)

Type Systems: Big Idea

Undecideability forces tradeoff:
- Dynamic or
- Approximate or
- Non-terminating
Example: array bounds checking
- Occasional negative consequences: e.g., Heartbleed

Type Systems: Mechanics

Monomorphic and Polymorphic Types
Types, Type Constructors, Quantified Types (∀α.τ)
Kinds (κ) classify types:
- well-formed,
- types (*),
- type constructors: κ ⇒ κ
Type Environments: type identifiers → kinds
Typing Rules
- Introduction and Elimination forms
Type Checking
Induction and Recursion

Hindley-Milner Type Inference: Big Idea

Inferred vs Declared Types
- Advantages of Inference: write fewer types, infer more general types.
- Advantages of Declarations: better documentations, more general type system.
Canonical example of static analysis:
- Proving properties of programs based only on text of program.
- Useful for compilers and security analysis.

Hindley-Milner Type Inference: Mechanics

Use fresh type variables to represent unknown types
Generate constraints that collect clues
Solve constraints just before introducing quantifiers
Compromises to preserve decideability:
- Only generalize lets and top-level declarations
- Polymorphic functions aren’t first-class

Module Systems

Module Systems a la SML: Big Ideas

“Programming-in-the-large”
Separate implementation from interface
Enforced modularity
- Swap implementations without breaking client code

Module Systems a la SML: Mechanics

Signatures describe interfaces
- types, values, exceptions, substructures
- include to extend
Structures provide implementations
Signature ascription hides structure contents (Heap :> HEAP)
Functors
- Functions over structures
- Executed at compile time

Object-Oriented Programming

Object-Oriented Programming: Big Ideas

“Programming-in-the-medium”
Advantages and Disadvantages
- Enables code reuse
- Easy to add new kinds of objects
- Hard to add new operations
- Algorithms smeared across many classes
- Hard to know what code is executing
Good match for GUI programming
Smalltalk mantra: Everything is an Object
- Can redefine basic operations

Object-Oriented Programming: Mechanics

Classes and objects
Computation via sending messages
Double-dispatch
Inheritance for implementation reuse
Subtyping (“duck typing”) for client code reuse
Subtyping is not Inheritance
self and super
Blocks to code anonymous functions & continuations

Lambda Calculus

Lambda Calculus: Big Ideas

Three forms:
e : : = x | λx.e | e₁e₂
Church-Turing Thesis:
- All computable functions expressable in lambda calculus
- booleans, pairs, lists, naturals, recursion, …

Lambda Calculus: Mechanics

Bound vs. Free variables
α-conversion: Names of bound variables don’t matter.
β-reduction: Models computation.
Capture-avoiding substitution (Why important?)
Recursion via fixed points
Y combinator calculates fixed points:
- Y = λf.(λx.f(x x))(λx.f(x x))

Programming Experience

Recursion and higher-order functions are now second-nature to you.
- You’ll miss pattern matching and algebraic data types in any language you use that doesn’t have them!
C for impcore (imperative language)
Scheme (dynamically typed functional language)
ML (statically typed functional language)
uSmalltalk (dynamically typed OO language)

Built substantial pieces of code

SAT solver using continuations
Type checker (ML pattern matching!)
Type inference system (using constraints, reading typing rules)
Game solver (SML module system)
BigNums (Power of OO abstractions; resulting challenges)

Where might you go from here?

Haskell

At the research frontier: Still evolving.
Lazy:
- Expressions only evaluated when needed.
- Conflict with side-effects.
- Solution: Monads (computation abstraction)
Type Classes:
- Ad hoc polymorphism (aka, overloading)
- ML: Hard-wire certain operations (+, *)
- Haskell: User programmable.

Prolog

Based on logic.
Performs proof search over inference rules.
Can leave “blanks” and ask the system to figure out what they must be.

Ruby

If you liked smalltalk.

Additional Courses

Compilers
Special Topics:
- Domain-specific Languages
- Probabilisitic Programming Languages
- Advanced Functional Programming

Big-picture questions?

Studying for the Exam

Exam will be like midterm
Expect to write some code (SML, uSmalltalk) 2ex
Review homework assignments
Review recitation materials
Make sure you understand Big Ideas/Tradeoffs

Feedback

Course feedback

In future courses

What should we keep the same?
How can we improve?

Congratulations!

You have learned an amazing amount.
You have really impressed me.
Good luck on the exam!

Lecture notes for COMP 105 (Programming Languages)

6 September 2017: Introduction to Comp 105

Announcements

Overview

Why so many languages?

What this course isn’t

What this course is:

What are reusable principles?

What Programming Languages are, technically

What can you get out of Comp 105?

Great languages begin with great features

How will we study language features?

Common Framework

Course logistics and administration

Books

Homework

The role of lectures

Recitations

Questions and answers on Piazza

Other policies and procedures on the web

What am I called?

ImpCore: The first language in our common framework

Exercise: all-fours?

Scoping rules for Impcore

Recursion: a review

11 September 2017: Introduction to Semantics

Programming-language semantics

Why bother with precise semantics?

Behavior decomposes

ASTs

13 September 2017: Semantics, Syntactic Proofs, Metatheory

Announcements

Today

Last Time

Using Operational Semantics

From rules to proofs

Example derivation (rules in handout)

Building derivations

Proofs about all derivations: Metatheory

18 September 2017: Metatheory wrapup. Intro to functional programming

Announcements

Today

Last Time

Where are we going?

Recursion comes from inductive structure of input

For a new language, five powerful questions

Introduction to Scheme

Graphically:

Scheme Values

S-Expression operators

Your turn!

20 September 2017: More Scheme

Announcements

Today

Last Time

Lists

Subset of S-Expressions.

Why are lists useful?

Immutable data structures

Review: Algebraic laws of lists

Recursive functions for recursive types

Algebraic laws to design list functions

Example: Append

Cost model

Induction Principle for List(Z)

Example: The cost of append

Example: list reversal

The method of accumulating parameters

Linear reverse, graphically

25 September 2017: Let and Lambda

Announcements

Last Time

Today

Association lists represent finite maps [Not covered in class]

Algebraic laws of association lists

Handy new feature of Scheme: let binding

From Impcore to uScheme

Anonymous, first-class functions

First-class, nested functions

Your turn!!

Example: The cost of `append`

Handy new feature of Scheme: `let` binding

List search: `exists?`

Your turn: Does everything match: `all?`

List selection: `filter`

“Lifting” functions to lists: `map`

The universal list function: `fold`

In-class exercise: Folding `combine?`