Jump to content

Fixed-point combinator

From Wikipedia, the free encyclopedia
(Redirected from Y operator)

In combinatory logic for computer science, a fixed-point combinator (or fixpoint combinator),[1]: p.26  is a higher-order function (i.e. a function which takes a function as argument) that returns some fixed point (a value that is mapped to itself) of its argument function, if one exists.

Formally, if is a fixed-point combinator and the function has one or more fixed points, then is one of these fixed points, i.e.

Fixed-point combinators can be defined in the lambda calculus and in functional programming languages and provide a means to allow for recursive definitions.

Y combinator in lambda calculus

[edit]

In the classical untyped lambda calculus, every function has a fixed point. A particular implementation of is Haskell Curry's paradoxical combinator Y, given by[2]: 131 [note 1][note 2]

(Here we use the standard notations and conventions of lambda calculus: Y is a function that takes one argument f and returns the entire expression following the first period; the expression denotes a function that takes one argument x, thought of as a function, and returns the expression , where denotes x applied to itself. Juxtaposition of expressions denotes function application, is left-associative, and has higher precedence than the period.)

Verification

[edit]

The following calculation verifies that is indeed a fixed point of the function :

by the definition of
by β-reduction: replacing the formal argument f of Y with the actual argument g
by β-reduction: replacing the formal argument x of the first function with the actual argument
by second equality, above

The lambda term may not, in general, β-reduce to the term . However, both terms β-reduce to the same term, as shown.

Uses

[edit]

Applied to a function with one variable, the Y combinator usually does not terminate. More interesting results are obtained by applying the Y combinator to functions of two or more variables. The additional variables may be used as a counter, or index. The resulting function behaves like a while or a for loop in an imperative language.

Used in this way, the Y combinator implements simple recursion. The lambda calculus does not allow a function to appear as a term in its own definition as is possible in many programming languages, but a function can be passed as an argument to a higher-order function that applies it in a recursive manner.

The Y combinator may also be used in implementing Curry's paradox. The heart of Curry's paradox is that untyped lambda calculus is unsound as a deductive system, and the Y combinator demonstrates this by allowing an anonymous expression to represent zero, or even many values. This is inconsistent in mathematical logic.

Example implementations

[edit]

An example implementation of Y in the language R is presented below:

Y <- \(f) {
  g <- \(x) f(x(x))
  g(g)
}

This can then be used to implement factorial as follows:

fact <- \(f) \(n)
  if (n == 0) 1 else n * f(n - 1)
  
Y(fact)(5) # yields 5! = 120

Y is only needed when we do not have function names. Substituting all the definitions into one line so that function names are not required gives:

(\(f) (\(x) f(x(x)))(\(x) f(x(x)))) (\(f) \(n) if (n == 0) 1 else n * f(n - 1)) (5)

This works because R uses lazy evaluation.

Languages that use strict evaluation, such as Python, C++, and others, can often express Y; however, any implementation is useless in practice since it loops indefinitely until terminating through stack overflow.

Fixed-point combinator

[edit]

The Y combinator is an implementation of a fixed-point combinator in lambda calculus. Fixed-point combinators may also be easily defined in other functional and imperative languages. The implementation in lambda calculus is more difficult due to limitations in lambda calculus. The fixed-point combinator may be used in a number of different areas:

Fixed-point combinators may be applied to a range of different functions, but normally will not terminate unless there is an extra parameter. When the function to be fixed refers to its parameter, another call to the function is invoked, so the calculation never gets started. Instead, the extra parameter is used to trigger the start of the calculation.

The type of the fixed point is the return type of the function being fixed. This may be a real or a function or any other type.

In the untyped lambda calculus, the function to apply the fixed-point combinator to may be expressed using an encoding, like Church encoding. In this case particular lambda terms (which define functions) are considered as values. "Running" (beta reducing) the fixed-point combinator on the encoding gives a lambda term for the result which may then be interpreted as fixed-point value.

Alternately, a function may be considered as a lambda term defined purely in lambda calculus.

These different approaches affect how a mathematician and a programmer may regard a fixed-point combinator. A mathematician may see the Y combinator applied to a function as being an expression satisfying the fixed-point equation, and therefore a solution.

In contrast, a person only wanting to apply a fixed-point combinator to some general programming task may see it only as a means of implementing recursion.

Values and domains

[edit]

Many functions do not have any fixed points, for instance with . Using Church encoding, natural numbers can be represented in lambda calculus, and this function f can be defined in lambda calculus. However, its domain will now contain all lambda expression, not just those representing natural numbers. The Y combinator, applied to f, will yield a fixed-point for f, but this fixed-point won't represent a natural number. If trying to compute Y f in an actual programming language, an infinite loop will occur.

Function versus implementation

[edit]

The fixed-point combinator may be defined in mathematics and then implemented in other languages. General mathematics defines a function based on its extensional properties.[3] That is, two functions are equal if they perform the same mapping. Lambda calculus and programming languages regard function identity as an intensional property. A function's identity is based on its implementation.

A lambda calculus function (or term) is an implementation of a mathematical function. In the lambda calculus there are a number of combinators (implementations) that satisfy the mathematical definition of a fixed-point combinator.

Definition of the term "combinator"

[edit]

Combinatory logic is a higher-order functions theory. A combinator is a closed lambda expression, meaning that it has no free variables. The combinators may be combined to direct values to their correct places in the expression without ever naming them as variables.

Recursive definitions and fixed-point combinators

[edit]

Fixed-point combinators can be used to implement recursive definition of functions. However, they are rarely used in practical programming.[4] Strongly normalizing type systems such as the simply typed lambda calculus disallow non-termination and hence fixed-point combinators often cannot be assigned a type or require complex type system features. Furthermore fixed-point combinators are often inefficient compared to other strategies for implementing recursion, as they require more function reductions and construct and take apart a tuple for each group of mutually recursive definitions.[1]: page 232 

The factorial function

[edit]

The factorial function provides a good example of how a fixed-point combinator may be used to define recursive functions. The standard recursive definition of the factorial function in mathematics can be written as

where n is a non-negative integer. If we want to implement this in lambda calculus, where integers are represented using Church encoding, we run into the problem that the lambda calculus does not allow the name of a function ('fact') to be used in the function's definition. This can be circumvented using a fixed-point combinator as follows.

Define a function F of two arguments f and n:

(Here is a function that takes two arguments and returns its first argument if n=0, and its second argument otherwise; evaluates to n-1.)

Now define . Then is a fixed-point of F, which gives

as desired.

Fixed-point combinators in lambda calculus

[edit]

The Y combinator, discovered by Haskell B. Curry, is defined as

Other fixed-point combinators

[edit]

In untyped lambda calculus fixed-point combinators are not especially rare. In fact there are infinitely many of them.[5] In 2005 Mayer Goldberg showed that the set of fixed-point combinators of untyped lambda calculus is recursively enumerable.[6]

The Y combinator can be expressed in the SKI-calculus as

Additional combinators (B, C, K, W system) allow for a much shorter definition. With the self-application combinator, since and , the above becomes

The simplest fixed-point combinator in the SK-calculus, found by John Tromp, is

although note that it is not in normal form, which is longer. This combinator corresponds to the lambda expression

The following fixed-point combinator is simpler than the Y combinator, and β-reduces into the Y combinator; it is sometimes cited as the Y combinator itself:

Another common fixed-point combinator is the Turing fixed-point combinator (named after its discoverer, Alan Turing):[7][2]: 132 

Its advantage over is that beta-reduces to ,[note 3] whereas and only beta-reduce to a common term.

also has a simple call-by-value form:

The analog for mutual recursion is a polyvariadic fix-point combinator,[8][9][10] which may be denoted Y*.

Strict fixed-point combinator

[edit]

In a strict programming language the Y combinator will expand until stack overflow, or never halt in case of tail call optimization.[11] The Z combinator will work in strict languages (also called eager languages, where applicative evaluation order is applied). The Z combinator has the next argument defined explicitly, preventing the expansion of in the right-hand side of the definition:[12]

and in lambda calculus it is an eta-expansion of the Y combinator:

Non-standard fixed-point combinators

[edit]

If F is a fixed-point combinator in untyped lambda calculus, then we have

Terms that have the same Böhm tree as a fixed-point combinator, i.e. have the same infinite extension , are called non-standard fixed-point combinators. Any fixed-point combinator is also a non-standard one, but not all non-standard fixed-point combinators are fixed-point combinators because some of them fail to satisfy the fixed-point equation that defines the "standard" ones. These combinators are called strictly non-standard fixed-point combinators; an example is the following combinator:

where

The set of non-standard fixed-point combinators is not recursively enumerable.[6]

Implementation in other languages

[edit]

The Y combinator is a particular implementation of a fixed-point combinator in lambda calculus. Its structure is determined by the limitations of lambda calculus. It is not necessary or helpful to use this structure in implementing the fixed-point combinator in other languages.

Simple examples of fixed-point combinators implemented in some programming paradigms are given below.

Lazy functional implementation

[edit]

In a language that supports lazy evaluation, like in Haskell, it is possible to define a fixed-point combinator using the defining equation of the fixed-point combinator which is conventionally named fix. Since Haskell has lazy datatypes, this combinator can also be used to define fixed points of data constructors (and not only to implement recursive functions). The definition is given here, followed by some usage examples. In Hackage, the original sample is: [13]

fix, fix' :: (a -> a) -> a
fix f = let x = f x in x         -- Lambda dropped. Sharing.
                                 -- Original definition in Data.Function.
-- alternative:
fix' f = f (fix' f)              -- Lambda lifted. Non-sharing.

fix (\x -> 9)                    -- this evaluates to 9

fix (\x -> 3:x)                  -- evaluates to the lazy infinite list [3,3,3,...]

fact = fix fac                   -- evaluates to the factorial function
  where fac f 0 = 1
        fac f x = x * f (x-1)

fact 5                           -- evaluates to 120

Strict functional implementation

[edit]

In a strict functional language, as illustrated below with OCaml, the argument to f is expanded beforehand, yielding an infinite call sequence,

.

This may be resolved by defining fix with an extra parameter.

let rec fix f x = f (fix f) x (* note the extra x; here fix f = \x-> f (fix f) x *)

let factabs fact = function   (* factabs has extra level of lambda abstraction *)
   0 -> 1
 | x -> x * fact (x-1)

let _ = (fix factabs) 5       (* evaluates to "120" *)

In a multi-paradigm functional language (one decorated with imperative features), such as Lisp, Peter Landin suggested the use of a variable assignment to create a fixed-point combinator,[14] as in the below example using Scheme:

(define Y!
  (lambda (f)
    ((lambda (i)                       
       (set! i (f (lambda (x) (i x)))) ;; (set! i expr) assigns i the value of expr
       i)                              ;; replacing it in the present lexical scope
     #f)))

Using a lambda calculus with axioms for assignment statements, it can be shown that Y! satisfies the same fixed-point law as the call-by-value Y combinator:[15][16]

In more idiomatic modern Lisp usage, this would typically be handled via a lexically scoped label (a let expression), as lexical scope was not introduced to Lisp until the 1970s:

(define Y*
  (lambda (f)
    ((lambda (i)
       (let ((i (f (lambda (x) (i x))))) ;; (let ((i expr)) i) locally defines i as expr
	     i))                             ;; non-recursively: thus i in expr is not expr
     #f)))

Or without the internal label:

(define Y
  (lambda (f)
    ((lambda (i) (i i))
     (lambda (i)
       (f (lambda (x)
	        (apply (i i) x)))))))

Imperative language implementation

[edit]

This example is a slightly interpretive implementation of a fixed-point combinator. A class is used to contain the fix function, called fixer. The function to be fixed is contained in a class that inherits from fixer. The fix function accesses the function to be fixed as a virtual function. As for the strict functional definition, fix is explicitly given an extra parameter x, which means that lazy evaluation is not needed.

template <typename R, typename D>
class fixer
{
public:
    R fix(D x)
    {
        return f(x);
    }
private:
    virtual R f(D) = 0;
};

class fact : public fixer<long, long>
{
    virtual long f(long x)
    {
        if (x == 0)
        {
            return 1;
        }
        return x * fix(x-1);
    }
};

long result = fact().fix(5);

Another example can be shown to demonstrate SKI combinator calculus (with given bird name from Combinatory logic) being used to build up Z combinator to achieve Tail call-like behavior through trampolining:

var K = a => b => a; // Kestrel
var S = a => b => c => a(c)(b(c)); // Starling
var I = S(K)(K); // Identity
var B = S(K(S))(K); // Bluebird
var C = S(B(B)(S))(K(K)); // Cardinal
var W = C(S)(I); // Warbler
var T = C(I); // Thrush
var V = B(C)(T); // Vireo
var I_ = C(C(I)); // Identity Bird Once Removed; same as C(B(B)(I))(I)
var C_ = B(C); // Cardinal Once Removed
var R_ = C_(C_); // Robin Once Removed
var V_ = B(R_)(C_); // Vireo Once Removed
var I__ = R_(V); // Identity Bird Twice Removed
var Z = B(W(I_))(V_(B)(W(I__)));

var Z2 = S(K(S(S(K(S(S(K)(K))(S(K)(K))))(S(K(S(K(S))(K)))(S(K(S(S(K)(K))))(K))))(K(S(S(K))))))(S(S(K(S(S(K(S(K(S))(K)))(S))(K(K))))(S(K(S(S(K(S(K(S))(K)))(S))(K(K))))(S(K(S))(K))))(K(S(S(K(S(S(K)(K))(S(K)(K))))(S(K(S(K(S))(K)))(S(K(S(S(K)(K))))(K))))(K(S(K(S(S(K(S(S(K(S(K(S))(K)))(S))(K(K))))(S(K(S(S(K(S(K(S))(K)))(S))(K(K))))(S(K(S(S(K)(K))))(K))))))(K))))));
	// Alternative fully expanded form.

var Z3 = S(S(K(S(S)(K(S(S(K)(K))(S(K)(K))))))(K))(S(S(K(S))(K))(K(S(S(K(S))(S(K(S(K(S(K(S(K(S(S)(K(K))))(K)))(S)))(S(S(K)(K)))))(K)))(K(K(S(S(K)(K))(S(K)(K))))))));
	// Another shorter version.

var trampoline = fn => {
	let ctx = fn;
	while (ctx instanceof Function)
		ctx = ctx();
	return ctx;
};

var count_fn = self => n =>
	(console.log(n), n === 0)
		? n
		: () => self(n - 1); // Return thunk "() => self(n - 1)" instead.

trampoline(Z(count_fn)(10));
trampoline(Z2(count_fn)(10));
trampoline(Z3(count_fn)(10));

Typing

[edit]

In System F (polymorphic lambda calculus) a polymorphic fixed-point combinator has type[17]

∀a.(a → a) → a

where a is a type variable. That is, fix takes a function, which maps a → a and uses it to return a value of type a.

In the simply typed lambda calculus extended with recursive data types, fixed-point operators can be written, but the type of a "useful" fixed-point operator (one whose application always returns) may be restricted.

In the simply typed lambda calculus, the fixed-point combinator Y cannot be assigned a type[18] because at some point it would deal with the self-application sub-term by the application rule:

where has the infinite type . No fixed-point combinator can in fact be typed; in those systems, any support for recursion must be explicitly added to the language.

Type for the Y combinator

[edit]

In programming languages that support recursive data types, it is possible to type the Y combinator by appropriately accounting for the recursion at the type level. The need to self-apply the variable x can be managed using a type (Rec a), which is defined so as to be isomorphic to (Rec a -> a).

For example, in the following Haskell code, we have In and out being the names of the two directions of the isomorphism, with types:[19][20]

In :: (Rec a -> a) -> Rec a
out :: Rec a -> (Rec a -> a)

which lets us write:

newtype Rec a = In { out :: Rec a -> a }

y :: (a -> a) -> a
y = \f -> (\x -> f (out x x)) (In (\x -> f (out x x)))

Or equivalently in OCaml:

type 'a recc = In of ('a recc -> 'a)
let out (In x) = x

let y f = (fun x a -> f (out x x) a) (In (fun x a -> f (out x x) a))

Alternatively:

let y f = (fun x -> f (fun z -> out x x z)) (In (fun x -> f (fun z -> out x x z)))

General information

[edit]

Because fixed-point combinators can be used to implement recursion, it is possible to use them to describe specific types of recursive computations, such as those in fixed-point iteration, iterative methods, recursive join in relational databases, data-flow analysis, FIRST and FOLLOW sets of non-terminals in a context-free grammar, transitive closure, and other types of closure operations.

A function for which every input is a fixed point is called an identity function. Formally:

In contrast to universal quantification over all , a fixed-point combinator constructs one value that is a fixed point of . The remarkable property of a fixed-point combinator is that it constructs a fixed point for an arbitrary given function .

Other functions have the special property that, after being applied once, further applications don't have any effect. More formally:

Such functions are called idempotent (see also Projection (mathematics)). An example of such a function is the function that returns 0 for all even integers, and 1 for all odd integers.

In lambda calculus, from a computational point of view, applying a fixed-point combinator to an identity function or an idempotent function typically results in non-terminating computation. For example, we obtain

where the resulting term can only reduce to itself and represents an infinite loop.

Fixed-point combinators do not necessarily exist in more restrictive models of computation. For instance, they do not exist in simply typed lambda calculus.

The Y combinator allows recursion to be defined as a set of rewrite rules,[21] without requiring native recursion support in the language.[22]

In programming languages that support anonymous functions, fixed-point combinators allow the definition and use of anonymous recursive functions, i.e. without having to bind such functions to identifiers. In this setting, the use of fixed-point combinators is sometimes called anonymous recursion.[note 4][23]

See also

[edit]

Notes

[edit]
  1. ^ Throughout this article, the syntax rules given in Lambda calculus#Notation are used, to save parentheses.
  2. ^ According to Barendregt p.132, the name originated from Curry.
  3. ^
  4. ^ This terminology appears to be largely folklore, but it does appear in the following:
    • Trey Nash, Accelerated C# 2008, Apress, 2007, ISBN 1-59059-873-3, p. 462—463. Derived substantially from Wes Dyer's blog (see next item).
    • Wes Dyer Anonymous Recursion in C#, February 02, 2007, contains a substantially similar example found in the book above, but accompanied by more discussion.

References

[edit]
  1. ^ a b Peyton Jones, Simon L. (1987). The Implementation of Functional Programming (PDF). Prentice Hall International.
  2. ^ a b Henk Barendregt (1985). The Lambda Calculus – Its Syntax and Semantics. Studies in Logic and the Foundations of Mathematics. Vol. 103. Amsterdam: North-Holland. ISBN 0444867481.
  3. ^ Selinger, Peter. "Lecture Notes on Lambda Calculus (PDF)". p. 6.
  4. ^ "For those of us who don't know what a Y-Combinator is or why it's useful, ..." Hacker News. Retrieved 2 August 2020.
  5. ^ Bimbó, Katalin (27 July 2011). Combinatory Logic: Pure, Applied and Typed. CRC Press. p. 48. ISBN 9781439800010.
  6. ^ a b Goldberg, 2005
  7. ^ Alan Mathison Turing (December 1937). "The -function in --conversion". Journal of Symbolic Logic. 2 (4): 164. JSTOR 2268281.
  8. ^ "Many faces of the fixed-point combinator". okmij.org.
  9. ^ Polyvariadic Y in pure Haskell98 Archived 2016-03-04 at the Wayback Machine, lang.haskell.cafe, October 28, 2003
  10. ^ "recursion - Fixed-point combinator for mutually recursive functions?". Stack Overflow.
  11. ^ Bene, Adam (17 August 2017). "Fixed-Point Combinators in JavaScript". Bene Studio. Medium. Retrieved 2 August 2020.
  12. ^ "CS 6110 S17 Lecture 5. Recursion and Fixed-Point Combinators" (PDF). Cornell University. 4.1 A CBV Fixed-Point Combinator.
  13. ^ The original definition in Data.Function.
  14. ^ Landin, P. J. (January 1964). "The mechanical evaluation of expressions". The Computer Journal. 6 (4): 308–320. doi:10.1093/comjnl/6.4.308.
  15. ^ Felleisen, Matthias (1987). The Lambda-v-CS Calculus. Indiana University.
  16. ^ Talcott, Carolyn (1985). The Essence of Rum: A theory of the intensional and extensional aspects of Lisp-type computation (Ph.D. thesis). Stanford University.
  17. ^ Girard, Jean-Yves (1986). "The system F of variable types, fifteen years later". Theoretical Computer Science. 45 (2): 159–192. doi:10.1016/0304-3975(86)90044-7. MR 0867281. See in particular p. 180.
  18. ^ An Introduction to the Lambda Calculus Archived 2014-04-08 at the Wayback Machine
  19. ^ Haskell mailing list thread on How to define Y combinator in Haskell, 15 Sep 2006
  20. ^ Geuvers, Herman; Verkoelen, Joep. On Fixed point and Looping Combinators in Type Theory. CiteSeerX 10.1.1.158.1478.
  21. ^ Daniel P. Friedman, Matthias Felleisen (1986). "Chapter 9 - Lambda The Ultimate". The Little Lisper. Science Research Associates. p. 179. "In the chapter we have derived a Y-combinator which allows us to write recursive functions of one argument without using define."
  22. ^ Mike Vanier. "The Y Combinator (Slight Return) or: How to Succeed at Recursion Without Really Recursing". Archived from the original on 2011-08-22. "More generally, Y gives us a way to get recursion in a programming language that supports first-class functions but that doesn't have recursion built in to it."
  23. ^ The If Works Deriving the Y combinator, January 10th, 2008
[edit]