Model theory

In mathematics, more precisely in mathematical logic, model theory is the study of the relationship between formal theories (a collection of sentences in a formal language expressing statements about a mathematical structure), and their models, taken as interpretations that satisfy the sentences of that theory.[1] The aspects investigated include the number and size of models of a theory, the relationship of different models to each other, and their interaction with the formal language itself. In particular, model theorists also investigate the sets that can be defined in a model of a theory, and the relationship of such definable sets to each other. As a separate discipline, model theory goes back to Alfred Tarski, who first used the term "Theory of Models" in publication in 1954.[2] Since the 1970s, the subject has been shaped decisively by Saharon Shelah's stability theory. The relative emphasis placed on the class of models of a theory as opposed to the class of definable sets within a model fluctuated in the history of the subject, and the two directions are summarised by the pithy characterisations from 1973 and 1997 respectively:

model theory = universal algebra + logic[3]

where universal algebra stands for mathematical structures and logic for logical theories; and

model theory = algebraic geometryfields.

where logical formulas are to definable sets what equations are to varieties over a field.[4]

Nonetheless, the interplay of classes of models and the sets definable in them has been crucial to the development of model theory throughout its history. For instance, while stability was originally introduced to classify theories by their numbers of models in a given cardinality, stability theory proved crucial to understanding the geometry of definable sets.

Compared to other areas of mathematical logic such as proof theory, model theory is often less concerned with formal rigour and closer in spirit to classical mathematics. This has prompted the comment that "if proof theory is about the sacred, then model theory is about the profane".[5] The applications of model theory to algebraic and diophantine geometry reflect this proximity to classical mathematics, as they often involve an integration of algebraic and model-theoretic results and techniques.

The most prominent scholarly organization in the field of model theory is the Association for Symbolic Logic.

BranchesEdit

This page focuses on finitary first order model theory of infinite structures. Finite model theory, which concentrates on finite structures, diverges significantly from the study of infinite structures in both the problems studied and the techniques used. Model theory in higher-order logics or infinitary logics is hampered by the fact that completeness and compactness do not in general hold for these logics. However, a great deal of study has also been done in such logics.

Informally, model theory can be divided into classical model theory, model theory applied to groups and fields, and geometric model theory. A missing subdivision is computable model theory, but this can arguably be viewed as an independent subfield of logic.

Examples of early theorems from classical model theory include Gödel's completeness theorem, the upward and downward Löwenheim–Skolem theorems, Vaught's two-cardinal theorem, Scott's isomorphism theorem, the omitting types theorem, and the Ryll-Nardzewski theorem. Examples of early results from model theory applied to fields are Tarski's elimination of quantifiers for real closed fields, Ax's theorem on pseudo-finite fields, and Robinson's development of non-standard analysis. An important step in the evolution of classical model theory occurred with the birth of stability theory (through Morley's theorem on uncountably categorical theories and Shelah's classification program), which developed a calculus of independence and rank based on syntactical conditions satisfied by theories.

During the last several decades applied model theory has repeatedly merged with the more pure stability theory. The result of this synthesis is called geometric model theory in this article (which is taken to include o-minimality, for example, as well as classical geometric stability theory). An example of a proof from geometric model theory is Hrushovski's proof of the Mordell–Lang conjecture for function fields. The ambition of geometric model theory is to provide a geography of mathematics by embarking on a detailed study of definable sets in various mathematical structures, aided by the substantial tools developed in the study of pure model theory.

Fundamental notions of first-order model theoryEdit

First-order logicEdit

A first-order formula is built out of atomic formulas such as R(f(x,y),z) or y = x + 1 by means of the Boolean connectives   and prefixing of quantifiers   or  . A sentence is a formula in which each occurrence of a variable is in the scope of a corresponding quantifier. Examples for formulas are φ (or φ(x) to mark the fact that at most x is an unbound variable in φ) and ψ defined as follows:

 
 

(Note that the equality symbol has a double meaning here.) It is intuitively clear how to translate such formulas into mathematical meaning. In the σsmr-structure   of the natural numbers, for example, an element n satisfies the formula φ if and only if n is a prime number. The formula ψ similarly defines irreducibility. Tarski gave a rigorous definition, sometimes called "Tarski's definition of truth", for the satisfaction relation  , so that one easily proves:

  is a prime number.
  is irreducible.

A set T of sentences is called a (first-order) theory. A theory is satisfiable if it has a model  , i.e. a structure (of the appropriate signature) which satisfies all the sentences in the set T. A complete theory is a theory that contains every sentence or its negation. The complete theory of all sentences satisfied by a structure is also called the theory of that structure.

Gödel's completeness theorem (not to be confused with his incompleteness theorems) says that a theory has a model if and only if it is consistent, i.e. no contradiction is proved by the theory. Therefore, model theorists often use "consistent" as a synonym for "satisfiable".

Basic model-theoretic conceptsEdit

A substructure   of a σ-structure   is a subset of its domain, closed under all functions in its signature σ, which is regarded as a σ-structure by restricting all functions and relations in σ to the subset. This generalises the analogous concepts from algebra; For instance, a subgroup is a substructure in the signature with multiplication and inverse.

A substructure is said to be elementary if for any first-order formula φ and any elements a1, ..., an of  ,

  if and only if  .

In particular, if φ is a sentence and   an elementary substructure of  , then   if and only if  . Thus, an elementary substructure is a model of a theory exactly when the superstructure is a model. Therefore, while the field of algebraic numbers   is an elementary substructure of the field of complex numbers  , the rational field   is not, as we can express "There is a square root of 2" as a first-order sentence satisfied by   but not by  .

An embedding of a σ-structure   into another σ-structure   is a map f: AB between the domains which can be written as an isomorphism of   with a substructure of  . If it can be written as an isomorphism with an elementary substructure, it is called an elementary embedding. Every embedding is an injective homomorphism, but the converse holds only if the signature contains no relation symbols, such as in groups or fields.

A field or a vector space can be regarded as a (commutative) group by simply ignoring some of its structure. The corresponding notion in model theory is that of a reduct of a structure to a subset of the original signature. The opposite relation is called an expansion - e.g. the (additive) group of the rational numbers, regarded as a structure in the signature {+,0} can be expanded to a field with the signature {×,+,1,0} or to an ordered group with the signature {+,0,<}.

Similarly, if σ' is a signature that extends another signature σ, then a complete σ'-theory can be restricted to σ by intersecting the set of its sentences with the set of σ-formulas. Conversely, a complete σ-theory can be regarded as a σ'-theory, and one can extend it (in more than one way) to a complete σ'-theory. The terms reduct and expansion are sometimes applied to this relation as well.

Compactness and the Löwenheim-Skolem theoremEdit

The compactness theorem states that a set of sentences S is satisfiable if every finite subset of S is satisfiable. The analogous statement with consistent instead of satisfiable is trivial, since every proof can have only a finite number of antecedents used in the proof. The completeness theorem allows us to transfer this to satsifiability. However, there are also several direct (semantic) proofs of the compactness theorem. As a corollary (i.e., its contrapositive), the compactness theorem says that every unsatisfiable first-order theory has a finite unsatisfiable subset. This theorem is of central importance in model theory, where the words "by compactness" are commonplace.

Another cornerstone of first-order model theory is the Löwenheim-Skolem theorem. According to the Löwenheim-Skolem Theorem, every infinite structure in a countable signature has a countable elementary substructure. Conversely, for any infinite cardinal κ every infinite structure in a countable signature that is of cardinality less than κ can be elementarily embedded in another structure of cardinality κ (There is a straightforward generalisation to uncountable signatures). In particular, the Löwenheim-Skolem Theorem implies that any theory in a countable signature with infinite models has a countable model as well as arbitrarily large models.

In a certain sense made precise by Lindström's theorem, first-order logic is the most expressive logic for which both the Löwenheim–Skolem theorem and the compactness theorem hold.

DefinabilityEdit

Definable setsEdit

In model theory, definable sets are important objects of study. For instance, in   the formula

 

defines the subset of prime numbers, while the formula

 

defines the subset of even numbers. In a similar way, formulas with n free variables define subsets of  . For example, in a field, the formula

 

defines the curve of all   such that  .

Both of the definitions mentioned here are parameter-free, that is, the defining formulas don't mention any fixed domain elements. However, one can also consider definitions with parameters from the model. For instance, in  , the formula

 

uses the parameter   from   to define a curve.

Eliminating quantifiersEdit

In general, definable sets without quantifiers are easy to describe, while definable sets involving possibly nested quantifiers can be much more complicated.

This makes quantifier elimination a crucial tool for analysing definable sets: A theory T has quantifier elimination if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to a first-order formula ψ(x1, ..., xn) without quantifiers, i.e.   holds in all models of T. If the theory of a structure has quantifier elimination, every set definable in a structure is definable by a quantifier-free formula over the same parameters as the original definition. For example, the theory of algebraically closed fields in the signature σring = (×,+,−,0,1) has quantifier elimination. This means that in an algebriacally closed field, every formula is equivalent to a Boolean combination of equations between polynomials.

If a theory does not have quantifier elimination, one can add additional symbols to its signature so that it does. Early model theory spent much effort on proving axiomatizability and quantifier elimination results for specific theories, especially in algebra. But often instead of quantifier elimination a weaker property suffices:

A theory T is called model-complete if every substructure of a model of T which is itself a model of T is an elementary substructure. There is a useful criterion for testing whether a substructure is an elementary substructure, called the Tarski–Vaught test. It follows from this criterion that a theory T is model-complete if and only if every first-order formula φ(x1, ..., xn) over its signature is equivalent modulo T to an existential first-order formula, i.e. a formula of the following form:

 ,

where ψ is quantifier free. A theory that is not model-complete may or may not have a model completion, which is a related model-complete theory that is not, in general, an extension of the original theory. A more general notion is that of a model companion.

MinimalityEdit

In every structure, every finite subset   is definable with parameters: Simply use the formula

 .

Since we can negate this formula, every cofinite subset (which includes all but finitely many elements of the domain) is also always definable.

This leads to the concept of a minimal structure. A structure   is called minimal if every subset   definable with parameters from   is either finite or cofinite. The corresponding concept at the level of theories is called strong minimality: A theory T is called strongly minimal if every model of T is minimal. A structure is called strongly minimal if the theory of that structure is strongly minimal. Equivalently, a structure is strongly minimal if every elementary extension is minimal. Since the theory of algebraically closed fields has quantifier elimination, every definable subset of an algebraically closed field is definable by a quantifier-free formula in one variable. Quantifier-free formulas in one variable express Boolean combinations of polynomial equations in one variable, and since a nontrivial polynomial equation in one variable has only a finite number of solutions, the theory of algebraically closed fields is strongly minimal.

On the other hand, the field   of real numbers is not minimal: Consider, for intance, the definable set

 .

This defines the subset of non-negative real numbers, which is neither finite nor cofinite. One can in fact use   to define arbitrary intervals on the real number line. It turns out that these suffice to represent every definable subset of  . This generalisation of minimality has been very useful in the model theory of ordered structures. A densely totally ordered structure   in a signature including a symbol for the order relation is called o-minimal if every subset   definable with parameters from   is a finite union of points and intervals.

Definable and interpretable structuresEdit

Particularly important are those definable sets that are also substructures, i. e. contain all constants and are closed under function application. For instance, one can study the definable subgroups of a certain group. However, there is no need to limit oneself to substructures in the same signature. Since formulas with n free variables define subsets of  , n-ary relations can also be definable. Functions are definable if the function graph is a definable relation, and constants   are definable if there is a formula   such that a is the only element of   such that   is true. In this way, one can study definable groups and fields in general structures, for instance, which has been important in geometric stability theory.

One can even go one step further, and move beyond immediate substructures. Given a mathematical structure, there are very often associated structures which can be constructed as a quotient of part of the original structure via an equivalence relation. An important example is a quotient group of a group. One might say that to understand the full structure one must understand these quotients. When the equivalence relation is definable, we can give the previous sentence a precise meaning. We say that these structures are interpretable. A key fact is that one can translate sentences from the language of the interpreted structures to the language of the original structure. Thus one can show that if a structure   interprets another whose theory is undecidable, then   itself is undecidable.

TypesEdit

Basic notionsEdit

For a sequence of elements   of a structure   and a subset A of  , one can consider the set of all first-order formulas   with parameters in A that are satisfied by  . This is called the complete (n-)type realised by   over A. If there is an automorphism of   that is constant on A and sends   to   respectively, then   and   realise the same complete type over A.

The real number line  , viewed as a structure with only the order relation {<}, will serve as a running example in this section. Every single element   satisfies the same 1-type over the empty set. This is clear since any two real numbers a and b are connected by the order automorphism that shifts all numbers by b-a. The complete 2-type over the empty set realised by a pair of numbers   depends on their order: either  ,   or  . Over the subset   of integers, the 1-type of a non-integer real number a depends on its value rounded down to the nearest integer.

More generally, whenever   is a structure and A a subset of  , a (partial) n-type over A is a set of formulas p with at most n free variables that are realised in an elementary extension   of  . If p contains every such formula or its negation, then p is complete. The set of complete n-types over A is often written as  . If A is the empty set, then the type space only depends on the theory T of  . The notation   is commonly used for the set of types over the empty set consistent with T. If there is a single formula   such that the theory of   implies   for every formula   in p, then p is called isolated.

Since the real numbers   are Archimedean, there is no real number larger than every integer. However, a compactness argument shows that there is an elementary extension of the real number line in which there is an element larger than any integer. Therefore, the set of formulas   is a 1-type over   that is not realised in the real number line  .

A subset of   that can be expressed as exactly those elements of   realising a certain type over A is called type-definable over A. For an algebraic example, suppose   is an algebraically closed field. The theory has quantifier elimination . This allows us to show that a type is determined exactly by the polynomial equations it contains. Thus the set of complete  -types over a subfield   corresponds to the set of prime ideals of the polynomial ring  , and the type-definable sets are exactly the affine varieties.

Structures and typesEdit

While not every type is realised in every structure, every structure realises its isolated types. If the only types over the empty set that are realised in a structure are the isolated types, then the structure is called atomic.

On the other hand, no structure realises every type over every parameter set; if one takes all of   as the parameter set, then every 1-type over   realised in   is isolated by a formula of the form a = x for an  . However, any proper elementary extension of   contains an element that is not in  . Therefore a weaker notion has been introduced that captures the idea of a structure realising all types it could be expected to realise. A structure is called saturated if it realises every type over a parameter set   that is of smaller cardinality than   itself.

While an automorphism that is constant on A will always preserve types over A, it is generally not true that any two sequences   and   that satisfy the same type over A can be mapped to each other by such an automorphism. A structure   in which this converse does holds for all A of smaller cardinality than   is called homogeneous.

The real number line is atomic, since all n-types over the empty set realised by   in   are isolated by the order relations between the  . It is not saturated, however, since it does not realise any 1-type over the countable set   that implies x to be larger than any integer. The rational number line   is saturated, in contrast, since   is itself countable and thereofre only has to realise types over finite subsets to be saturated.


Stone SpacesEdit

The set of definable subsets of   over some parameters   is a Boolean algebra. By Stone's representation theorem for Boolean algebras there is a natural dual topological space, which consists exactly of the complete  -types over  . The topology generated by sets of the form   for single formulas  . This is called the Stone space of n-types over A. This topology explains some of the terminology used in model theory: The compactness theorem says that the Stone Space is a compact topological space, and a type p is isolated if and only if p is an isolated point in the Stone topology.

While types in algebraically closed fields correspond to the spectrum of the polynomial ring, the topology on the type space is the constructible topology: a set of types is basic open iff it is of the form   or of the form  . This is finer than the Zariski topology.

CategoricityEdit

A theory was originally called categorical if it determines a structure up to isomorphism. It turns out that this definition is not useful, due to serious restrictions in the expressivity of first-order logic. The Löwenheim–Skolem theorem implies that if a theory T has an infinite model for some infinite cardinal number, then it has a model of size κ for any sufficiently large cardinal number κ. Since two models of different sizes cannot possibly be isomorphic, only finite structures can be described by a categorical theory.

However, the weaker notion of κ-categoricity for a cardinal κ has become a key concept in model theory. A theory T is called κ-categorical if any two models of T that are of cardinality κ are isomorphic. It turns out that the question of κ-categoricity depends critically on whether κ is bigger than the cardinality of the language (i.e.   + |σ|, where |σ| is the cardinality of the signature). For finite or countable signatures this means that there is a fundamental difference between  -cardinality and κ-cardinality for uncountable κ.

 -categoricityEdit

 -categorical theories can be characterised by properties of their type space:

For a complete first-order theory T in a finite or countable signature the following conditions are equivalent:
  1. T is  -categorical.
  2. Every type in Sn(T) is isolated.
  3. For every natural number n, Sn(T) is finite.
  4. For every natural number n, the number of formulas φ(x1, ..., xn) in n free variables, up to equivalence modulo T, is finite.

The theory of  , which is also the theory of  , is  -categorical, as every n-type   over the empty set is isolated by the pairwise order relation between the  . This means that every countable dense linear order is order-isomorphic to the rational number line. On the other hand, the theories of  ,   and   as fields are not  -categorical. This follows from the fact that in all those fields, any of the infinitely many natural numbers can be defined by a formula of the form  .

 -categorical theories and their countable models also have strong ties with oligomorphic groups:

A complete first-order theory T in a finite or countable signature is  -categorical if and only if its automorphism group is oligomorphic.

The equivalent charcaterisations of this subsection, due independently to Engeler, Ryll-Nardzewski and Svenonius, are sometimes referred to as the Ryll-Nardzewski theorem.

In combinatorial signatures, a common source of  -categorical theories are Fraïssé limits, which are obtained as the limit of amalgamating all possible configurations of a class of finite relational structures.

Uncountable categoricityEdit

Michael Morley showed in 1963 that there is only one notion of uncountable categoricity.[6]

Morley's categoricity theorem
If a first-order theory T in a finite or countable signature is κ-categorical for some uncountable cardinal κ, then T is κ-categorical for all uncountable cardinals κ.

Morley's proof revealed deep connections between uncountable categoricity and the internal structure of the models, which became the starting point of classification theory and stability theory. Uncountably categorical theories are from many points of view the most well-behaved theories. In particular, complete strongly minimal theories are uncountably categorical. This shows that the theory of algebraically closed fields of a given characteristic is uncountably categorical, with the transcendence degree of the field determining its isomorphism type.

A theory that is both  -categorical and uncountably categorical is called totally categorical.

Selected applicationsEdit

Among the early successes of model theory are Tarski's proofs of the decidability of various algebraically interesting classes, such as the real closed fields, Boolean algebras and algebraically closed fields of a given characteristic.

In the 1960s, considerations around saturated models and the ultraproduct construction lead to the Abraham Robinson's development of non-standard analysis.

In 1965, James Ax and Simon B. Kochen showed a special case of Artin's conjecture on diophantine equations, the Ax-Kochen theorem, again using an ultraproduct construction.[7]

More recently, the connection between stability and the geometry of definable sets led to several applications from algebraic and diophantine geometry, including Ehud Hrushovski's 1996 proof of the geometric Mordell-Lang conjecture in all characteristics[8]

In 2011, Jonathan Pila applied techniques around o-minimality to prove the André-Oort conjecture for products of Modular curves. [9]

In a separate strand of inquiries that also grew around stable theories, Laskowski showed in 1992 that NIP theories describe exactly those definable classes that are PAC-learnable in machine learning theory. [10]

HistoryEdit

Model theory as a subject has existed since approximately the middle of the 20th century. However some earlier research, especially in mathematical logic, is often regarded as being of a model-theoretical nature in retrospect. The first significant result in what is now model theory was a special case of the downward Löwenheim–Skolem theorem, published by Leopold Löwenheim in 1915. The compactness theorem was implicit in work by Thoralf Skolem,[11] but it was first published in 1930, as a lemma in Kurt Gödel's proof of his completeness theorem. The Löwenheim–Skolem theorem and the compactness theorem received their respective general forms in 1936 and 1941 from Anatoly Maltsev. The development of model theory as an independent discipline was brought on by Alfred Tarski, a member of the Lwów–Warsaw school during the interbellum. Tarski's work included logical consequence, deductive systems, the algebra of logic, the theory of definability, and the semantic definition of truth, among other topics. His semantic methods culminated in the model theory he and a number of his Berkeley students developed in the 1950s and '60s.

In the further history of the discipline, different strands began to emerge, and the focus of the subject shifted. In the 1960s, techniques around ultraproducts became a popular tool in model theory. At the same time, researchers such as James Ax were investigating the first-order model theory of various algebraic classes, and others such as H. Jerome Keisler were extending the concepts and results of first-order model theory to other logical systems. Then, Saharon Shelah's work around categoricity and Morley's problem changed the complexion of model theory, giving rise to a whole new class of concepts. The stability theory (classification theory) Shelah developed since the late 1960s aims to classify theories by the number of different models they have of any given cardinality. Over the next decades, it became clear that the resulting stability hierarchy is closely connected to the geometry of sets that are definable in those models; this gave rise to the subdiscipline now known as geometric stability theory.

Connections to related branches of mathematical logicEdit

Finite model theoryEdit

Finite model theory (FMT) is the subarea of model theory (MT) that deals with its restriction to interpretations on finite structures, which have a finite universe.

Since many central theorems of model theory do not hold when restricted to finite structures, FMT is quite different from MT in its methods of proof. Central results of classical model theory that fail for finite structures under FMT include the compactness theorem, Gödel's completeness theorem, and the method of ultraproducts for first-order logic.

The main application areas of FMT are descriptive complexity theory, database theory and formal language theory.

Set theoryEdit

Set theory (which is expressed in a countable language), if it is consistent, has a countable model; this is known as Skolem's paradox, since there are sentences in set theory which postulate the existence of uncountable sets and yet these sentences are true in our countable model. Particularly the proof of the independence of the continuum hypothesis requires considering sets in models which appear to be uncountable when viewed from within the model, but are countable to someone outside the model.

The model-theoretic viewpoint has been useful in set theory; for example in Kurt Gödel's work on the constructible universe, which, along with the method of forcing developed by Paul Cohen can be shown to prove the (again philosophically interesting) independence of the axiom of choice and the continuum hypothesis from the other axioms of set theory.

In the other direction, model theory itself can be formalized within ZFC set theory. The development of the fundamentals of model theory (such as the compactness theorem) rely on the axiom of choice, or more exactly the Boolean prime ideal theorem. Other results in model theory depend on set-theoretic axioms beyond the standard ZFC framework. For example, if the Continuum Hypothesis holds then every countable model has an ultrapower which is saturated (in its own cardinality). Similarly, if the Generalized Continuum Hypothesis holds then every model has a saturated elementary extension. Neither of these results are provable in ZFC alone. Finally, some questions arising from model theory (such as compactness for infinitary logics) have been shown to be equivalent to large cardinal axioms.

See alsoEdit

NotesEdit

  1. ^ Chang and Keisler, p. 1
  2. ^ https://plato.stanford.edu/entries/model-theory/
  3. ^ Chang and Keisler, p. 1
  4. ^ Hodges (1997), p. vii
  5. ^ Dirk van Dalen, (1980; Fifth revision 2013) "Logic and Structure" Springer. (See page 1.)
  6. ^ Morley, Michael (1963). "On theories categorical in uncountable powers". Proceedings of the National Academy of Sciences of the United States of America. 49: 213–216.
  7. ^ Ax, James; Kochen, Simon (1965). "Diophantine Problems Over Local Fields: I.". American Journal of Mathematics. 87pages=605-630.
  8. ^ Ehud Hrushovski, The Mordell-Lang Conjecture for Function Fields. Journal of the American Mathematical Society 9:3 (1996), pp. 667-690.
  9. ^ Jonathan Pila, Rational points of definable sets and results of André–Oort–Manin–Mumford type, O-minimality and the André–Oort conjecture for Cn. Annals of Mathematics 173:3 (2011), pp. 1779–1840. doi=10.4007/annals.2011.173.3.11
  10. ^ Michael C. Laskowski, Vapnik-Chervonenkis Classes of Definable Sets. Journal of the London Mathematical Society s2-45:2 (1992), pp. 377-384.
  11. ^ "All three commentators [i.e. Vaught, van Heijenoort and Dreben] agree that both the completeness and compactness theorems were implicit in Skolem 1923…." [Dawson, J. W. (1993). "The compactness of first-order logic:from gödel to lindström". History and Philosophy of Logic. 14: 15–37. doi:10.1080/01445349308837208.]

ReferencesEdit

Canonical textbooksEdit

Other textbooksEdit

Free online textsEdit