The aim of today’s session is to get acquainted with belief revision theory, review some preliminary notions, and think about why it is worth getting our heads around it as PhD students in Philosophy.
In very general terms, belief revision theory is a branch of Philosophy (and Logic) that tackles the following question:
What are the general principles that a reasoner (whether human or artificial) should follow as they acquire more and more information from the world?
Consider the following example.
Three Composers
You are a newbie when it comes to opera.
- A friend of yours who’s really into this topic tells you that Verdi is Italian, while Bizet and Satie are both French, and you come to believe so ().
- Later on, you find out that Verdi and Bizet are compatriots (). So, you are now uncertain as to whether Verdi and Bizet are both Italian or both French, but you still believe that Satie is French.
- Eventually, you find out on Wikipedia that they are in fact all compatriots (). As a result, you are now uncertain as to whether they are all Italian or all French.
Question: Is the way you have changed your beliefs through and rational?
Answer: It depends!
What belief revision theory can do for us is help us understand under what conditions changing our beliefs as in ^7ac357 is rational, and under what conditions it is not. For example, under certain theories of belief revision, the following principle is true:
Preservation. If you learn , and is consistent with what you already believe, it is not rational to drop any belief as a result of learning .
In other words, Preservation makes you very conservative when it comes to belief. For example, a theory that validates Preservation is a theory that deems you irrational in ^7ac357. For, at , you believe that Satie is French and, between and , you learn something consistent with that belief—namely, that the three composers are all compatriots (since it is possible that they are all French). However, at you drop your belief that Satie is French, and believe only that they are either all Italian or all French.
Belief revision theory allows us to discuss such cases in a very general way and to compare theories clearly and unambiguously. In particular, what we are interested in when studying belief revision theory is establishing results like the following:
Representation Result (Schema)
Let be a set of belief revision rules, and some kind of structure. A representation result for and structures of kind is a proof of the following statement:
The rules are valid on a structure if, and only if, that structure is of kind .
Why should we study these kinds of results?
- One reason is that representation theorems demonstrate a formal equivalence between axiomatic and semantic approaches. Specifying correct belief revision rules () and defining a mathematical structure of kind to interpret a logical language amount to the same theoretical commitment. The rule set and the structure yield identical predictions regarding the rationality of a given belief change.
- This presupposes, of course, that studying belief revision is itself worthwhile. There are two primary reasons why it is. First, the rules of belief revision govern rational outright belief in the same way probability theory governs rational credence. For those interested in the epistemic rationality of full belief, this framework is indispensable. Second, even for those less interested in epistemology, studying these results provides an opportunity to apply mathematical logic to a domain where the formalism retains a strong intuitive grip – namely, modeling rational agents who accept and retract beliefs upon acquiring new information.
@myself: Hanti Lin has a nice section on why representation results are mathematically and philosophically relevant in his "Belief Revision" article in the PhilPapers handbook.
Please find a summary of all definitions and propositions discussed in this note here: link.
1. Formal Preliminaries
1.1. Language and Interpretations
To do belief revision theory, we need some basic logical preliminaries. I assume everybody has done some courses or standalone study of classical propositional logic, so I won’t revise that completely, but I will simply set the stage on which we are going to elaborate.
First, let us define a formal language of interest. We will concern ourselves with a language so constructed.
Language
A formal language is defined as follows. Let be a (countable) set of objects called “propositional variables”, that is . Then, we define via induction as follows.
- Base: .
- Step: For all :
- If , then ;
- If and then ;
- If and then ;
- If and then .
- Nothing else is in .
Note the greek letters are variables for the sentences in , not sentences themselves. contains only objects like , and so on.
Next, we need to define the semantics for our language. As it is standard, we can think of a possible world (or interpretation) simply as an assignment of truth values to our basic propositional variables. Then, we define what it means for any sentence to be true at a world using the satisfaction relation . As you know, some sentences have a special status: some are true irrespective of the interpretation you consider (i.e., tautologies) and others are false irrespective of the interpretation you consider (i.e., contradictions).
Interpretations and Truth ( )
Let be given. A possible world (or interpretation) is a function that assigns a truth value to each propositional variable . Let be the set of all possible worlds.
We define the satisfaction relation (read as “ is true at ”) via induction on the structure of :
- Base: For all , .
- Step: For all :
- ;
- .
Sentences and Propositions
As you already know, there are two ways in which we may make reference to a sentence . First, as a syntactic object in . This is the most obvious way, given ^e1e724. However, another way of referring to a sentence is by considering its “truth-set”, i.e., the set of worlds at which the sentence is true. Let be a function that takes a sentence and maps it to its truth-set , that is:
Usually, when philosophers talk about propositions rather than sentences, they refer to what we formalize through a truth-set (in a way, the truth-set of is the “content” of ). Note that propositions and sentences are not the same thing, and also they are not “equivalent”, in the sense that “looses” some information. For instance, consider that while and are different objects in , .
Before moving on, let me make an important remark regarding the cardinality of . As we will see, how many worlds are in will make a difference when doing belief revision theory.
- Suppose first that , the set of propositional variables, is finite. That is, for some . How many possible worlds are in ?
- Suppose now that is countable, i.e., . How many possible worlds are in ?
1.2. Logical Consequence
We can already say something about belief revision theory using what we established in 1. Formal Preliminaries. First, an agent will be represented by the propositions they believe. That is, a rational agent will be represented by a belief set, which is a special kind of set of sentences in our formal language (we will define exactly what makes it special in a moment) representing the sentences the agent takes to be true. Second, belief revision will be defined as an operator on those sets . Suppose an agent believes all the sentences in and that they learn . The new belief set they obtain by revising by is denoted as:
Clearly, we need more information to decide what should actually be in (given the initial content of ). To do so, however, we first need to clarify what belief sets are, given that they are not just any subset of . This is exactly why we need the notion of logical consequence.
In general, we define logical consequence based on the formal work done in 1. Formal Preliminaries as follows.
Logical Consequence ( )
Let and be given. We say that follows logically from (in symbols, ) iff the latter is true at all the worlds that make the former true. In symbols:
Note that:
- ^38e024 may be extended, with a slight abuse of notation, to include the consequence relation between sets of formulas and a sentence , i.e., . We do this by requiring that iff is true at all the worlds which satisfy all the sentences .
- I distinguish the satisfaction () and consequence () relations symbolically. We won’t need a third relation for syntactic deducibility (which is what “” usually denotes, strictly speaking), so we will use for semantic consequence here to keep things simple.
When doing belief revision theory, we will frequently use a consequence operator instead of a consequence relation, as it makes it easier to express certain properties in a concise and brief way. Note that and are inter-definable. Since we started with , we take it as basic and define in terms of .
Consequence Operator
Let be given. We define the consequence operator (of ) as a function from sets of sentences to sets of sentences :
N.b., is a function from to .
Now, we can finally define what a belief set is.
Belief Sets
Let be a subset of . is a belief set if, and only if, the following equivalence holds:
Let us call the set of all belief sets.
In other words, belief sets are not just sets of sentences, but they are sets of sentences “closed under logical entailment”. We usually explain this requirement by saying that logical closure is—in this context—the “mark of rationality”. That is, a rational agent (as modeled in this simplified setting) is an agent that believes all the logical consequences of what they believe.
Before concluding this section, let me point out that logical consequence has many important properties, independently of whether we represent it via or . In the following, I will introduce some of them. I will first define some of them in general terms (i.e., for arbitrary consequence relation and consequence operation ).
Reflexivity of
Let be a consequence relation on . is reflexive iff:
Transitivity of
Let be a consequence relation on . is transitive iff:
Monotonicity of
Let be a consequence relation on . is monotonic iff:
It is straightforward to prove the following:
Proposition
Let be the consequence relation defined in ^38e024. is reflexive, transitive, and monotonic.
bproof See 4. Exercises. eproof
Note that these properties of the consequence relation correspond to analogous properties of the consequence operator . I will omit the details here, but the following result is derivable from the definitions above and ^41852b.
Proposition
bproof It is possible to prove this result from ^41852b and ^ba4d33. See also 4. Exercises. eproof
The classical consequence relation has other important properties
Proposition
Let be the consequence relation defined in ^38e024. The following properties hold for .
- Compactness: , where is a finite subset of .
- Deduction Theorem: If , then .
- Disjunction in the Premises: If and , then .
bproof See Compactness. eproof
The proof of compactness is mathematically a bit more involved (I have prepared a supplementary note on this that we can discuss separately if you are interested!).
If you want to dive deeper into the properties of classical consequence, or if you are looking for a beautifully written explanation of consequence relations in general (abstracting away from our specific semantic definition in ^38e024), I highly recommend David Makinson’s Bridges from Classical to Nonmonotonic Logic (2005).
Now we are ready to introduce belief revision theory.
2. Belief Revision Theory
The best way to get a feel for this topic is by diving into the most famous framework for belief change: AGM theory. Named after the three scholars who introduced it in 1985—Carlos Alchourrón, Peter Gärdenfors, and David Makinson—it is essentially the gold standard in the field!
I’ll start by introducing AGM axiomatically. This simply means we’ll look at the basic rules (or postulates) that describe how an ideally rational agent—let’s call them an “AGM-rational” agent—ought to update their beliefs. After that, we’ll explore the semantic structures we use to actually model these agents. Our main goal here is to build up to AGM’s famous completeness result (see ^6675a2).
A Quick Heads-Up: Mathematics vs. Philosophy
As we go through this, it’s really important to separate the hard math from the philosophical interpretations. For instance, proving that the AGM postulates correspond to a specific mathematical structure is an uncontroversial mathematical theorem. However, claiming that “AGM-rationality” is rationality simpliciter is a philosophical claim—and a highly debated one!
To formalize the AGM framework, let be any belief set, be any sentence, and be a candidate revision function. In plain English, is just a function that takes your initial belief set (from the collection of all possible belief sets, ) and a new piece of information (a sentence from our language ), and gives you back a brand-new set: . The set is the revision of by iff validates the postulates before.
Before detailing these postulates, however, it is helpful to understand the extra-logical motivations underpinning them:
- Consistency and Closure: We require the revised belief set, (the result of revising by ), to be logically closed, consistent (unless is logically contradictory), and to successfully incorporate the new information .
- Informational Economy: The new belief set should differ from the original set as little as possible. Revision must be conservative, retaining as much prior information as logical consistency allows.
These “meta-principles” – particularly the goal of informational economy – drive the specific characterization of the revision operator . Again: these meta-principles are not true as a matter of logic, but are decided in advance on extra-logical grounds.
The function is a basic AGM belief revision function if and only if it satisfies the following postulates. Let us start with Closure:
This has a very straightforward meaning: when you revise your beliefs, you shouldn’t end up with a fragmented, messy pile of isolated sentences. You should always arrive at a new, stable epistemic state that is logically closed.
(You might be thinking: “Wait, isn’t this redundant if we already defined as a function that outputs a belief set?” Mathematically, yes! But we state it explicitly as an axiom to ensure this property of revision independently of the definition of as some kind of function.)
Also, keep in mind that the AGM axioms do not single out just one unique way to revise beliefs. Knowing the rules of logic alone isn’t enough to tell us exactly what ends up inside for a specific agent. The postulates merely set the boundary lines for what counts as a “rational” change. As we clarified earlier (see ^7ac357), to figure out the exact output of a revision, we have to bring in extra-logical information—like how strongly the agent values, or “entrenches,” their specific beliefs!
Before moving to the next postulate, consider the following mathematical consequence of Closure.
Closure Implies that Every Belief Set Contains all Tautologies
It is a trivial mathematical truth that , where is a belief set. However, this has an important consequence given the fact that classical logical consequence is monotonic. For it follows by monotonicity that, since , then . Since is a belief set, we know , which means . But what exactly is ?
is the set of all propositional tautologies. This can be easily seen by unpacking the meaning of as applied to :
In turn, is true iff for all possible worlds : if satisfies all sentences in , then .
The condition “ satisfies all sentences in ” can be logically spelled out as follows: for all , if , then . Since for any choice of by the very definition of the empty set, the antecedent “” is always false. This implies that the conditional “ satisfies all sentences in ” is vacuously true for every possible world . Therefore, the statement is true if, and only if, for all . That is, is a propositional tautology.
Therefore, any belief set logically contains all the propositional tautologies.
Also, note that, given Closure and the properties of (and ), saying (or ) is exactly the same as saying .
The second postulate is called Success:
This postulate has a rather simple meaning: revising by should result in believing . The third postulate is called Consistency:
This postulate has a rather straightforward meaning, but it forces us to deeply understand the meaning of the objects and notation we have introduced. There are three “things” to clarify here: the meaning of the antecedent, that of the consequent, and the reason why this postulate is in “If-Then” form.
- (where is just any contradiction you like, e.g., ) means that is not a contradiction. Here is why. Recall that, in general, means that there exists a world such that but . So, means that there exists some world such that and . Since for any world we have , simply means that is non-contradictory or satisfiable. (Note: might be either a tautology or a contingent proposition.)
- is just an equivalent way of saying that contains no contradictions. Let me explain this by showing that if, and only if, contains a contradiction.
- Suppose . Therefore, , i.e., . Note that since is a belief set, it is closed under logical consequence by Closure. Therefore, if , it must be that .
- Suppose now that . In classical logic, for any (the principle of explosion). So, since classical consequence is monotonic and , it follows that . Since , we get . Ultimately, recall that is a subset of our language , hence by definition, which implies that .
- Let me explain now the last bit of information contained in Consistency. Why require that only if , and not in general? The reason is that requiring it in general would conflict with Success. For suppose that the agent revises their belief set by an outright contradiction, . By Success, we have that , which (as we just saw in point 2) implies that .
Escaping an Inconsistent Belief Set
The Consistency postulate has an important consequence. Suppose that is an inconsistent belief set, i.e., . Suppose that is a non-contradictory sentence. By Consistency, it follows that . That is, revising an inconsistent set of beliefs by a non-contradictory sentence restores consistency.
(2) Why Finding the Right Belief Revision Operator Is So Hard
Closure, Success, and Consistency are rather reasonable constraints on , aren’t they? However, accepting them already forces us to reject simple, naive revision operators. Suppose we want to revise a belief set by some new information , and currently contains . You might think we can define a simple revision operator that just removes the direct contradiction , adds the new information , and then logically closes the result:
But this fails precisely because beliefs are logically entangled, which inevitably leads to a violation of Consistency. Suppose contains the belief (“It is raining”), the belief (“If it is raining, the picnic is cancelled”), and consequently the belief (“The picnic is cancelled”). Let’s focus on these core beliefs: .
Suppose you learn (“The picnic is on!”) and revise using . First, the operator removes and adds , leaving us with the intermediate set containing . However, the definition of then requires us to apply to satisfy Closure. Since and are still in the set and they logically entail , applying brings right back! Because the closed set now logically contains both and , it logically explodes into the absurd belief set (i.e., ).
Notice that the new information is perfectly non-contradictory on its own (). Yet, our revised belief set became , which is a direct violation of Consistency.
This shows that belief revision cannot just be simple set addition and subtraction of elements from . When you add new information that contradicts the old, you don’t just have to remove the direct contradiction; you have to track down and modify the underlying beliefs that logically entail it.
The fourth postulate is called Inclusion:
and it sets an “upper bound” to the effects of revision. To fully grasp this, let me clarify what is. In the belief revision literature, the operation that takes a set and a sentence and returns is called expansion (or sometimes augmentation). It is usually denoted by , meaning . It has a very straightforward meaning: you simply add the new information to your set of beliefs and take the logical closure of the result. So, Inclusion simply says that, at most, belief revision may amount to expansion.
The fifth postulate is called Vacuity:
This postulate requires more discussion, for its meaning and its role are not completely obvious. As we will see, Vacuity is motivated by the informational economy idea briefly mentioned above.
- First, notice that the right-hand side of the equation is exactly the expansion operation we just defined above. So, Vacuity tells us that revision is strictly identical with expansion in certain cases.
- Let us look closely at the condition . This condition tells us when revision and expansion deliver the same result.
- First of all, let’s intuitively clarify its meaning. Informally, means two things.
- First, it means that is consistent. Recall that , so if were inconsistent, then , which means for all (including ). So, if , it must be consistent.
- Second, means that is “compatible” with . To see this, recall that is a universal statement, saying that every world satisfying satisfies as well. Its negation, then, is an existential statement, saying that there exists a world where all are true but is false. Thus, means that there exists at least a world where all are true but is false, i.e. is true. The fact that there exists a world where both and are true means that the two are compatible.
- Now that we understand what the condition “” says: why does AGM say that revision and expansion coincide if is the case? That is, why not define in such a way that it just is for every case? After all, is a very simple operation. The problem is that, precisely when , claiming that has disastrous consequences. Suppose that , i.e., . Since obviously holds, it follows by the monotonicity of that . So, , and we also have by the reflexivity of . Therefore, their conjunction , which means . If we forced across the board, it would follow by Closure that whenever the new information contradicts our prior beliefs. The problem, however, is that if the new information is not contradictory in itself (), this is a direct violation of the Consistency postulate! To avoid this problem, we do not equate with across the board, but only in a specific, safe case: when .
- First of all, let’s intuitively clarify its meaning. Informally, means two things.
The sixth postulate is called Congruence:
and it simply says that, if and are logically equivalent, then revising by is exactly the same as revising by . That is, what you end up believing as a result of revision depends on the content of a proposition (world-theoretically, its truth-set) and not on its syntactic presentation. (This postulate is sometimes called Dalal’s Principle of Irrelevance of Syntax). Take a moment to convince yourself that and are logically equivalent in the semantic sense (i.e., ) if, and only if, .
This concludes our discussion of the main AGM postulates. The following two rules, usually called the supplementary postulates, are slightly more cumbersome. However, they are necessary for proving the representation theorem, as they will significantly constrain the kind of structure we will consider.
2.1. Supplementary Postulates
The seventh postulate is called Superexpansion:
This postulate is similar to Inclusion in 2. Belief Revision Theory above. It says that revising by a conjunction may result in, at most, revising by one conjunct and then expanding by the other. Alternatively, you are not permitted to believe more by revising by than you would if you simply revised by and then expanded by .
The eighth postulate is called Subexpansion:
This postulate says that when is compatible with , expanding by on top of revising by may result in having at most the same beliefs as in the case where you simply revise by the conjunction . It is very important to appreciate a consequence of these two postulates taken together: if the result of revising by one conjunct, i.e., , is compatible with the other conjunct , then
That is, revising by the conjunction of the two is exactly the same as revising by the first and then expanding by the second. Let me clarify two points here:
- The equality above is not affected by the order of the conjuncts, i.e., by whether we revise by or . The reason is that as a result of Congruence, since .
- The requirement that (or ) is compatible with (or ) is crucial. For if , we would have that as a result of Consistency, but . Here is why:
- .
- Obviously, .
- By the monotonicity of , .
- ex hypothesi, while by the reflexivity of .
- Therefore, .
Why We Need the Supplementary Postulates
You might wonder why we bother with these two extra rules. The answer lies in the Completeness Result we mentioned earlier.
The first six postulates only guarantee a “weak” kind of revision (called partial meet revision). As we will see, the mathematical structures that allow us to encode these belief revision rules are ordered structures. As we will see, by adding Superexpansion and Subexpansion we enforce a strict, transitive order on our beliefs. That is, adding the supplementary postulates allows us to prove that AGM-rational revision corresponds exactly to elegant semantic structures, such as a system of concentric spheres (where we fall back to the “closest” possible worlds when revising) or an epistemic entrenchment ordering (where we always sacrifice our least valuable beliefs first).
2.2. Derivative Rules of AGM
The basic AGM postulates entail several intuitive derivative rules. Here are some of the most important ones that follow strictly from the first six postulates:
We can also derive a very famous rule concerning conditional beliefs and material implication:
Once we adopt Superexpansion and Subexpansion, we unlock a new set of derivative rules that govern how revision behaves when we build up more complex pieces of new information, particularly conjunctions and disjunctions.
The following rules dictate the logic of sequential and conjunctive learning:
(Note: Rational Monotony is a sort of “generalized” version of Preservation. To convince your self, note that Rational Monotony is equivalent to Preservation when .)
We also gain derivative rules that dictate how revision handles disjunctions (situations where we learn that at least one of two things is true, but we don’t know which):
Finally, there is an important rule, called Disjunctive Factoring, which is logically equivalent to the combination of both supplementary postulates (given the six basic postulates):
3. Exercises
Solutions are provided here
3.1. Exercises: Formal Preliminaries
1. Prove ^41852b (i.e., that classical semantic consequence is reflexive, transitive, and monotonic). 2. Prove ^db37aa (i.e., that the classical consequence operator satisfies reflexivity, transitivity, monotonicity, and idempotence).
3.2. Exercises: Consequence Relations and Consequence Operators
3. The properties of consequence relations and consequence operators are not all logically independent. Explore their logical interactions by proving the following claims:
- Reflexivity + Transitivity Monotonicity: Prove that if a consequence relation (or operator ) is reflexive and transitive, it must necessarily be monotonic.
- Monotonicity + Idempotence Transitivity: For a consequence operator , prove that if is monotonic and idempotent, then is transitive.
- Reflexivity + Transitivity Idempotence: For a consequence operator , prove that if is reflexive and transitive, then is idempotent.
- Reflexivity + Monotonicity Transitivity: Provide a counterexample (e.g., a restricted consequence operator on ) that is reflexive and monotonic, but fails to be transitive.
(N.b., 4 implies that Reflexivity + Monotonicity \cancel{ \implies } Idempotence.)
4. Cumulative Transitivity and Non-Monotonicity. As established in Exercise 3.1, standard transitivity is sufficiently strong that, when paired with reflexivity, it entails monotonicity. Consequently, it is theoretically useful to identify a weaker formulation of transitivity that does not entail monotonicity in the presence of reflexivity. This allows for the formal study of non-monotonic consequence relations – such as those modeling default reasoning, where acquiring new information may lead to the retraction of prior conclusions – without reducing the consequence operator to mere reflexivity. Consider the property of Cumulative Transitivity.
Cumulative Transitivity (Cut)
A consequence relation is cumulatively transitive iff:
Equivalently, for a consequence operator , cumulative transitivity is defined as:
Explore the limits of this property by proving the following:
- Reflexivity + Cut + Idempotence Monotonicity: Provide a counterexample demonstrating that a consequence relation (or operator ) can be reflexive, cumulatively transitive, and idempotent, yet fail to be monotonic.
- Reflexivity + Monotonicity (Transitivity Cut): Prove this logical equivalence.
- Reflexivity + Monotonicity (Idempotence Cut): Prove this logical equivalence.
3.3. Exercises: Derivative Belief Revision Postulates
5. Derivative Rules for Revision. Prove that the derivative rules listed in 2.2. Derivative Rules of AGM follow from the 8 postulates listed in 2. Belief Revision Theory.
6. Derivative Rules: Rational Monotony. Prove that if we replace with a tautology in Rational Monotony (see 2.2. Derivative Rules of AGM), it becomes logically equivalent to the Preservation rule.