Building a Sequent Calculus Toolbox

After finishing the first section of this tutorial, I realized it was getting quite long and therefore decided to split it into multiple posts. Part two of the tutorial can be found here.

This article is a small tutorial on the use of the calculus toolbox for defining and generating Isabelle theory files and a graphical interface for a logic calculus. This tutorial is similar to the introduction guide to the calculus toolbox, however here I will try to showcase the work flow and the decisions and modifications that needed to be made, to formalize a new or different version of a calculus, starting from a template calculus.

In this tutorial, we will be taking a look at the LK (or "klassische Prädikatenlogik") sequent calculus, described in this wikipedia article. Due to the current restriction of the toolbox, namely the lack of support for quantifiers in calculi, only a fragment of the LK calculus without the $\exists$ and $\forall$ quantifiers will be formalized in this article.

Formally defining the LK calculus terms

To begin, we need a formal definition of the terms of this calculus. Looking at the rules of the LK system we can define the formulae of this sequent calculus fragment in the following manner:

$F := a p \in A t P r o p ∣ F \land F ∣ F \to F ∣ \neg F$

The following section describes the sequents that are built up from formulas.

$⊢$ known as the turnstile, separates the assumptions on the left from the propositions on the right

A and B denote formulae of first-order predicate logic (one may also restrict this to propositional logic),

$Γ$ , $Δ$ , $Σ$ , and $Π$ are finite (possibly empty) sequences of formulae (in fact, the order of formulae do not matter; see subsection Structural Rules), called contexts,

when on the left of the $⊢$ , the sequence of formulas is considered conjunctively (all assumed to hold at the same time),

while on the right of the $⊢$ , the sequence of formulas is considered disjunctively (at least one of the formulas must hold for any assignment of variables),

Formally we can write this definition as:

$S := F ∣ S, S$

where the inductively defined formulas $F$ are composed into structures $S$ . Two structures on either side of a turnstyle then make up sequents:

$S ⊢ S$

Note that our formal definition of the LK fragment's terms differs somewhat from the wikipedia article definition, in that it builds the sequences of formulas as trees rather than sequences/lists. This means that the rules of the calculus will have to handle associativity explicitly, and therefore, this formalization will include additional rules not found in the wikipedia article.

However, before trying to formalize these extra rules, we first need to formalize the structure of the terms in a such a way, that the toolbox can parse and create the necessary Isabelle theories and Scala code. In order to do this, we can start with a template JSON file, which encodes DCPL (display classical propositional logic), similar to the LK fragment in its terms and some rules. Even though DCPL is a display calculus, little modification of the JSON file is needed to encode the Sequent calculus.

Encoding the terms in the JSON description file

To encode the LK calculus fragent into a format which the toolbox can use, we need to make/download a copy of the template calculus, which we will be modifying.

Next, change the name of the calculus, encoded in this line, to "SequentCalc".

Having done the previous two steps, we can focus on the section of the JSON file encoding the terms of the calculus, found right underneath the above code, which needs to be modified to correspond to our earlier inductive definition of the calculus terms.

Looking at a code snippet below, it is clear that the DCPL terms are very similar to the Sequent calculus terms:

"Atprop" : {
    "Atprop" : {
       "type" : "string",
       "ascii" : "_",
       "latex" : "_"
    },
    "Atprop_Freevar" : {
       "type" : "string",
       "isabelle" : "?\\<^sub>A _",
       "ascii" : "A? _",
       "latex" : "_",
       "precedence": [320, 320]
    }
  },
  
  "Formula" : {
    "Formula_Atprop" : {
       "type": "Atprop",
       "isabelle" : "_ \\<^sub>F",
       "precedence": [320, 330]
    },
    "Formula_Freevar" : {
       "type" : "string",
       "isabelle" : "?\\<^sub>F _",
       "ascii" : "F? _",
       "latex" : "_",
       "precedence": [340, 330]
    },
  .
  .
  .

The only differences between the terms of the template calculus and the sequent calculus are in fact the following:

The formulae of the template calculus don't include the unary $\neg$ operator
The template calculus includes two structural connectives, the comma $;$ and the implication $>>$ , whereas the sequent caculus only uses the comma.

To address the first difference, we add a Formula_Un constructor to the type Formula:

"Formula_Un" : {
    "type" : ["Formula_Un_Op", "Formula"],
    "isabelle" : "U\\<^sub>F _",
    "isabelle_se" : "_",
    "precedence": [330, 331]
  }

To explain what the entries in the JSON encoding mean, have a look at the calculus encoding page of the documentation.

Once we define the Formula_Un constructor, we need to define the actual not operator:

"Formula_Un_Op" : {
    "Formula_Not" : {
      "isabelle" : "\\<not>\\<^sub>F",
      "ascii" : "-",
      "latex" : "\\lnot"
    }
  }

As one might notice, both unary and binary connectives in terms are defined in two steps, namely, the terms with binary connectives are defined as:

$ϕ = U U n O p ϕ ∣ B ϕ B i n O p ϕ$

where $U n O p$ and $B i n O p$ are of the form: $O p = O p_{1} ∣ O p_{2} ∣ . . .$

The reason for this formalization arises from the encoding and function of the match and replace functions that form the basis of a derivation function. To illustrate, let's take a look at the match function for formulas, defined in this calculus template, specifically at the following line:

match_Formula (Formula_Bin var11 op1 var12) x = (case x of 
    (Formula_Bin var21 op2 var22) ⇒ 
      (if op1 = op2 then 
        (match var11 var21) @m (match var12 var22) 
      else []) | 
    _ ⇒ [])

This snippet illustrates the advantage of separating the connectives and generalizing the terms in the aforementioned way, as now the match formula is invariant in the number of binary connectives in the calculus.

To address the second difference to the template calculus, namely the extra structural connective, we simply delete the Structure_ImpR entry under Structure_Bin_Op in the JSON file.

One final modification changes the Structure_Comma sugar notation from ' $;$ ' to ' $,$ ' (for both the ASCII and the Isabelle encoding), as this is a more conventional notation.

Encoding the rules of the LK fragment

Now that the terms of out Sequent calculus fragment are defined, we can move on to the rule encoding. Following the template, the rules first need to be declared in the calc_structure_rules section of the file and then encoded using the ASCII defined terms in the latter part of the JSON file. We will take a look at a few key rules to demonstrate the encoding.

Logical rules

The first interesting rule is the $\lor L$ rule:

$\frac{Γ, A ⊢ Δ Σ, B ⊢ Π}{Γ, Σ, A \lor B ⊢ Δ, Π} (\lor L)$

To encode thus rule, we need to do two things. firstly, we need to declare the rule in calc_structure_rules section of the JSON decription file. We create a new section for logical rules in calc_structure_rules, called RuleL:

"calc_structure_rules" : {
    "RuleL" : { }
  }

Now add the folowing entry to RuleL:

"Or_L" : {
    "ascii" : "Or_L",
    "latex" : "\\vee L"
  }

This encodes the name of the rule and the sugar used for LaTeX typesetting of the rule label. The ASCII encodes the reserved name for the rule, used for generating the ASCII parser and to-string functions (the parser and the to-string functions are used to store and retrieve prooftrees generated by the calculus toolbox UI).

We can now proceed formalizing the rule itself by adding RuleL to the rule section of the JSON file and encoding Or_L. After adding the $\lor L$ rule, the JSON file should look something like this:

"rules" : {
    "RuleL" : {
      "Or_L" : ["(?X, ?Z), F?A \\/ F?B |- ?Y, ?W",  "?X, F?A |- ?Y", "?Z, F?B |- ?W"]
    }
  }

Note the specific bracketing around $(Γ, Σ)$ in the conclusion of the rule:

$\frac{Γ, A ⊢ Δ Σ, B ⊢ Π}{(Γ, Σ), A \lor B ⊢ Δ, Π} (\lor L)$

Because our formalization of the Sequent calculus uses trees to encode the formulas on either side of the turnstile, associativity has to be handled explicitly. This means that the bracketing of the term $Γ, Σ, A \lor B ⊢ Δ, Π$ needs to be disambiguated.
Working with trees of formulae rather than lists in our version of the Sequent calculus means bracketing the expression $Γ, Σ, Δ$ as $(Γ, Σ), Δ$ as opposed to $Γ, (Σ, Δ)$ produces different terms with different trees. Because of this, we need to introduce the following invertible/bi-directional associativity rules:

$\frac{Γ, (Σ, Δ) ⊢ Π}{(Γ, Σ), Δ ⊢ Π} (A_{L})$ $\frac{Γ ⊢ (Σ, Δ), Π}{Γ ⊢ Σ, (Δ, Π)} (A_{R})$

The rest of the logical rules were formalized in a similar fashion and can be found in the following section of the Sequent.json file.

Structural rules

We now turn our attention to the structural rules of the calculus. We deviate from the wikipedia formalization of the structural rules, relaxing them somewhat, by replacing the formulas $A$ and $B$ by structures, $Λ$ and $Φ$ . I will not discuss the motivation for this change in great detail here, but will dedicate a section on this later, where I will also show that these rules are interchangeable with the ones formalized in the wikipedia article. The short answer I will give now boils down to: it makes things easier and more consistent, as we have already introduced two structural rules $A_{L}$ and $A_{R}$ , which follow this convention.

The first structural rule we take a closer look at, is the $C L$ rule of the Sequent calculus:

$\frac{Γ, A, A ⊢ Δ}{Γ, A ⊢ Δ} (C L)$

Since we decided to allow replace the single formula $A$ with a structure, the rule becomes:

$\frac{Γ, Λ, Λ ⊢ Δ}{Γ, Λ ⊢ Δ} (C L)$

As with the $\lor L$ rule, the $C L$ rule contains an ambiguous case of bracketing in the term $Γ, Λ, Λ$ . The rule was encoded with the term bracketed in the following way: $Γ, (Λ, Λ)$ .

This bracketing will hopefully seem intuitive after looking at the parse trees, corresponding to the $C L$ rule, below.

code generation diagram $⟶$

The last rule we will look at in detail is the $P L$ rule, rewritten below, with $A$ and $B$ already replaced with the structural variables $Λ$ and $Φ$ .

$\frac{Γ_{1}, Λ, Φ, Γ_{2} ⊢ Δ}{Γ_{1}, Φ, Λ, Γ_{2} ⊢ Δ} (P L)$

Our well-bracketed version of this rule is the following one:

$\frac{(Γ_{1}, Λ), (Φ, Γ_{2}) ⊢ Δ}{(Γ_{1}, Φ), (Λ, Γ_{2}) ⊢ Δ} (P L)$

The intuition behind this bracketing comes from the way this rule manipulates the terms. The bracketing above makes the parse trees look symmetric, manipulating the two subtrees $Λ$ and $Φ$ at the same level in the tree:

code generation diagram $⟶$

Additional rules

As was mentioned earlier, due to the way we defined our structural terms, we need to add extra rules to our version of the calculus on top of the ones defined in the wikipedia article.

We have already seen two of these rules, the $A_{L}$ and $A_{R}$ . Next, we need to introduce rules for the structural connective $I$ .

At this point, you might say, hang on, what is $I$ ? Where was it introduced and defined? Well, the answer is that I snuck the $I$ in without telling you. The reason for the nullary connective $I$ is to simulate the empty list in a way. Let's say we have a sequent ' $⊢ A \lor \neg A$ ' seen in the section of the article showing some example derivations in the LK calculus.

At first sight, this sequent is not a valid term in our calculus, since the right hand side contains a structure whilst the left hand side appears to be empty. Now, according to our definition, a sequent consists of two structures, one on either side of the turnstile. However, if we look at the article's definition of a sequent, we can see that the turnstile separates sequences (or, as I prefer, lists) of formulas. Since a list can be empty, it is perfectly fine to have (an empty) one on either side of the turnstile. (Note to self: Does that mean that ' $⊢$ ' is technically a well formed, albeit slightly useless, term of the LK calculus?)

Since we do not have lists, but rather trees of formulas, we introduce the nullary $I$ to represent the notion of an empty tree rather than an empty list. We therefore have to modify the definition of structures to include the $I$ :

$S := F ∣ S, S ∣ I$

The sequent ' $⊢ A \lor \neg A$ ' , rewritten in our calculus thus becomes ' $I ⊢ A \lor \neg A$ '.

The following rules involving the $I$ were added to our calculus (all the rules below are reversible):

$\frac{Γ ⊢ Δ}{I, Γ ⊢ Δ} (I_{L L})$ $\frac{Γ ⊢ Δ}{Γ, I ⊢ Δ} (I_{L R})$ $\frac{Γ ⊢ Δ}{Γ ⊢ I, Δ} (I_{R L})$ $\frac{Γ ⊢ Δ}{Γ ⊢ Δ, I} (I_{L R})$

The motivation for the above rules comes from the following property of lists: $[] + A = A = A + []$ which states that the empty list is the neutral or the identity element which, if concatenated with a list $A$ , produces $A$ again.

To show why these rules are necessary, let's have a look at the proof of derivability of ' $⊢ A \lor \neg A$ ' in the sequent calculus. The fist step of the proof uses the $C R$ rule to duplicate the formula $A \lor \neg A$ on the right hand side:

$\frac{\frac{⋮}{⊢ A \lor \neg A, A \lor \neg A}}{⊢ A \lor \neg A} (C R)$

The $C R$ rule in the article calculus can be applied to a sequent of the shape $Γ ⊢ A, Δ$ . As we have already stated, this formalization uses lists so ' $⊢ A \lor \neg A$ ' can actually be rewritten as ' $[] ⊢ A \lor \neg A, []$ '. In order to be able to apply the $C R$ rule in our calculus, we have to do something similar:

$\frac{\frac{\frac{\frac{⋮}{I ⊢ A \lor \neg A, A \lor \neg A}}{I ⊢ (A \lor \neg A, A \lor \neg A), I}}{I ⊢ A \lor \neg A, I}}{I ⊢ A \lor \neg A}$

First, we introduce the $I$ on the right side of the right structure (rule $I_R R$ ), only then we can apply the $C R$ rule. Finally, we apply the reverse of the $I_{R R}$ rule to get rid of the $I$ on the right.

For now, we will omit the $C u t$ rule from our formalization and revisit it in the next section.

There is one final rule which is not really a rule of the calculus as such, but is needed internally by the UI for certain functionality, such as the proof search. This is the $P r e m$ rule, already found in the template calculus. We will therefore just leave it in our sequent calculus JSON file.

This concludes the first part of the tutorial. In part two, we will have a look at how to use the calculus description file we created (available here for reference) to compile a Sequent calculus toolbox.