Reasoning about software is hard. Reasoning about software written in multiple languages and communicating in a complex environment is even harder. The latter is exactly the topic of inquiry in Yu Zhang’s Unifying Compositional Verification and Certified Compilation with a Three-Dimensional Refinement Algebra. Despite the complicated name of the paper, it turns out that one can use UML sequence diagrams. Well, almost.
Modeling a map application with UML Sequence Diagrams
Consider a map application that allows users to search for transport routes using multiple transport providers.
Such app can interact with a user and transport providers in various ways; an example interaction is shown in the illustration below.
The map application may be implemented in many different ways: for example, the list of connections found so far may be stored in memory, in a file, or in a database; the application may also be designed to remember only the cheapest connection so far, if memory consumption is an issue.
But when talking about the correctness of such a map application, we may actually not care about these implementation details and consider only the application’s externally observable behavior.
We could represent this behavior as a collection of all possible sequence diagrams in which the application takes place.
That, however, would have two big problems: (1) we may not know in advance what other actors there may be in the world; and (2) we would have the information about other actors, which would be irrelevant. Instead, let us represent the application’s externally obserable behavior as the collection of fragments of these sequence diagrams, such that each fragment shows only the lifeline of map application and its interractions - without the lifelines of other actors - as in the following illustration.
A little detail is that not all collection of fragments are meaningful. In the paper, the authors put two restrictions on the collections: (1) prefix-closedness, and (2) internal determinism (as modelled by a coherence relation in the paper).
In the subsequent text, let’s have a textual notation for those fragments. A fragment that simply returns an answer for a question without communicating with other actors would be written as . A fragment that answers a question by first asking three other questions would be written as . A fragment that answers first question immediatelly, for second questions asks one question, and for third question asks two questions, would be written as . Notice how in the fragment, messages sent by the environment are interleaved with messages sent by the actor.
Modeling a Map Application, More Formally
In the referenced paper, fragments are called traces , and (reasonable) collections of traces are called strategies. To model such fragments/traces and collections/strategies, the authors use the notion of effect signature. An effect signature is a collection of questions together with a collection of possible answers for each question . An effect signature is typically written as the set
Each fragment/trace has associated two effect signatures, and , and shows how concretely an actor answers questions of the kind (coming from some other actor), while possibly asking questions and receiving answers from yet another actor(s). In our example, the map application handles questions from the user, and is allowed to ask questions to a bus and a train service provider. A strategy is then some collection of fragments/traces with the same associated effect signatures; we would say that is of type , written , whenever its fragments/traces answers using .
Continuing our example, we have an effect signature
where each question/answer is interpreted as “On day , for travel from to , we have connections , each for the price ” (where all the s are some global/nation-wide identifiers of connections).
Another effect signature would be
Then, using two services providers
we would have a strategy
implementing the map application.
But what is the sign there?
Fragments Can Be Composed
Given two effect signatures , the expression denotes a new, combined effect signature:
A fragment asking questions in this combined effect signature can basically ask questions from the first or the other signature; a fragment answering questions from the combined signature has to be able to answer questions from both signatures.
Also, one can easily implement using , by simply wrapping the questions and returning the answers, as in the trace . The result of this approach is the strategy , for . The authors come up with more strategy combinators, like
for flat composition, and
for layered composition, and
for iteration.
Concluding remarks
The paper also offers tools for reasoning of refinement between strategies, and more. The authors use an example of a multi-lingual program combining C and assembly code, communicating via shell pipe. Unfortunately, they do not make use of the UML Sequence Diagrams analogy, which made the paper harder to read, at least for me.