Deterministic Automata

Formally, a deterministic automaton, denoted by G, is defined as a quintuple:

G = { X, E, f, x0, Xm }

where:

  • X is the set of states;

  • E is the finite set of events;

  • x0 is the initial state;

  • Xm (subset of X) is the set of marked (or final) states.

  • f : X x E -> X $ is the transition function. It defines the state transition in the occurrence of an event from E in the state X. In the special case of deterministic automata, the occurrence of the event in E in a state in X has a deterministic next state from X.

For example, a given automaton named ‘wip’ (wakeup in preemptive) can be defined as:

  • X = { preemptive, non_preemptive}

  • E = { preempt_enable, preempt_disable, sched_waking}

  • x0 = preemptive

  • Xm = {preemptive}

  • f =
    • f(preemptive, preempt_disable) = non_preemptive

    • f(non_preemptive, sched_waking) = non_preemptive

    • f(non_preemptive, preempt_enable) = preemptive

One of the benefits of this formal definition is that it can be presented in multiple formats. For example, using a graphical representation, using vertices (nodes) and edges, which is very intuitive for operating system practitioners, without any loss.

The previous ‘wip’ automaton can also be represented as:

                   preempt_enable
      +---------------------------------+
      v                                 |
    #============#  preempt_disable   +------------------+
--> H preemptive H -----------------> |  non_preemptive  |
    #============#                    +------------------+
                                        ^              |
                                        | sched_waking |
                                        +--------------+

Deterministic Automaton in C

In the paper “Efficient formal verification for the Linux kernel”, the authors present a simple way to represent an automaton in C that can be used as regular code in the Linux kernel.

For example, the ‘wip’ automata can be presented as (augmented with comments):

/* enum representation of X (set of states) to be used as index */
enum states {
      preemptive = 0,
      non_preemptive,
      state_max
};

#define INVALID_STATE state_max

/* enum representation of E (set of events) to be used as index */
enum events {
      preempt_disable = 0,
      preempt_enable,
      sched_waking,
      event_max
};

struct automaton {
      char *state_names[state_max];                   // X: the set of states
      char *event_names[event_max];                   // E: the finite set of events
      unsigned char function[state_max][event_max];   // f: transition function
      unsigned char initial_state;                    // x_0: the initial state
      bool final_states[state_max];                   // X_m: the set of marked states
};

struct automaton aut = {
      .state_names = {
              "preemptive",
              "non_preemptive"
      },
      .event_names = {
              "preempt_disable",
              "preempt_enable",
              "sched_waking"
      },
      .function = {
              { non_preemptive,  INVALID_STATE,  INVALID_STATE },
              {  INVALID_STATE,     preemptive, non_preemptive },
      },
      .initial_state = preemptive,
      .final_states = { 1, 0 },
};

The transition function is represented as a matrix of states (lines) and events (columns), and so the function f : X x E -> X can be solved in O(1). For example:

next_state = automaton_wip.function[curr_state][event];

Graphviz .dot format

The Graphviz open-source tool can produce the graphical representation of an automaton using the (textual) DOT language as the source code. The DOT format is widely used and can be converted to many other formats.

For example, this is the ‘wip’ model in DOT:

digraph state_automaton {
      {node [shape = circle] "non_preemptive"};
      {node [shape = plaintext, style=invis, label=""] "__init_preemptive"};
      {node [shape = doublecircle] "preemptive"};
      {node [shape = circle] "preemptive"};
      "__init_preemptive" -> "preemptive";
      "non_preemptive" [label = "non_preemptive"];
      "non_preemptive" -> "non_preemptive" [ label = "sched_waking" ];
      "non_preemptive" -> "preemptive" [ label = "preempt_enable" ];
      "preemptive" [label = "preemptive"];
      "preemptive" -> "non_preemptive" [ label = "preempt_disable" ];
      { rank = min ;
              "__init_preemptive";
              "preemptive";
      }
}

This DOT format can be transformed into a bitmap or vectorial image using the dot utility, or into an ASCII art using graph-easy. For instance:

$ dot -Tsvg -o wip.svg wip.dot
$ graph-easy wip.dot > wip.txt

dot2c

dot2c is a utility that can parse a .dot file containing an automaton as in the example above and automatically convert it to the C representation presented in [3].

For example, having the previous ‘wip’ model into a file named ‘wip.dot’, the following command will transform the .dot file into the C representation (previously shown) in the ‘wip.h’ file:

$ dot2c wip.dot > wip.h

The ‘wip.h’ content is the code sample in section ‘Deterministic Automaton in C’.

Remarks

The automata formalism allows modeling discrete event systems (DES) in multiple formats, suitable for different applications/users.

For example, the formal description using set theory is better suitable for automata operations, while the graphical format for human interpretation; and computer languages for machine execution.

References

Many textbooks cover automata formalism. For a brief introduction see:

O'Regan, Gerard. Concise guide to software engineering. Springer,
Cham, 2017.

For a detailed description, including operations, and application on Discrete Event Systems (DES), see:

Cassandras, Christos G., and Stephane Lafortune, eds. Introduction to discrete
event systems. Boston, MA: Springer US, 2008.

For the C representation in kernel, see:

De Oliveira, Daniel Bristot; Cucinotta, Tommaso; De Oliveira, Romulo
Silva. Efficient formal verification for the Linux kernel. In:
International Conference on Software Engineering and Formal Methods.
Springer, Cham, 2019. p. 315-332.