Abstract.
Cell Biology, the study of the morphological and functional organization of cells, is now
an established field in biochemical research. Computer Science can help the research in
Cell Biology in several ways. For instance, it can provide biologists with models and
formalisms able to describe and analyze complex systems such as cells. In the last few
years many formalisms, originally developed by computer scientists to model systems of
interacting components, have been applied to Biology. Among these, there are Petri Nets,
Hybrid Systems, and the pi-calculus. Moreover, formalisms such as P Systems, originally
developed to study new computational paradigms inspired by Biology, have recently found
application to the description of biological phenomena. Finally, some new formalisms have
been proposed to describe biomolecular and membrane interactions.
The first advantage of using formal models to describe biological systems is that they
avoid ambiguities. In fact, ambiguity is often a problem of the notations used by biologists.
Moreover, the formal modeling of biological systems allows the development of simulators,
which can be used to understand how the described system behaves in normal conditions,
and how it reacts to changes in the environment and to alterations of some of its components.
Furthermore, formal models allow the verification of properties of the described
systems, by means of tools (such as model checkers) which are well established and widely
used in other application fields of Computer Science, but unknown to biologists.
In this thesis we develop a formalism for the description of biological systems, called
Calculus of Looping Sequences (CLS), based on term rewriting and including some typical
features of process calculi for concurrency. What we want to achieve is a formalism
that allows describing proteins, DNA fragments, membranes and other macromolecules,
without ignoring the physical structure of these elements, and by keeping the syntax and
the semantics of the formalism as simple as possible.
CLS terms are constructed from an alphabet of basic symbols (representing simple
molecules) and include operators for the creation of sequences (representing proteins and
DNA fragments), of closed sequences which may contain something (representing membranes),
and of multisets of all these elements (representing juxtaposition). A CLS term
describes the structure of the system under study, and its evolution is given by the application
of rewrite rules describing the events that may occur in the system, and how
the system changes after the occurrence of one of these events. We equip CLS with an
operational semantics describing the possible evolutions of the system by means of application
of given rewrite rules, and we show that other formalisms for the description of
membranes can be encoded into CLS in a sound and complete way.
We propose bisimilarity as a tool to verify properties of the described systems. Bisimilarity
is widely accepted as the finest extensional behavioral equivalence one may want
to impose on systems. It may be used to verify a property of a system by assessing the
bisimilarity of the considered system with a system one knows to enjoy that property. To
define bisimilarity of systems, these must have semantics based on labeled transition relations
capturing potential external interactions between systems and their environment.
A labeled transition semantics for CLS is derived from rewrite rules by using as labels
contexts that would allow rules to be applied. We define bisimulation relations upon this
semantics, and we show them to be congruences with respect to the operators on terms.
In order to model quantitative aspects of biological systems, such as the the frequency
of a biological event, we develop a stochastic extension of CLS, called Stochastic CLS.
Rates are associated with rewrite rules in order to model the speeds of the described
activities. Therefore, transitions derived in Stochastic CLS are driven by exponential
distributions, whose rates are obtained from the rates of the applied rewrite rules and
characterize the stochastic behavior of the transitions. The choice of the next rule to be
applied and of the time of its application is based on the classical Gillespie’s algorithm for
simulation of chemical reactions.
Stochastic CLS can be used as a formal foundation for a stochastic simulator, but also
to build models to be given as an input to model checking tools. In fact, the transition
system obtained by the semantics of Stochastic CLS can be easily transformed into a
Continuous Time Markov Chain (CTMC). If the set of states of the CTMC is finite
(namely, if the set of reachable CLS terms is finite) a standard probabilistic model checker
(such as PRISM) can be used to verify properties of the described system.
Finally, we propose a translation of Kohn Molecular Interaction Maps (MIMs), a compact
graphical notation for biomolecular systems, into Stochastic CLS. By means of our
translation, a simulator of systems described with Stochastic CLS can be used to simulate
also systems described by using MIMs.
[
pdf]