You are currently browsing the category archive for the 'theoretical computer science' category.
Me (and some colleagues of mine in Pisa) have selected 3 long term research problems which could become, one day, a good (hopefully) starting point as a phd thesis. All this stuff is realeted to systems biology which is, accordingly to wikipedia,
“SB is a relatively new biological study field that focuses on the systematic study of complex interactions in biological systems, thus using a new perspective (integration instead of reduction) to study them. Particularly from year 2000 onwards, the term is used widely in the biosciences, and in a variety of contexts. Because the scientific method has been used primarily toward reductionism, one of the goals of systems biology is to discover new emergent properties that may arise from the systemic view used by this discipline in order to understand better the entirety of processes that happen in a biological system.”
Here is the list of topics:
-
Kohn interaction maps[1999]: Eventually to understand the integrated function of the cell cycle regulatory network, we must organize the known interactions in the form of a diagram, map, and/or database. A diagram convention was designed capable of unambiguous representation of networks containing multiprotein complexes, protein modifications, and enzymes that are substrates of other enzymes. To facilitate linkage to a database, each molecular species is symbolically represented only once in each diagram. Molecular species can be located on the map by means of indexed grid coordinates. Each interaction is referenced to an annotation list where pertinent information and references can be found. Parts of the network are grouped into functional subsystems. The map shows how multiprotein complexes could assemble and function at gene promoter sites and at sites of DNA damage. It also portrays the richness of connections between the p53-Mdm2 subsystem and other parts of the network. TODO: Formal defintion of their semantics (formal, complete, not ambiguos, ….
-
P Systems[1998]: A computational model based upon the architecture of a biological cell, abstracting from the way in which chemicals interact and cross cell membranes. This concept was first introduced in a 1998 report by the computer scientist Gheorghe Paun, whose last name is the origin of the letter P in ‘P Systems’. Variations on the P system model led to the formation of a branch of research known as ‘membrane computing.’ Although inspired by biological processes, the research interest regarding P systems is mainly concerned with their use as a computational model, rather than for biological modelling, although this is also being investigated. TODO: definition of more expressive variants of P Systems (space topology, time, ….)
-
Tumor immune system[XXX]: Occasionally, though, even though cells are changing from normal to abnormal, they may still appear to be normal. Their outer appearance (proteins and other molecules on their surface) may look unchanged, even though profound changes may be happening on the inside. In this way, these abnormal cells manage to escape attack by the immune system and grow and multiply without triggering an immune response. This is how it’s possible for a tumor to form, even when your immune system is working normally. Eventually, however, the tumor becomes so altered and threatening that it can no longer hide its malignant character. The immune system is no longer fooled into recognizing these cells as normal, and launches its attack. The attack may succeed, or it may come too late: the tumor may be beyond the power of the immune system by itself. The immune system may need help—bold measures such as: immune growth factors(medicines that stimulate the production of new immune cells); antibody medications(special antibodies made in a laboratory, designed to target a specific antigen on a cancer cell); vaccines(agents that stimulate your immune system to fight back, giving it a wake-up call to action). TODO: A quantitative approach based on systems biology to study the problem of interaction between the tumor and the immune system.
These sumup of the 3 topics are taken from the web. The avalaible literature is huge and, as always, someone else already did something… I am going to have, in these short period of time, a quick look at all these topics. I found interesting references and I am going to go into deep with the ones more involving…
Just today I created this blog and consequently I am going to publish this nice piece of news… I will have a paper published in the TCS journal (Theoretical Computer Science, Elsevier). I mean, for those of you used to it, this seems to be normal (“embe??” in italian) but for me is a pleasure, a kind of a positive acknowledgment for my work. It is my third publication, i don’t care (by now) about the unknown impact factor, the relevance, the …, the…, I am just proud of that. Of course this would not have been possible without the help, write ”help” but read “substantial help” of my co-authors (Paolo Milazzo in primis…).
Lets discuss now about the work….. the title is “An Intermediate Language for the Stochastic Simulation of Biological Systems” and the authors, a part from me, are P.Milazzo (a research fellow at my CS Department), A.Maggiolo-Schettini and R.Barbuti (full professors at the same depatment of me). This work started from some ideas I developed during my master of science thesis (summer 2007) and finished before I started my phd (january 2008). The starting ideas have been published in a workshop, my first workshop, From Biology To Concurrency and Back (FBTC07 satellite event at CONCUR07), held in Lisbon, Portugal. In that paper we defined almost the same language without a formal stochastic semantics, so it was no suitable to be implemented. In this paper, which will be published in the special issue of TCS for the FBTC workshop, we added such a semantics… and now a bit of details about the work…
Firstly, why an intermediate language? This is quite easy to be explained if you have a bit of experience in writing simulation software based on formal languages in systems biology. This is, in fact, not a trivial issue and, generally, hides many difficulties (data structures, efficent algorithms, ….). This language, which you will see uses simple data structures, should (I hope
), permit the implementation of a very efficient simulator. Furthermore, there exist many languages and many simulation softwares developed within systems biology, so defining an intermediate language would be usefull to avoid programming too much other than a compiler.
Some techincalities: we started from Multiset Rewriting, namely a multiset of elements and a set of rewriting rules. For instance, a multiset M containing molecules (a chemical solution in other words) A,A,A,B,B,C and a set of rules. Rule 1 replaces two molecules A with a molecule C, and rule 2 replaces two molucels B and one molecule C with two molecules A, namely (1) A|A->C and (2) C|C|B->AA. It is trivial to understand that the rules can be applied, in sequence, to the multiset M yielding to the multiset A|A|A|B|B|C–>C|B|B|C–>B|A|A. We enriched this formalism with a stochastic semantics, namely each rule can have a kinetic constant which represent the rate of the reaction. Furthermore, with respect to the kinetic theory of chemical reactions, each rule, when applied, will have a quickness. This allowed us to define a stochastic semantics based on the Gillespie’s Algorithm (SSA), a well-known stochastic framework for the simulation of biological systems. I will not go into details (now) with this semantics however, it is quite easy to understand the limits of this language. It is able to describe finite multisets (finite with respect to the molecules they store) and, from a theoretical perspective, it is not Turing complete (do you see the not-so-hidden Petri Net??). So we decided to define more expressive multisets, we added strings (modifiable) to the multisets. Of course string should be able to represent almost any interesting biological structure (DNA strands, preoteins, membranes, …). We enriched also rewriting rules, adding variables of many kinds and some novel operators in order to express logical propositions. Of course we added kinetic constants and defined the stochastic semantics of this new formalism, the Stochastic String Multiset Rewrting. Consequently, the state of a computation is represented by a multiset of strings which represent the encoded biological systems. The events that may happen in the biological systems are described by the mentioned stochastic rewriting rules.
As the paper proposes an intermediate language, then we defined also the encoding of two higher level formalisms, the Stochastic Calculus of Looping Sequences and the Sotchastic \pi-calculus. Why these langauges?? Simply becasue they are base on different theoretical frameworks. The former is based on term rewriting systems while, the latter, is based on process algebras. The latter is also a very well known language in the world of systems biology.
For the purists of proofs there are more then 15 pages of proofs for both soundness and completeness of the encodings. These proofs are big-but-boring ones ahahahahah
The paper can be downloaded from my www page. Have fun and comments….

Recent Comments