arXiv:q-bio/0509005v2 [q-bio.MN] 10 Sep 2005
Networks of Mobile Elements for Biological
Systems
Kanchan Thadani ∗
Centre for Development of Advanced Computing
Pune University Campus
Pune 411007, Maharashtra, India
Ashutosh
Persistent Systems,
3-B Bhageerath, Senapati Bapat Road
Pune 411008, Maharashtra, India
Abstract
In this paper we present a network model to study the impact of spatial distribution of constituents, coupling between them and diffusive processes in the context
of biological situations. The model is in terms of network of mobile elements whose
internal dynamics is given by differential equations exhibiting switching and/or oscillatory behaviour. To make the model more consistent with the underlying biological phenomena we incorporate properties like growth and decay into the network.
Such a model exhibits a plethora of attributes which are interesting from both the
network theory perspective as well as from the point of view of bio-chemistry and
biology.
We characterise this network by calculating the usual network measures like network efficiency, entropy growth, vertex degree distribution, geodesic lengths, centrality as well as fractal dimensions and generalised entropy. It is seen that the
model can demonstrate the features of both scale free networks as well as small
worlds network in different parameter domains. The formation of coherent spatiotemporal patterns is another feature of such networks which makes them appealing
for understanding broad qualitative aspects of problems like cell differentiation (e.g.
morphogenesis) and synchronization (e.g. quorum sensing mechanisms).
One of the key features of any biological system is its response to external attacks.
The response of the network to various attack strategies(isolated and multiple) is
also studied.
Key words:
Networks, evolving cellular networks, diffusion, pathways
Preprint submitted to Elsevier Science
14 November 2018
1
Introduction
Biological systems exhibit tremendous complexity, versatility and robustness
in their responses to their environment. Though molecular biology has had
spectacular success it is becoming clearer that simple enumeration of genes,
proteins and metabolites is not sufficient to understand this complex behaviour. In recent time it has been shown that [Lee et.al,Milo et.al] to capture system level behaviour, it is instructive to search for patterns common
to complex systems and networks in general. For example, even if we could
calculate the physio-chemical properties of a protein from its structure, to obtain the physiological behaviour from these would be a daunting task without
the knowledge about the organisation of the network in which the proteins
operate. Networks have been discovered to be important at various levels in
biological systems [Hartwell et.al.]. These networks of interacting elements are
intrinsically dynamic and describe how the system changes in space and time
in response to the external stimuli, grows and reproduces, differentiates and
dies. Therefore there is a pressing need in biology to develop an alternative
language and methodology to deal with such situations.
“Perhaps a proper understanding of the complex regulatory networks making
up cellular systems like the cell cycle will require a shift from common sense
thinking. We might need to move into a strange more abstract world, more
readily analyzable in terms of mathematics than our present imaginings of
cells operating as a microcosm of our everyday world” [Nurse P.]
“The best test of our understanding of cells will be to make quantitative predictions about their behaviour and test them. This will require detailed simulations of the biochemical processes taking place within [cells]. We need to
develop simplifying, higher-level models and find general principles that will
allow us to grasp and manipulate the functions of [biochemical networks].”
[Hartwell et.al.]
This modelling of processes in cellular and molecular biology is based upon the
assumption that interactions between molecular components can be approximated by a network of biochemical reactions in an ideal macroscopic reactor.
Then following the standard methods of chemical reaction kinetics, ordinary
∗
Email addresses:
[email protected] (Kanchan Thadani),
ashutosh
[email protected] (Ashutosh).
2
differential equations are obtained. This modeling approach has been applied
to many systems.
However, there are a few considerations which are omitted in above approximation. Firstly, though a few studies consider the spatial aspects of the problem
(e.g. [Eldar et. al (2002)] on Drosophila melanogaster), most of the time they
are simply neglected. However, it is known that the cell is not a homogenous
well stirred reactor, but instead is a highly compartmentalized and heterogenous environment where phenomena like crowding or channeling are present
[Ellis E.(2001)]. In such a situation, the spatial aspects may well play a key
role and may need to be incorporated. Such spatial models are particularly
relevant for biological networks where the concentration and growth of signalling factors decays with the distance. Thus the probability of establishing
a connection with far off nodes is much smaller. Thus the nodes in such networks are more likely to be directly connected to the “nearby nodes” than in
the far off nodes. The spatial inhomogeneity observed in real life networks is
also indicative of the fact that the values in the network are not uniformly distributed but instead clustered together at various points in space. There may
also be bounds and limits in spatial model which forbid or allow connections
depending on a cut-off distance.
Another important consideration is that of mobility. In most of the studies
the elements are considered to be fixed in space and the structure of coupling
is fixed in time i.e. two elements that are initially coupled are coupled forever and they remain at the same point in space. Some models do consider
time dependent coupling coefficients [Shibata T. and Kaneko K.] though spatial positions are fixed. However, in real situations the elements move due to
diffusion or chemical gradients [Painter K.J and Hillen T.(2002)] and the local
interactions determine the connectivity of the element.
There is another way in which such models make restrictive assumptions in
context of biological systems. The number of nodes and degrees of freedom are
fixed initially and remain fixed. However most of the biological systems show
growth, reproduction and death. For example a cell multiplies and transfers
its attributes to its offspring and dies. There may be more than one pathway
to produce a protein and also more than one way it is degraded. The biological
elements also differentiate themselves to allow flexibility with respect to function [Huang S. and Ingber D.E. (2000)]. This differentiation and growth leads
to degrees of freedom no longer remaining fixed for a node and diversification
of the nature of the elements themselves.
Lastly, most of the models assume either a boolean function for evolution
[Shmulevich I. et.al. (2002)] or an arbitrary dynamical rule like dynamical
maps(e.g. logistic, circle or Cellular Automata update rules) or some convenient rule like piecewise linear differential equations [de Jong H. et. al.(2003)].
3
However, such a dynamical rule should have some correspondence(e.g. same
kind of non-linearity, continuity properties) to the underlying biological consideration so that it can reproduce behaviour more realistically. Ideally this
function should be inferred from experimental data or the detailed knowledge
of underlying kinetics. However, as an approximation, at least some motif biological evolution rules which are found in a number of contexts should be
used to model the broad qualitative features with a hope that they fall in the
same class of systems as the one being modelled and reproduce results with a
greater degree of faithfulness to reality.
Thus in light of the above discussion, there is a need to generalise the existing
network models of the biological systems. Here we attempt to relax all of the
assumptions made above and study the behaviour of such models.
The paper is organised as follows: In the next section we set up the network
and elaborate on its features. We also discuss the internal dynamics of the
nodes and various choices one can make. The section on analysis studies the
mathematical properties of the network and compares them with numerical
results. After that we consider various strategies of attack on the network to
determine its robustness. Finally we conclude with discussion and interpretation of results.
2
Materials and methods
In this section we set up the network as follows:
2.1 Defining Network Behaviour
A network consists of a set of connected nodes. Each node has an internal
state and an intra-dynamics which evolves the state according to a dynamical
system. The current state of the node is also dependent on the state of all
its neighbours by coupling. In the network that we set up, the number of
neighbours a node is coupled to is dependent on the dynamics of the network
(inter-dynamics). This is in contrast to the lattice models where the number of
neighbours is fixed a-priori. In such lattice models, one either couples to a fixed
number of nearest neighbours or has a global coupling to all the other nodes
in the network (mean-field models). We feel that in a number of biological
contexts all the above assumptions are a little too restricted. During the course
of time evolution, a given sub part of the system usually interacts with a
variable number of other elements [Shibata T. and Kaneko K.]. Furthermore
the network that we set up is an evolving one, where new nodes get added and
4
redundant nodes get removed. Therefore, the number of neighbours should be
dynamically changing.
Evolving networks have been studied in recent times in great detail
[Erdos et al(1961); Barabasi(2001); Watts(1998); Newman]. Typically the networks evolve by adding nodes to a random initial configuration (e.g. ErdosRenyi network model). The nodes are usally added by using different heuristic prescriptions. In particular, scale free networks are obtained by adding
new nodes based on the vertex degree distribution. These networks have been
subject of much study and debate [Albert, R. and Barabasi, A.-L. (2002). ]
and model many biological contexts. for example phenomena involving food
webs, biochemical interactions, protein-protein interaction maps for viruses,
prokaryotes and eukaryotes C.elegans etc are described by scale free networks.
A different prescription (e.g Watt’s Beta model) results in a small worlds network which is found to be useful in modeling co-operative behavior in biological systems [Erdos et al(1961); Barabasi(2001); Watts(1998); Newman].
However, due to finite size and limited data available in most realistic situations, it is difficult to establish scale free behavior of a network unambiguously [Albert, R. and Barabasi, A.-L. (2002). ]. Secondly, the scale free nature
is often inherent in the prescription to grow a network.
We take a different approach in the present work. Rather than starting from a
random network, we start with a network of few elements and then grow the
network by state-splitting [Lind D. and Marcus B. (1995)]. Such a growth law
does not have a built in prescription for power law behaviour. Thus any scale
free nature found is truly an emergent property rather than a consequence of
a heuristic for network growth.
Diffusion plays a major role in most of the biological processes. Thus any model
which describes a process involving spatial motion or distribution of biological
elements should take into account this important feature. The existing network
models try to incorporate this by varying the connections between the elements
dynamically [Shibata T. and Kaneko K.]. However the assumptions that are
made in these models are too restrictive in many situations.We describe these
assumptions and their possible relaxations below.
(1) State is mostly considered to be discrete. For example most commonly used network for modelling is a boolean network in which internal
state of the nodes is a discrete (0 or 1) state. This kind of model is,
at best, an approximation for modelling qualitative coarse grained behaviour of the system. Some of the limitations of a boolean approach are
the following : firstly since the internal state of an elemnent is evolving
in a discrete manner, it becomes diffficult to model situations which involve elements with different times scale being updated asynchronously.
Secondly, we can not have a coupling between elements which is weighted
5
by the continous value of the internal state. Such a coupling is desirable
particularly in bio-chemical reaction networks Thirdly, as a dynamical
system, the nodes can have more than one attractor i.e. more than one
possible final state depending on initial values. To model such situations
with boolean networks is cumbersome. On the other hands, using differential equation has advantages of preserving ”quantitativity and causality” [De Hoon M. et. al.(2003)] In more realistic models, the nodes have
a number of states possible. We assume the state variable describing the
internal state to be a continuous variable thus allowing more quantitative
approach.
(2) The internal state of the node does not affect the interactions in
the networks. In realistic situations this assumption is restrictive. This
assumption is actually inherent in the nature of the dynamical systems
models where couplings are usually considered as fixed. There are models
which consider variable topologies
[Erdos et al(1961); Barabasi(2001); Watts(1998); Newman]. Even in such
models the nodes are usually kept fixed and the connections that they
make are dynamically changing. However, in most of the situations the
elements move (either along a signal gradient or in a diffusive manner) in
a very heterogenous medium. Most of the interactions are also dependent
on the actual concentrations of active features of these elements. These
active features depend on the internal state of the node and change with
time. Thus the number of other nodes a given element would interact with
depends on the current values of its internal state as well as the current
values of internal states of the other elements. Therefore there is a need
for introducing an explicit coupling between the internal states and the
strength of interaction. We address this issue as follows: The nodes are
allowed to move according to either a diffusive motion or a gradient or
both. The interaction that takes place depends upon the number of other
nodes in its “sphere of influence” and the internal states of the interacting elements have an effect on each other. The mobility of the element is
dependent on its internal state and the region it finds itself in. Thus we
have an explicit coupling between the number of neighbours that a node
has and its internal state.
(3) The interaction range is independent of time. However, for biological systems one can not categorise them as having a fixed interaction
range but should have coupling terms which are changing with time. This
is because such systems usually have a number of spatio-temporal scales
with different elements interacting at different scales. This is modelled
in the current system by using a coupling strength that is dependent on
the distance between the interacting elements. This dependence can incorporated either by using a threshold value or by using an interaction
strength which decays with distance. Thus as an element moves around,
its interaction range with other elements is dynamically changing.
(4) The structure of interactions does not change with time i.e. if two
6
elements are coupled then they have no mechanism to decouple and two
decoupled elements do not couple. We allow for such a mechanism. If an
element is outside the sphere of influence its interaction diminishes and if
it moves within sphere of influence of other element it gets coupled. Thus
the overall network dynamics encompasses many coupling and decoupling
events and overall evolution of network travels through many structures
of interactions.
(5) Though there is a growth law (fixed by network evolution heuristic) there is usually no decay law. As mentioned earlier elements
can grow, reproduce and die in many situations of interest. The decay law, modelling the removal or death of elements in biology plays
an important role in determining behaviour. We incorporate these effects as follows: the growth is modelled by state splitting of the network
[Lind D. and Marcus B. (1995)]. Such a mechanism is natural for these
contexts. There are two kinds of state splitting that are considered. Insplitting in which the incoming degree of a node is split between the
node and its offspring and the outgoing degree is shared by both and
the out-splitting in which outgoing degree is split and incoming connections are shared. This is different from self-organised scale free network
in which nodes get all the degrees inherited. The decay is modelled as
following: once a node is out of the interaction range of all other nodes,
it decouples and is marked as dying. In such a state it would evolve as
an independent dynamical system and reach a attractor of its evolution
state and stay at that state without contributiong to the dynamics on the
network. However, since other nodes are also mobile it can happen that
another node with connections to the network comes within the sphere
of influence of this node and get coupled to it. Then the node is “healed”
and starts contributing to the network dynamics again. The nodes which
do not heal within a specified time are considered dead and are removed
from the system. Such a removal does not affect the dynamics of the rest
of the system and is analogous to how biological systems clear dead and
redundant parts.
(6) The elements are considered identical. We incorporate diversity and
differentiation of the elements in our system. Firstly, the network is made
up of a number of different types of elements which are representative of
various kinds of dynamics actually observed in biological networks (see
next section).
Thus our network is described by the following: We have a set of coupled nodes.
The number of nodes each element couples to is determined by its state and
its sphere of influence. Every node also has a variable defining its position
in space. This variable depends on the current state of the element, current
position of the element, current position of its neighbours and current state of
the neighbours. Thus the interaction structure of the network is dynamically
dependent on the states, positions and spheres of influence of the nodes.
7
The network grows by reproduction of its nodes. To model this growth we use
state splitting and to model decay we use the following rule: if two nodes are
greater than a certain distance apart then they are set to be decoupled. A
node which has been decoupled from every node is a “dying” node. However,
due to dynamic interactions, such a node may still be revived (“healed”). Any
dying node which does not heal for a specified period is considered “dead”
and removed from the network.
Thus the network is described by the following set of equations
Calculation of the state of a node.
N2t
N1t
Xt+1
= Xtj + (1/N1tj )
j
Xj
ǫin (i)f [i] (Xti ) − (1/N2tj )
i=1
Xj
ǫout (i)f [i] (Xti ) (1)
i=1
Calculation of the current position of a node.
N1t
rjt+1 = rjt +
Xj
i=1
ǫin (i)
q
Xi .Xj
|rit − rjt |
N2t
+
Xj
i=1
ǫout (i)
q
Xi .Xj
|rit − rjt |
(2)
Calculation of the number of in-neighbours and out-neighbours of a node.
N1t
N1t+1
=
j
Xj
2
θ(|rit
i=1
N2t
N2t+1
=
j
j
X
2 θ(|rit
i=1
− rjt |)
exp(−c/4)
− rjt |) exp(−c/4)
where
Xtj → The state vector for a node j at time t
rjt → The position of a node j at time t
N1tj → The number of in neighbours of a node j at time t
N2tj → The number of out neighbours of a node j at time t
ǫin (i) → The in coupling coefficient for node i
ǫout (i) → The out coupling coefficient for node i
8
(3)
(4)
c(R) → is a coefficient depending on radius of influence R with value between
0 and 1.
θ(|ri − rj |) = 0 if |ri − rj | > R
= 1 otherwise
(5)
2.2 Defining Node Behaviour
Every node has an internal state and an internal dynamics. For the internal
dynamics we choose some well known motifs which are frequently seen in dynamics of signalling and regulatory pathways in the cells [Tyson (2003)]. These
motifs, which behave as switches and oscillators, serve as building blocks for
the bigger, more complex networks. For example, consider a typical situation
The proteins that modulate Cdc2 activity are themselves modulated by
Cdc2:Cdc13, through a set of feedback loops
(1) Rum1 inhibits Cdc2:Cdc13, but Cdc2:Cdc13 phosphorylates Rum1, thereby
targeting Rum1 for degradation.
(2) Ste9:APC labels Cdc13 for degradation, but Cdc2:Cdc13 can phosphorylate Ste9, thereby downregulating its activity and targeting it for degradation.
(3) Wee1 phosphorylates and inactivates Cdc2:Cdc13, but, at the same time,
Cdc2:Cdc13 is trying to phosphorylate and inactivate Wee1.
(4) Cdc25 takes the inactivating phosphate group off PCdc2: Cdc13, and
Cdc2:Cdc13 returns the favor by phosphorylating and thereby activating
Cdc25.
(5) Slp1:APC, which also labels Cdc13 for degradation, is itself activated by
Cdc2:Cdc13 by an indirect pathway.
The first three feedback loops are examples of mutual antagonism. Under
appropriate conditions, the antagonists cannot coexist, i.e. the feedback loop
works like a toggle switch. Either Cdc2:Cdc13 has the upper hand and its
antagonist (Rum1 or Ste9 or Wee1) is suppressed, or vice versa. The fourth
interaction is a positive feedback loop: Cdc2 and Cdc25 activate each other in
a mutually amplifying fashion. The last interaction is a time-delayed negative
feedback loop, which, under appropriate conditions, can generate oscillations
(as Cdc2:Cdc13 concentration rises, it turns on Slp1, which targets Cdc13 for
degradation, causing Cdc2:Cdc13 concentration to fall, and Slp1 to turn off)
To study the dynamical consequences of these feedback loops, one must formulate these interactions as a precise molecular mechanism, convert the mechanism into a set of nonlinear ordinary differential equations, and study the
9
solutions of the differential equations by numerical simulation.
We can set up a network by randomly selecting these elements and connecting
them. However, for the present study we choose four elements behaving as
switches and two behaving as oscillators.
The equations for the switches and oscillators that we use are as follows
Perfectly Adapted Switch
Perfect adaptation means that the steady state response of the element is
independent of the signal strength. Such a behaviour is typical of chemotactic
systems (including human sense of smell) which initally responds to an abrupt
change in interacting attractants and then settles down into a steady response
[Painter K.J and Hillen T.(2002)], [Levchenko A, Iglesias P(2002)] have used
a mechanism of this sort to model phosphoinosityl signaling in slime mold
cells and neutrophils. The equations representing such a system are
dX1
= k 1 S − k 2 X1 X2
dt
dX2
= k 3 S − k 4 X1
dt
k1 = k2 = 2, k3 = k4 = 1, S = 1.2
(6)
In some situations, like Frog oocyte maturation in response to progesterone
and Apoptosis the switching is one way. At a critical signal strength we get
an irreversible transition to large response starting from small responses. The
transition can be activating or inhibitory giving rise to mutual activation or
inhibition
Mutual Activation Switch
The equations representing such a system are
dX1
= k0 E(X1 ) − k2 E(X1 )X1
dt
E(X1 ) = GBK(k3 X1 , k4 , J3 , J4 )
k0 = 0.4, k1 = 0.1, k2 = k3 = 1, k4 = 0.2,
J3 = J4 = 0.05, S = 9.0
(7)
The Goldbeter- Koshland function [Goldbeter A, Koshland DE(1981)] is graded
and reversible. By graded we mean that the response increases continuously
with signal strength. A slightly stronger signal gives a slightly stronger response. Reversible means that if we go from an initial value to a final value.
10
Although continuous and reversible, a sigmoidal response is abrupt. Tyson
[Tyson (2003)] compares this to buzzer or a laser pointer, to activate the response one must push hard enough on the button, and to sustain the response
one must keep pushing. When one lets up on the button, the response switches
off at precisely the same signal strength at which it switched on.
However, even in the presence of such a reversible term, the above switch
shows irreversible behaviour due to a feedback loop that is present.
Mutual Inhibition Switch
In this switch, if signal is decreased enough, the switch will go back to the
off-state. For intermediate stimulus strengths the response of the system can
be either small or large, depending on how S was changed. This switch shows
hystersis and is found in the lac operon in bacteria
[Laurent M and Kellershohn N (1999)], the activation of M-phase-promoting
factor (MPF) in frog egg extracts [Novak B and Tyson J(1993)], the autocatalytic conversion of normal prion protein to its pathogenic form
[Kellershohn N and Laurent M (2001)] and budding yeast cell cycle. This behaviour has been observed in a number of experiments The equations representing this system are
dX1
′
= k0 + k1 S − k2 X1 − k2 E(X1 )X1
dt
E(X1 ) = GBK(k3 , k4 X1 , J3 , J4 )
k0 = 0.0, k1 = 0.5, k2 = 0.1, k3 = 1, k4 = 0.2,
′
J3 = J4 = 0.05, k2 = 0.5, S = 10.0
(8)
Negative FeedBack Switch This type of regulation, commonly employed in
biosynthetic pathways, is called homeostasis.
The equations representing this system are
dX1
= k0 E(X1 ) − k2 SX1
dt
E(X1 ) = GBK(k3 , k4 X1 , J3 , J4 )
k0 = 1.0, k2 = 1.0, k3 = 0.5, k4 = 1.0,
J3 = J4 = 0.01, S = 0.1
(9)
Activator Inhibitor Oscillator
The classic example of an activator-inhibitor system is cyclic AMP production
in the slime mold, Dictyostelium discoideum [Martiel J and Goldbeter A(1987)].
11
External cAMP binds to a surface receptor, which stimulates adenylate cyclase to produce and excrete more cAMP. At the same time, cAMP-binding
pushes the receptor into an inactive form. After cAMP falls off, the inactive
form slowly recovers its ability to bind cAMP and stimulate adenylate cyclase
again. This mechanism lies behind all the curious properties of the cAMP
signaling system in Dictyostelium: oscillations, relay, adaptation, and wave
propagation.
The equations representing this system are
dX1
′
= k0 E(X1 ) + k1 S − k2 X1 − k2 X1 X2
dt
dX2
= k 5 X1 − k 6 X2
dt
E(X1 ) = GBK(k3 X1 , k4 , J3 , J4 )
(10)
′
k0 = 4.0, k1 = k2 = k2 = k3 = k4 = 1.0, k5 = 0.1, k6 = 0.075,
J3 = J4 = 0.3, S = 0.2
Substrate Depletion Oscillator
This describes MPF oscillations in frog egg extracts [Novak B and Tyson J(1993)].
MPF is a dimer of a kinase sub-unit, cyclin-dependent kinase 1 (Cdk1), and a
regulatory subunit, cyclin B. As cyclin B accumulates in the extract, it combines rapidly with Cdk1 (in excess). The dimer is immediately inactivated by
phosphorylation of the kinase subunit X can be converted into active MPF by
a phosphatase called Cdc25. Active MPF activates Cdc25 by phosphorylating
it.
The equations representing this system are
dX1
′
= k1 S − (k0 + k0 E(X2 )X1
dt
dX2
′
= (k0 + k0 E(X2 )X1 − k2 X1
dt
E(X1 ) = GBK(k3 X1 , k4 , J3 , J4 )
(11)
′
k0 = 0.01, k0 = 0.4, k1 = k2 = k3 = 1.0
k4 = 0.3, J3 = J4 = 0.05, S = 0.3
where the GBK(u, v, J, K) is the Goldbeter-Koshland term
GBK(u, v, J, K) =
2uk
v − u + vJ + uK +
12
q
v − u + vJ + uK 2 − 4(v − u)uK
(12)
Thus we can model a number of situations by coupling together these motifs.
In the current work, we choose a pair of any of these elements to represent
the types of nodes in our network. The parameters specified are fixed at these
values following [Tyson (2003)]. The nodes are evolved by integrating the equations using a fourth order Runge-Kutta routine for 100 iterates with time step
0.1 for the switches and 0.001 for oscillators.
2.3 Evolving the Network
We follow the algorithm outlined in flow chart of Fig. 1. The algorithm evolves
each node for a fixed time, updates the number of its neighbours Eq. (3), Eq.
(4), update the current state by coupling it to the neighbours by Eq. (1),
obtain the current position of the node by Eq. (2) and finally update the
network by splitting the in and out degree. This consitutes one iteration of
the network dynamics. We set the simulation as follows: we choose two of the
above six types of nodes and set them up according to an initial golden mean
network. The coupling coefficient ǫ for this particular simulation is considered
same for every node, though in principle the equations allow us to have a
different value of ǫ for every node. The offspring of each node is of the same
as parent. We choose all possible pairs of the node to do the simulation.
3
Characterisation and Analysis
To understand the behaviour of a network a number of characterizing quantities have been introduced in recent times
[Erdos et al(1961); Barabasi(2001); Watts(1998); Newman]. It is important to
choose the set of characterisers which reflect the essential features of the network with respect to the problems of interest. We classify the characterisers
as the following : Topological characterisers which contain the information
about the underlying topological structure of the connections in the network,Geometric characterisers , the growth or Entropic characterisers and
characterisers for the dynamics. These characterisers are listed in Table 1 for
three representative sets of parameters.
3.1 Topological characterisers
Degree Distributions
The degree is an important characteristic of a node. Based on the degree
13
of the nodes, it is possible to construct measurements for the network. One
of the simplest is the maximum degree which is simply the maximum connections that any node in the network has. The degree distribution, P (k)
which expresses the fraction of nodes in a network with degree k contains
more information. One may be interested in finding out if there is a correlation between the degrees of different nodes. Such correlations were found
to have an important role in many network structural and dynamical properties [Erdos et al(1961); Barabasi(2001); Watts(1998); Newman]. The most
common choice is to find correlations between two nodes connected by a link.
This correlation can be expressed by the joint degree distribution P (k, k ′), i.e.,
as the probability of a link connecting two nodes of degree k and k ′ . Another
way to express this is by giving the conditional probability that an arbitrary
neighbor of a node of degree k has degree k ′
P (kkk ′ ) =
P (k, k ′)
P (k)
(13)
This distribution gives a very detailed description of node degree correlations
but, for fat tailed degree distributions as in scale-free networks, it is diffcult to
evaluate experimentally, because of the poor statistics. A measure with better
statistics is to computing the mean degree of the neighbors of nodes with a
given degree, given by
knn (k) =
X
k ′ P (k ′|k)
(14)
k′
In the simulation we use Eq. (14) for calulating vertex degree distribution. The
vertex degree distribution can be calculated for the self degrees and neighbour
degrees separately. For a random or small worlds network, the vertex degree
distribution is a Poisson distribution in a large N limit with a peak at a
particular degree bein most prominent. In contrast, for the scale free network
of Barabasi and Albert, the degree distribution follows a power law for large
k. In the network model that we consider we can get both types of degree
distribution depending on parameter regimes. This is demonstrated in Figs
2(a) and 2(b). the Fig 2(a) shows a typical small worlds behaviour whereas
2(b) shows a power law like decay in large k region. Thus we can model
both kinds of phenomena with the same model. As one changes the coupling,
we get a transition from small worlds phase to a scale free phase. Further
investigations of this transition are in progress.
Clustering Coefficient
A characteristic of the Erdos-Renyi model is that the local structure of the
network near a node is a tree. More precisely, the probability of loops involving
14
a small number of nodes tends to 0 in the large network size limit. This is in
marked contrast with the profusion of short loops which shows up in many realworld networks. One way to characterise the presence of such loops is through
the clustering coefficient. Two different clustering coefficients are frequently
used. we can define a coefficient
3Nt
N3
X
Nt =
aij ajk aik
C
=
i,j,k
N3 =
X
aij ajk
i,j,k
(15)
This coefficient gives the ratio of number of triangles i.e. the triples which are
linked to each other to number of connected triples (i.e. three nodes connected
to each other directly or indirectly). If we denote by li the number of links
between neighbours of node i and ki is the number of neighbours of node i
then we can also write clustering coefficient as
2li
ki (ki − 1)
1 X
C =
Ci
N i
Ci =
(16)
The difference between the two definitions is that Eq. (15) gives the same
weight to each triangle in the network, while Eq. (16) gives the same weight
to each node, resulting in different values because nodes of higher degree are
possibly involved in a larger number of triangles than nodes of smaller degree.
In the simulation result we calculate the coefficient given by Eq. (16).
Centrality
Greater the number of paths in which a node or link takes part, the greater the
importance of this node or link for the network. Assuming that the interactions
follow the shortest paths between two nodes, it is possible to quantify the
importance of a node in this sense by its betweenness centrality
Bi =
X
jk
σ(j, i, k)
σ(j, k)
(17)
where σ(j, i, k) is the number of shortest paths between nodes j and k which
15
pass through node i and σ(j, k) is total number of paths between nodes j and
k. The sum is over all distinct pairs (j,k). Other centrality measures are given
by closeness centrality D , network centrality N and stress centrality S
Di
=
X
j
1
dij
N = (maxj dij )−1
S=
X
σ(j, i, k)
(18)
jk
where dij = shortest distance between i,j
3.2 Geometric characterisers
Geodesic Lengths
In the general case, two nodes of a complex network are not adjacent. In fact,
most of the networks of interest are sparse, in the sense that only a small
fraction of all possible links are present. Nevertheless, two non-adjacent nodes
i and j can be connected through a sequence of m links (i, k1), (k1, k2), . .
. , (km-1, j); such set of links is called a path between i and j, and m is the
length of the path. We say that two nodes are connected if there is at least
one path connecting them.
A geodesic path (or shortest path) between nodes i and j, is one of the paths
connecting these nodes with minimum length. The length of the geodesic paths
is the geodesic distance dij between nodes i and j. There can be more than
one geodesic paths between two nodes. We take the mean geodesic distance
as a characteriser of the network. The definition has a problem if the network
contains unconnected nodes therefore this measurement is only performed
after cleanup operation. The Dijkstra algorithm is used for calculating the
shortest path on the network.
Network Efficiency
Network efficiency measure the ease of communication on a network. This can
be defined as
ξ=
X
1
dij −1
N(N − 1) i6=j
(19)
This quantity is also useful in characterizing robustness of network to attack
16
(see below)
3.3 Entropic characterisers
Growth rate and Entropy
The entropy of a network can be calculated by using the following result
based on Frobenius - Perron theory which we state without proof. for proof
see [Lind D. and Marcus B. (1995)]
Theorem :
If G is an irreducible graph then h(XG ) = logλA(G) where h(XG ) is the entropy
of the shift dynamical system XG associated with the graph λA(G) is the Perron
eigenvalue of the graph( i.e. in this case the largest real eigenvalue of the
adjacency matrix).
In general, however the above result provides an upper bound to entropy and
we calculate this by diagonalising the adjacency matrix and finding the largest
eigen value. Since our network is an evolving hypergraph , we compute this
quantity at every time step and plot it with time in Fig 3. We find that
the entropic bound saturates to certain fixed value (see Table 1) for a set
of parameters and grows without bound for another set of parameters. This
means that the network attains certain optimal size for a set of parameter
values and has unrestrictive growth for another set. Continuously varying the
parameters we have a transition from restrictive to unrestricted growth.
Generalised Entropy
Our network starts with nodes having a degree <=2. the degrees are added by
splitting and by merger and removed by nodes dying. In the parameter regime
where we get scale free behaviour the degree distributions can be fitted by
P (≥ k)
= eq(k−2)/K
c
1
exqc = [1 + (1 − qc )x] 1−qc
f or
1 + (1 − qc )x
>0
(20)
where K is related to the the value qc which best fits the data. This quantity is
determined as follows: we take the logarithm of the probability for a series of
different values of qc ranging from -3 to 3 in steps of 0.1. The values obtained
from the q -exponential form are then fitted to data obtained from the model
17
by a linear fit. The value of q which gives the best fit is determined as value
qc
This form is taken from the formulation of a non-extensive genrealised entropic
function introduced by Tsallis [Gell-Mann M. and Tsallis C.(2004) ]
The entropic function
Sq ≡
1−
R∞
dk[p(k)]q
1−q
2
(21)
extremising this as described in [Gell-Mann M. and Tsallis C.(2004) ] gives
(2−q)
P (≥ K) = [1 − (1 − q)β(k − 2)] (1−q)
(22)
this equation is identical to Eq.(20) if we assume qc = 1/(2 − q) and K =
1/(2−q)β. Thus this non-extensive entropic function reproduces the behaviour
of probability distribution. Therefore, it is probable that the correct entropic
form for these models is non-extensive Tsallis entropy rather than by Shannon - Boltzmann -Gibbs entropy. The consequences of this conjecture require
further investigations.
4
Response to Attack
One of the key features of any network is its ability to withstand various kinds
of attacks. An attack or damage to a network typically hampers its performance which leads to a decrease in its efficiency of communication or transfer
of signals along various available paths in the network. Self organised networks have various responses depending on the types of attack. Needless to
say, this characteristic is of utmost importance for biological contexts where
the response to an external attack could determine the survival, recovery or
viability of a system. A number of attack strategies are usually considered
while evaluating the response. These include targeted attacks (where a particular node or connected segment is attacked based on some of its properties
like vertex degree or length), multiple attacks (where random nodes are targeted), isolated attack (where a single node is targeted) etc. In the current
simulation we consider isolated and multiple attacks. The measure of response
to attack is given by network efficiency as defined in Eq. (19). The response of
the network to these attacks is shown in the table where the average change
in network efficiency with the number of nodes attacked is shown. This quantity is computed by choosing many nodes either one at a time or in a bunch
18
and averaging over the decrease in network efficiency. As can be seen from
the table, the network is more susceptible to multiple random attacks rather
than to isolated single ones. This is a further manifestation of the fact that
the spatio-temporal patterns found in the network are robust with respect to
external perturbation such that removal of an element from a local pattern
does not impair the network as much as removal of multiple randomly placed
elements does. This is also indicative of the fact that there are more than one
signalling paths available in any local neighbourhood and unless one blocks a
large number of them the efficiency is not significantly impaired.
5
Conclusion
Therefore, in this paper we have proposed a generic model reproducing qualitative behaviour of collective biological systems. The network incorporates a
new feature of mobility which has been hitherto missing from such models.
This property is shown to be essential in reproducing behaviour in presence of
diffusion which many contexts show. The network goes beyond the assumption of boolean elements. It is shown that some of the shortcomings found in
boolean networks can be overcome by making the state of an element a continous variable. Prominient among such enhancements is an ability to model
highly heterogenous and diverse environment where the internal state determines and is determined by its surroundings, as in a cellular interaction mechanism. At the same time the network allows us to have a number of states
with switching attractors. Thus this model mimics the “robust yet fragile”
nature of biological systems.
This explicit coupling between intra and inter dynamics also gives an insight into several topological features of such networks. In particular, we have
demonstrated that starting from a particularly simple initial configuration and
without explicitly assuming any arbitrarily constructed rule for adding nodes,
we can obtain both scale free nature as well as networks containing important hubs. The latter being reminiscent of small world networks. We present
evidence for a transition between two regimes upon variation of parameters.
We also show that interesting cooperative phenomena like community structures and synchronisation emerge in these types of networks. This is clearly
seen in Fig. 4 showing the spatial profile of internal variables. In this figure the
regions of closely linked variations are distinctly separate from other regions.
To investigate robustness of such a network to external perturbations we
present a study of two different types of attack on the system. It is found
that the multiple random attack is more effective than an isolated one. In
current networks the reponse to the attack is reflected by a decrease in its
19
efficiency.
The growth and size of the network follows an entropic law which shows signatures of non-extensivity. This leads to the speculation that Tsallis statistics
is probably more relevant to biological networks.
Several of these interesting features need further exploration and investigation
which are currently under progress. However, in the present paper we only
report that many qualitative aspects of the diverse behaviour of cooperation in
biological systems can be modeled by a single generic network model. Various
aspects of this model can give us an insight into important questions like
quorum sensing in bacteria, synchronisation of biological elements, emergence
of heterogeniety and morphogeneis, tissue and cell growth, tumour growth,
protein interaction pathways and biochemial networks.
For the network theory, this paper presents a network which shows number
of different, well known network characteristics in a single model. Therefore,
this effort may help in unifying some of different pieces of knowledge which
we hope would be useful in understanding general networks better.
Both authors contributed equally to the work. Supplementary materials like
programs, data set and additional figures are freely available from the authors
upon request.
20
6
Figure Captions
(1) Figure 1: Figure 1 describes the flow of the network evolution. The
blocks in the figure describe how various stages of dynamics are handled
and how they depend on each other.
(2) Figure 2: Figure 2 describes the typical vertex degree distributions obtained. The figures 2(a) and 2(b) are generated by initially setting up a
golden mean network with two different kinds of node,namely the Mutual
activation switch Eq.(8) and the activator - inhibitor oscillator Eq.(11).
The network was evolved with value of R = 0.35, and the same values of
node parameters as in [Tyson (2003)]. The value of the coupling strength
ǫ was 0.66 for 2(a) and 0.033 for 2(b). Each node was evolve with respect
to internal dynamics by using a fourth order unge-Kutta method. The
step size for the switch was 0.1 and for the oscillator 0.001.
(3) Figure 3: Describes the entropic bound for growth as described in the
paper. Fig. 3(a) describes the behaviour where the growth of the network
saturates after some time and Fig. 3(b) where it grows without bounds.
(4) Figure 4: shows the patterns found in internal profile of the network at
various times (a)-(e). The presence of correlated patterns can be easily
seen from this figure.
References
[Lee et.al,Milo et.al] Lee T. et al.(2002) Science 298 799
Milo R. et.al(2002) Science 298 824
[Nurse P.] Nurse P.(2000) Cell 100 71
[Hartwell et.al.] Hartwell, L. H., Hopfield, J. J., Leibler, S. Murray, A.W. From
molecular to modular biology. Nature 402 (Suppl.), C47C52 (1999),
[Eldar et. al (2002)] Eldar A. et.al.(2002) Nature 419 304
[Ellis E.(2001)] Ellis E.J.(2001) Trends BioChem. Sci. 26 597
[Shibata T. and Kaneko K.] Shibata T. and Kaneko K. arXiv.org : nlin.AO
/0204024 and references therein
[Painter K.J and Hillen T.(2002)] Painter K.J. and Hillen(2002) T. Volume filling
and Quorum sensing in Models for Chemosensitive Movements Canadian Appl.
Math. Quaterly 10(4) 501 and references therein
[Huang S. and Ingber D.E. (2000)] Huang S. and Ingber D.E. Shape dependent
Control of Cell growth,Differentiation and Apoptosis: Switching Between
Attractors in Cell Regulatory Networks Exp.Cell Research 261 91-103 (2000).
21
[Shmulevich I. et.al. (2002)] Shmulevich
I.
et.al.
Gene
perturbation and Intervention in Probailistic boolean Networks Bioinformatics
18 1319-1331 (2002).
[Albert, R. and Barabasi, A.-L. (2002). ] Albert, R. and Barabasi, A.-L. (2002).
Statistical mechanics of complex networks. Rev. Mod. Phys. 74, p47-97 and
references therein.
[Lind D. and Marcus B. (1995)] Lind D. and Marcus B. an introduction to Symbolic
dynamics and Coding Cambridge university Press (1995)
[Erdos et al(1961); Barabasi(2001); Watts(1998); Newman] Erdos, P. and Renyi,
A., On the strength of connect- edness of a random graph, Acta Mathematica
Scientia Hungary 12, 261267 (1961).
Jeong, H., Mason, S., Barabasi, A.-L., and Oltvai, Z. N., Lethality and centrality
in protein networks, Nature 411, 4142 (2001).
Watts, D. J. and Strogatz, S. H., Collective dynamics of small-world networks,
Nature 393, 440442 (1998).
[De Hoon M. et. al.(2003)] De Hoon M. et. al.(2003) Proceedings of the Pacific
Symposium on Biocomputing, 8, 1728, 2003.
[de Jong H. et. al.(2003)] de Jong, H., Gouz J.-L., Hernandez, C., Page, M., Sari,
T., and Geiselmann, H. (2003). Qualitative simulation of genetic regulatory
networks using piecewise-linear models. J. Math. Biol, 66:301–340
[Levchenko A, Iglesias P(2002)] Levchenko A, Iglesias PA Models of eukaryotic
gradient sensing: application to chemotaxis of amoebae and neutrophils.
Biophys J 2002, 82 50-63.
[Goldbeter A, Koshland DE(1981)] Goldbeter A, Koshland DE(1981) An amplified
sensitivity arising from covalent modification in biological systems. Proc. Natl.
Acad. Sci.USA , 78 6840-6844.
[Laurent M and Kellershohn N (1999)]
Laurent M and Kellershohn N: Multistability: a major means of differentiation
and evolution in biological systems. Trends Biochem Sci 1999, 24418-422.
[Novak B and Tyson J(1993)] Novak B and Tyson J : Numerical analysis of a
comprehensive model of M-phase control in Xenopus oocyte extracts and intact
embryos. J. Cell. Sci. 1993, 106 1153-1168.
[Kellershohn N and Laurent M (2001)] Kellershohn N and Laurent M: Prion
diseases: dynamics of the infection and properties of the bistable transition.
Biophys J 2001, 81 2517-2529.
[Martiel J and Goldbeter A(1987)] Martiel J, Goldbeter A: A model based on
receptor desensitiztion for cyclic-AMP signaling in Dictyostelium cells. Biophys
J 52 808-828.
[Tyson (2003)] Tyson J. et.al (2003) Current Opinion in Cell Biology 15 :221 -231.
22
[Gell-Mann M. and Tsallis C.(2004) ] Gell-Mann M. and Tsallis C. (Editors),
Nonextensive Entropy Interdisciplinary Applications (Oxford University Press,
New York) 2004
23
Fig. 1.
24
0.0006
random
sw
0.0005
P(n)
0.0004
0.0003
0.0002
0.0001
0
0
1000
2000
3000
vertex degree (n)
4000
(a)
ln(P(k)
0.1
0.01
0.001
1
10
100
ln(Vertex Degree (k))
(b)
Fig. 2.
25
1000
7
entropic bound
6
5
4
3
2
1
10
20
30
40
50
iterations
60
70
(a)
10
9
entropic bound
8
7
6
5
4
3
2
1
10
15
20
25
30
iterations
(b)
Fig. 3.
26
35
40
45
50
Fig. 4.
27