Complex Networks
Complex Networks
Complex Networks
an introduction
Alain Barrat
CPT, Marseille, France
ISI, Turin, Italy
http://www.cpt.univ-mrs.fr/~barrat
http://cxnets.googlepages.com
http://www.sociopatterns.org
REVIEWS:
Networks, An introduction
M. Newman
4
•Statistical mechanics of complex networks, edited by R.
Pastor-Satorras, J.-M. Rubí, A. Diaz-Guilera. Springer,
2003.
I. INTRODUCTION
I. Networks: definitions, statistical characterization, examples
II. Modeling frameworks
II. DYNAMICAL PROCESSES
I. Resilience, vulnerability
II. Random walks
III. Epidemic processes
IV. Social phenomena
V. Some perspectives
What is a network
Network=set V of nodes joined by links (set E)
L. Euler:
Can one walk across the seven bridges and
never cross the same bridge twice?
1735: Leonhard Euler’s theorem:
(b) If a graph is connected and has no odd degree nodes, it has at least one path.
11
Metabolic Network Protein Interactions
N. Martinez, R. Williams
Scientific collaboration network
Weights: depending on
•number of co-authored papers
•number of authors of each paper
•number of citations…
City i
City a
City j
Internet
Graph representation
different
granularities
Internet mapping
•continuously evolving and growing
•intrinsic heterogeneity
•self-organizing
Largely unknown topology/properties
Mapping projects:
•Multi-probe reconstruction (router-level): traceroute
•Use of BGP tables for the Autonomous System level (domains)
23
The World-Wide-Web
Virtual network to find and share informations
•web pages
•hyperlinks
Online (virtual) social networks
Online (virtual) social networks
Online (virtual) social networks
Interdisciplinary science
Science of complex networks:
-graph theory
-sociology
-communication science
-biology
-physics
-computer science
Interdisciplinary science
Science of complex networks:
• Empirics
• Characterization
• Modeling
• Dynamical processes
Networks characteristics
Networks: of very different origins
• Undirected
• Directed
• Weighted
• Multi-relational
• Bipartite
Graph theory: basics
G=(V,E) ; |V|=N
Complete graph:
• Adjacency matrix
Adjacency matrix
N nodes i=1,…,N
aij= 1 if (i,j) ∈ E
0 if (i,j) ∉ E
0 1 2 3 0 1
0 0 1 1 1
1 1 0 1 1 2
2 1 1 0 1
3
3 1 1 1 0
Adjacency matrix
N nodes i=1,…,N
0 1 2 3 0 1
0 0 1 0 0
1 1 0 1 1 2
2 0 1 0 1
3
3 0 1 1 0
Adjacency matrix
N nodes i=1,…,N
(Non symmetric
wij= ≠ 0 if (i,j) ∈ E
for directed networks)
0 if (i,j) ∉ E
0
0 1 2 3 1
0 0 2 0 10
1 0 0 0 0 2
2 0 5 0 0
3
3 0 1 2 0
Sparse graphs
Density of a graph D=|E|/(N(N-1)/2)
Number of edges
D=
Maximal number of edges
39
Node characteristics
• Degree=number of neighbours=∑j aij
ki=5
i
NB: in a sparse graph we expect ki << N
0 1 2 3 0 1
0 0 1 0 0
1 1 0 1 1 2
i 2 0 1 0 1
3
3 0 1 1 0
Node characteristics
• Degree in directed graphs:
– in-degree= number of in-neighbours=∑j aji
0
0 1 2 3 1
0 0 1 0 1
1 0 0 0 0 2
2 0 1 0 0
3
3 0 1 1 0
Node characteristics
• Weighted graphs: Strength si = ∑j wij
• Directed Weighted graphs:
–in-strength si = ∑j wji
–out-strength si = ∑j wij
0
0 1 2 3 1
0 0 2 0 10
1 0 0 0 0 2
2 0 5 0 0
3
3 0 1 2 0
Paths, connectedness,
small-world effect
43
Paths
G=(V,E)
Path of length n = ordered collection of
• n+1 vertices i0,i1,…,in ∈ V
• n edges (i0,i1), (i1,i2)…,(in-1,in) ∈ E
i3
i4
i5
i0 i1 i2
is connected
Tendrils
Tube Tendril
Shortest paths
Shortest path between i and j: minimum number
of traversed edges
distance l(i,j)=minimum
j
number of edges traversed
on a path between i and j
i
SMALL-WORLD CHARACTER
Social networks as small-worlds:
Milgram’s experiment, revisited
Dodds et al., Science 301, 827 (2003)
email chains
51
Small-world properties
Average number of nodes
within a chemical distance l
Scientific collaborations
Internet
The intuition behind the small-world effect
versus
54
Structure of neighborhoods
k
Clustering coefficient of a node
3
# of links between 1,2,…n neighbors
C(i) =
i 2
k(k-1)/2
2
1
58
Ranking nodes
59
Centrality measures
How to quantify the importance of a node?
• Degree=number of neighbours=∑j aij
ki=5
i
• Closeness centrality
gi= 1 / ∑j l(i,j)
61
Betweenness centrality
for each pair of nodes (l,m) in the graph, there are
σlm shortest paths between l and m
σilm shortest paths going through i
bi is the sum of σilm / σlm over all pairs (l,m)
path-based quantity
i
bi is large
j bj is small
non-local quantity
“hard” to compute
64
Statistical characterization
Degree distribution
•Distribution: 1 2 3 4 k
Broad degree
distributions
(often: power-law tails
P(k) ∝ k-γ ,
typically 2< γ <3)
No particular
Internet characteristic scale
Unbounded fluctuations
Topological heterogeneity
Statistical analysis of centrality measures:
linear scale
Poisson
vs.
Power-law
log-scale
Exp. vs. Scale-Free
Poisson distribution Power-law distribution
Level of heterogeneity:
Other heterogeneity levels
Weights
Strengths
Main things to (immediately) measure
in a network
• Degree distribution
• Clustering coefficient
• (Weights/strengths distributions)
75
Networks characteristics
Most often:
• Small diameter
• Large cohesiveness (clustering)
• Heterogeneities
Of course, this is not
the whole story...
77
Correlations
78
Statistical characterization
Multipoint degree correlations
Simplest case:
P(k’|k): conditional probability that a node of degree k is
connected to a node of degree k’
k=4
k=4
i
Correlation spectrum:
putting together nodes which
have the same degree
class of degree k
Statistical characterization
case of random uncorrelated networks
P(k’|k)
• independent of k
• proba that an edge points to a node of degree k’
number of edges from nodes of degree k’
number of edges from nodes of any degree
Clustering spectrum:
putting together nodes which
have the same degree
class of degree k
Empirical clustering and
correlations
non-trivial
structures
No special
scale
Correlations in weighted
networks
Weighted assortativity
5 5
1
5
i
5
ki=5; knn,i=1.8
Weighted assortativity
1 1
5
1
i
1
ki=5; knn,i=1.8
Weighted assortativity
5 5
1
5
i
5
1 1
5
1
i
1
92
Assortativity and weighted
assortativity
Airports' network
i i
wij=1
wij=5
si=16 si=8
ki=4
ciw=0.625 > ci ciw=0.25 < ci
ci=0.5
Weighted clustering coefficient
k
Average clustering coefficient
(wjk) C=∑i C(i)/N
wik
j Cw=∑i Cw(i)/N
i wij
Random(ized) weights: C = Cw
C < Cw : more weights on cliques
C > Cw : less weights on cliques
Clustering spectra
Examples:
97
Clustering and weighted clustering
Scientific collaborations: C= 0.65, Cw ~ C
C(k) ~ Cw(k) at small k, C(k) < Cw(k) at large k: larger weights on cliques
Clustering and weighted clustering
Airports' network: C= 0.53, Cw=1.1 C
101
Weighted graphs:
• Strength si = ∑j wij
• Degree ki = ∑j aij
103
Strengths and degrees
104
Strengths vs degrees
Collaboration network
105
Strengths vs degrees
Air transportation network
1-core
shell index 1
shell index 2
2-core
3-core shell index 3
k-core analysis
maximal shell index
size of shells versus shell index
structure of the successive cores:
degree distribution, correlations, clustering...
Remarks
Additional investigation tool
Easy to construct
Visualization tool
Analysis of AS Internet maps
Analysis of Internet maps
Degree distributions
Analysis of Internet maps
Clustering and assortativity properties
Subgraphs,
communities
Subgraphs
Examples:
Communities and community detection
Communities: examples
Communities: examples
Social network
(Zachary’s karate club)
Communities: examples
Group of nodes
that are more tightly linked together
than with the rest of the graph
Why are communities interesting?
Group of nodes
that are more tightly linked together
than with the rest of the graph
Dendogram
Communities: detection problem
Quality function
http://www.cfinder.org/
http://www.oslom.org/
http://www.tp.umu.se/~rosvall/code.html
For a review
S. Fortunato, Phys. Rep. 486, 75-174, 2010
(http://sites.google.com/site/santofortunato/)
Modularity resolution limit
140
Complex networks
Complex is not just “complicated”
I. INTRODUCTION
I. Networks: definitions, statistical characterization, examples
II. Modeling frameworks
II. DYNAMICAL PROCESSES
I. Resilience, vulnerability
II. Random walks
III. Epidemic processes
IV. Social phenomena
V. Some perspectives