Watts–Strogatz model

The Watts–Strogatz model is a random graph generation model that produces graphs with small-world properties, including short average path lengths and high clustering. It was proposed by Duncan J. Watts and Steven Strogatz in their article published in 1998 in the Nature scientific journal.^[1] The model also became known as the (Watts) beta model after Watts used $\beta$ to formulate it in his popular science book Six Degrees.

Rationale for the model edit

The formal study of random graphs dates back to the work of Paul Erdős and Alfréd Rényi.^[2] The graphs they considered, now known as the classical or Erdős–Rényi (ER) graphs, offer a simple and powerful model with many applications.

However the ER graphs do not have two important properties observed in many real-world networks:

They do not generate local clustering and triadic closures. Instead, because they have a constant, random, and independent probability of two nodes being connected, ER graphs have a low clustering coefficient.
They do not account for the formation of hubs. Formally, the degree distribution of ER graphs converges to a Poisson distribution, rather than a power law observed in many real-world, scale-free networks.^[3]

The Watts and Strogatz model was designed as the simplest possible model that addresses the first of the two limitations. It accounts for clustering while retaining the short average path lengths of the ER model. It does so by interpolating between a randomized structure close to ER graphs and a regular ring lattice. Consequently, the model is able to at least partially explain the "small-world" phenomena in a variety of networks, such as the power grid, neural network of C. elegans, networks of movie actors, or fat-metabolism communication in budding yeast.^[4]

Algorithm edit

Watts–Strogatz graph

Given the desired number of nodes $N$ , the mean degree $K$ (assumed to be an even integer), and a parameter $\beta$ , all satisfying $0\leq \beta \leq 1$ and $N\gg K\gg \ln N\gg 1$ , the model constructs an undirected graph with $N$ nodes and ${\frac {NK}{2}}$ edges in the following way:

Construct a regular ring lattice, a graph with $N$ nodes each connected to $K$ neighbors, $K/2$ on each side. That is, if the nodes are labeled $0\ldots {N-1}$ , there is an edge $(i,j)$ if and only if $0<|i-j|\ \mathrm {mod} \ \left(N-1-{\frac {K}{2}}\right)\leq {\frac {K}{2}}.$
For every node $i=0,\dots ,{N-1}$ take every edge connecting $i$ to its $K/2$ rightmost neighbors, that is every edge $(i,j)$ such that $0<(j-i)\ \mathrm {mod} \ N\leq K/2$ , and rewire it with probability $\beta$ . Rewiring is done by replacing $(i,j)$ with $(i,k)$ where $k$ is chosen uniformly at random from all possible nodes while avoiding self-loops ( $k\neq i$ ) and link duplication (there is no edge $(i,{k'})$ with $k'=k$ at this point in the algorithm).

Properties edit

The underlying lattice structure of the model produces a locally clustered network, while the randomly rewired links dramatically reduce the average path lengths. The algorithm introduces about $\beta {\frac {NK}{2}}$ of such non-lattice edges. Varying $\beta$ makes it possible to interpolate between a regular lattice ( $\beta =0$ ) and a structure close to an Erdős–Rényi random graph $G(N,p)$ with $p={\frac {K}{N-1}}$ at $\beta =1$ . It does not approach the actual ER model since every node will be connected to at least $K/2$ other nodes.

The three properties of interest are the average path length, the clustering coefficient, and the degree distribution.

Average path length edit

For a ring lattice, the average path length^[1] is $\ell (0)\approx N/2K\gg 1$ and scales linearly with the system size. In the limiting case of $\beta \rightarrow 1$ , the graph approaches a random graph with $\ell (1)\approx {\frac {\ln N}{\ln K}}$ , while not actually converging to it. In the intermediate region $0<\beta <1$ , the average path length falls very rapidly with increasing $\beta$ , quickly approaching its limiting value.

Clustering coefficient edit

For the ring lattice the clustering coefficient^[5] $C(0)={\frac {3(K-2)}{4(K-1)}}$ , and so tends to $3/4$ as $K$ grows, independently of the system size.^[6] In the limiting case of $\beta \rightarrow 1$ the clustering coefficient is of the same order as the clustering coefficient for classical random graphs, $C=K/(N-1)$ and is thus inversely proportional to the system size. In the intermediate region the clustering coefficient remains quite close to its value for the regular lattice, and only falls at relatively high $\beta$ . This results in a region where the average path length falls rapidly, but the clustering coefficient does not, explaining the "small-world" phenomenon.

If we use the Barrat and Weigt^[6] measure for clustering

C'(\beta )

defined as the fraction between the average number of edges between the neighbors of a node and the average number of possible edges between these neighbors, or, alternatively,

C'(\beta )\equiv {\frac {3\times {\text{number of triangles}}}{\text{number of connected triples}}}

then we get

C'(\beta )\sim C(0)(1-\beta )^{3}.

Degree distribution edit

The degree distribution in the case of the ring lattice is just a Dirac delta function centered at $K$ . The degree distribution for a large number of nodes and $0<\beta <1$ can be written as,^[6]

P(k)\approx \sum _{n=0}^{f(k,K)}{{K/2} \choose {n}}(1-\beta )^{n}\beta ^{K/2-n}{\frac {(\beta K/2)^{k-K/2-n}}{(k-K/2-n)!}}e^{-\beta K/2},

where $k_{i}$ is the number of edges that the $i^{\text{th}}$ node has or its degree. Here $k\geq K/2$ , and $f(k,K)=\min(k-K/2,K/2)$ . The shape of the degree distribution is similar to that of a random graph and has a pronounced peak at $k=K$ and decays exponentially for large $|k-K|$ . The topology of the network is relatively homogeneous, meaning that all nodes are of similar degree.

Limitations edit

The major limitation of the model is that it produces an unrealistic degree distribution. In contrast, real networks are often scale-free networks inhomogeneous in degree, having hubs and a scale-free degree distribution. Such networks are better described in that respect by the preferential attachment family of models, such as the Barabási–Albert (BA) model. (On the other hand, the Barabási–Albert model fails to produce the high levels of clustering seen in real networks, a shortcoming not shared by the Watts and Strogatz model. Thus, neither the Watts and Strogatz model nor the Barabási–Albert model should be viewed as fully realistic.)

The Watts and Strogatz model also implies a fixed number of nodes and thus cannot be used to model network growth.

References edit

^ ^a ^b Watts, D. J.; Strogatz, S. H. (1998). "Collective dynamics of 'small-world' networks" (PDF). Nature. 393 (6684): 440–442. Bibcode:1998Natur.393..440W. doi:10.1038/30918. PMID 9623998. S2CID 4429113. Archived (PDF) from the original on 2020-10-26. Retrieved 2018-05-18.
^ Erdos, P. (1960). "Publications Mathematicae 6, 290 (1959); P. Erdos, A. Renyi". Publ. Math. Inst. Hung. Acad. Sci. 5: 17.
^ Ravasz, E. (30 August 2002). "Hierarchical Organization of Modularity in Metabolic Networks". Science. 297 (5586): 1551–1555. arXiv:cond-mat/0209244. Bibcode:2002Sci...297.1551R. doi:10.1126/science.1073374. PMID 12202830. S2CID 14452443.
^ Al-Anzi, Bader; Arpp, Patrick; Gerges, Sherif; Ormerod, Christopher; Olsman, Noah; Zinn, Kai (2015). "Experimental and Computational Analysis of a Large Protein Network That Controls Fat Storage Reveals the Design Principles of a Signaling Network". PLOS Computational Biology. 11 (5): e1004264. Bibcode:2015PLSCB..11E4264A. doi:10.1371/journal.pcbi.1004264. PMC 4447291. PMID 26020510.
^ Albert, R., Barabási, A.-L. (2002). "Statistical mechanics of complex networks". Reviews of Modern Physics. 74 (1): 47–97. arXiv:cond-mat/0106096. Bibcode:2002RvMP...74...47A. doi:10.1103/RevModPhys.74.47. S2CID 60545.{{cite journal}}: CS1 maint: multiple names: authors list (link)
^ ^a ^b ^c Barrat, A.; Weigt, M. (2000). "On the properties of small-world network models". European Physical Journal B. 13 (3): 547–560. arXiv:cond-mat/9903411. doi:10.1007/s100510050067. S2CID 13483229.

[WS-1] Watts, D. J.; Strogatz, S. H. (1998). "Collective dynamics of 'small-world' networks" (PDF). Nature. 393 (6684): 440–442. Bibcode:1998Natur.393..440W. doi:10.1038/30918. PMID 9623998. S2CID 4429113. Archived (PDF) from the original on 2020-10-26. Retrieved 2018-05-18.

[Erdos1960-2] Erdos, P. (1960). "Publications Mathematicae 6, 290 (1959); P. Erdos, A. Renyi". Publ. Math. Inst. Hung. Acad. Sci. 5: 17.

[Ravasz2002-3] Ravasz, E. (30 August 2002). "Hierarchical Organization of Modularity in Metabolic Networks". Science. 297 (5586): 1551–1555. arXiv:cond-mat/0209244. Bibcode:2002Sci...297.1551R. doi:10.1126/science.1073374. PMID 12202830. S2CID 14452443.

[4] Al-Anzi, Bader; Arpp, Patrick; Gerges, Sherif; Ormerod, Christopher; Olsman, Noah; Zinn, Kai (2015). "Experimental and Computational Analysis of a Large Protein Network That Controls Fat Storage Reveals the Design Principles of a Signaling Network". PLOS Computational Biology. 11 (5): e1004264. Bibcode:2015PLSCB..11E4264A. doi:10.1371/journal.pcbi.1004264. PMC 4447291. PMID 26020510.

[AlbertBarabasi-5] Albert, R., Barabási, A.-L. (2002). "Statistical mechanics of complex networks". Reviews of Modern Physics. 74 (1): 47–97. arXiv:cond-mat/0106096. Bibcode:2002RvMP...74...47A. doi:10.1103/RevModPhys.74.47. S2CID 60545.{{cite journal}}: CS1 maint: multiple names: authors list (link)

[Barrat2000-6] Barrat, A.; Weigt, M. (2000). "On the properties of small-world network models". European Physical Journal B. 13 (3): 547–560. arXiv:cond-mat/9903411. doi:10.1007/s100510050067. S2CID 13483229.

[1]

[2]

[3]

[4]

[5]

[6]