Weak component

In graph theory, the weak components of a directed graph partition the vertices of the graph into subsets that are totally ordered by reachability. They form the finest partition of the set of vertices that is totally ordered in this way.

Definition edit

The weak components were defined in a 1972 paper by Ronald Graham, Donald Knuth, and (posthumously) Theodore Motzkin, by analogy to the strongly connected components of a directed graph, which form the finest possible partition of the graph's vertices into subsets that are partially ordered by reachability. Instead, they defined the weak components to be the finest partition of the vertices into subsets that are totally ordered by reachability.^[1]^[2]

In more detail, Knuth (2022) defines the weak components through a combination of four symmetric relations on the vertices of any directed graph, denoted here as $\Leftrightarrow$ , $\parallel$ , $\approx$ , and $\asymp$ :

For any two vertices $u$ and $v$ of the graph, $u\Leftrightarrow v$ if and only if each vertex is reachable from the other: there exist paths in the graph from $u$ to $v$ and from $v$ to $u$ . The $\Leftrightarrow$ relation is an equivalence relation, and its equivalence classes are used to define the strongly connected components of the graph.
For any two vertices $u$ and $v$ of the graph, $u\parallel v$ if and only if neither vertex is reachable from the other: there do not exist paths in the graph in either direction between $u$ and $v$ .
For any two vertices $u$ and $v$ of the graph, $u\approx v$ if and only if either $u\Leftrightarrow v$ or $u\parallel v$ . That is, there may be a two-way connection between these vertices, or they may be mutually unreachable, but they may not have a one-way connection.
The relation $\asymp$ is defined as the transitive closure of $\approx$ . That is, $u\asymp v$ when there is a sequence $u\approx \cdots \approx v$ of vertices, starting with $u$ and ending with $v$ , such that each consecutive pair in the sequence is related by $\approx$ .

Then $\asymp$ is an equivalence relation: every vertex is related to itself by $\asymp$ (because it can reach itself in both directions by paths of length zero), any two vertices that are related by $\asymp$ can be swapped for each other without changing this relation (because $\asymp$ is built out of the symmetric relations $\Leftrightarrow$ and $\parallel$ ), and $\asymp$ is a transitive relation (because it is a transitive closure). As with any equivalence relation, it can be used to partition the vertices of the graph into equivalence classes, subsets of the vertices such that two vertices are related by $\asymp$ if and only if they belong to the same equivalence class. These equivalence classes are the weak components of the given graph.^[2]

The original definition by Graham, Knuth, and Motzkin is equivalent but formulated somewhat differently. Given a directed graph $G$ , they first construct another graph ${\hat {G}}$ as the complement graph of the transitive closure of $G$ . As Tarjan (1974) describes, the edges in ${\hat {G}}$ represent non-paths, pairs of vertices that are not connected by a path in $G$ .^[3] Then, two vertices belong to the same weak component when either they belong to the same strongly connected component of $G$ or of ${\hat {G}}$ .^[1]^[3] As Graham, Knuth, and Motzkin show, this condition defines an equivalence relation,^[1] the same one defined above as $\asymp$ .^[4]

Corresponding to these definitions, a directed graph is called weakly connected if it has exactly one weak component. This means that its vertices cannot be partitioned into two subsets, such that all of the vertices in the first subset can reach all of the vertices in the second subset, but such that none of the vertices in the second subset can reach any of the vertices in the first subset. It differs from other notions of weak connectivity in the literature, such as connectivity and components in the underlying unconnected graph, for which Knuth suggests the alternative terminology undirected components.^[2]

Properties edit

If $X$ and $Y$ are two weak components of a directed graph, then either all vertices in $X$ can reach all vertices in $Y$ by paths in the graph, or all vertices in $Y$ can reach all vertices in $X$ . However, there cannot exist reachability relations in both directions between these two components. Therefore, we can define an ordering on the weak components, according to which $X<Y$ when all vertices in $X$ can reach all vertices in $Y$ . By definition, $X\nless X$ . This is an asymmetric relation (two elements can only be related in one direction, not the other) and it inherits the property of being a transitive relation from the transitivity of reachability. Therefore, it defines a total ordering on the weak components. It is the finest possible partition of the vertices into a totally ordered set of vertices consistent with reachability.^[1]

This ordering on the weak components can alternatively be interpreted as a weak ordering on the vertices themselves, with the property that when $u<v$ in the weak ordering, there necessarily exists a path from $u$ to $v$ , but not from $v$ to $u$ . However, this is not a complete characterization of this weak ordering, because two vertices $u$ and $v$ could have this same reachability ordering while belonging to the same weak component as each other.^[2]

Every weak component is a union of strongly connected components.^[2] If the strongly connected components of any given graph are contracted to single vertices, producing a directed acyclic graph (the condensation of the given graph), and then this condensation is topologically sorted, then each weak component necessarily appears as a consecutive subsequence of the topological order of the strong components.^[3]

Algorithms edit

An algorithm for computing the weak components of a given directed graph in linear time was described by Pacault (1974), and subsequently simplified by Tarjan (1974) and Knuth (2022).^[2]^[3]^[5] As Tarjan observes, Tarjan's strongly connected components algorithm based on depth-first search will output the strongly connected components in (the reverse of) a topologically sorted order. The algorithm for weak components generates the strongly connected components in this order, and maintains a partition of the components that have been generated so far into the weak components of their induced subgraph. After all components are generated, this partition will describe the weak components of the whole graph.^[2]^[3]

It is convenient to maintain the current partition into weak components in a stack, with each weak component maintaining additionally a list of its sources, strongly connected components that have no incoming edges from other strongly connected components in the same weak component, with the most recently generated source first. Each newly generated strongly connected component may form a new weak component on its own, or may end up merged with some of the previously constructed weak components near the top of the stack, the ones for which it cannot reach all sources.^[2]^[3]

Thus, the algorithm performs the following steps:^[2]^[3]

Initialize an empty stack of weak components, each associated with a list of its source components.
Use Tarjan's strongly connected components algorithm to generate the strongly connected components of the given graph in the reverse of a topological order. When each strongly connected component $S$ is generated, perform the following steps with it:
- While the stack is non-empty and $S$ has no edges to the top weak component of the stack, pop that component from the stack.
- If the stack is still non-empty, and some sources of its top weak component are not hit by edges from $S$ , again pop that component from the stack.
- Construct a new weak component $W$ , containing as sources $S$ and all of the unhit sources from the top component that was popped, and push $W$ onto the stack.

Each test for whether any edges from $S$ hit a weak component can be performed in constant time once we find an edge from $S$ to the most recently generated earlier strongly connected component, by comparing the target component of that edge to the first source of the second-to-top component on the stack.

References edit

^ ^a ^b ^c ^d Graham, R. L.; Knuth, D. E.; Motzkin, T. S. (1972), "Complements and transitive closures" (PDF), Discrete Mathematics, 2: 17–29, doi:10.1016/0012-365X(72)90057-X, MR 0323577
^ ^a ^b ^c ^d ^e ^f ^g ^h ⁱ Knuth, Donald E. (15 January 2022), "Weak components", The Art of Computer Programming, Volume 4, Pre-Fascicle 12A: Components and Traversal (PDF), pp. 11–14
^ ^a ^b ^c ^d ^e ^f ^g Tarjan, Robert Endre (July 1974), "A new algorithm for finding weak components", Information Processing Letters, 3 (1): 13–15, doi:10.1016/0020-0190(74)90040-4
^ Knuth (2022), Exercise 81, p. 21.
^ Pacault, Jean François (1974), "Computing the weak components of a directed graph", SIAM Journal on Computing, 3: 56–61, doi:10.1137/0203005, MR 0376418

[gkm-1] Graham, R. L.; Knuth, D. E.; Motzkin, T. S. (1972), "Complements and transitive closures" (PDF), Discrete Mathematics, 2: 17–29, doi:10.1016/0012-365X(72)90057-X, MR 0323577

[knuth-2] ^ ^a ^b ^c ^d ^e ^f ^g ^h ⁱ Knuth, Donald E. (15 January 2022), "Weak components", The Art of Computer Programming, Volume 4, Pre-Fascicle 12A: Components and Traversal (PDF), pp. 11–14

[tarjan-3] ^ ^a ^b ^c ^d ^e ^f ^g Tarjan, Robert Endre (July 1974), "A new algorithm for finding weak components", Information Processing Letters, 3 (1): 13–15, doi:10.1016/0020-0190(74)90040-4

[FOOTNOTEKnuth2022Exercise_81,_p._21-4] Knuth (2022), Exercise 81, p. 21.

[pacault-5] Pacault, Jean François (1974), "Computing the weak components of a directed graph", SIAM Journal on Computing, 3: 56–61, doi:10.1137/0203005, MR 0376418

[1]

[2]

[3]

[4]

[5]