# Set packing

Set packing is a classical NP-complete problem in computational complexity theory and combinatorics, and was one of Karp's 21 NP-complete problems.

Suppose one has a finite set S and a list of subsets of S. Then, the set packing problem asks if some k subsets in the list are pairwise disjoint (in other words, no two of them share an element).

More formally, given a universe ${\displaystyle {\mathcal {U}}}$ and a family ${\displaystyle {\mathcal {S}}}$ of subsets of ${\displaystyle {\mathcal {U}}}$, a packing is a subfamily ${\displaystyle {\mathcal {C}}\subseteq {\mathcal {S}}}$ of sets such that all sets in ${\displaystyle {\mathcal {C}}}$ are pairwise disjoint. The size of the packing is ${\displaystyle |{\mathcal {C}}|}$. In the set packing decision problem, the input is a pair ${\displaystyle ({\mathcal {U}},{\mathcal {S}})}$ and an integer ${\displaystyle k}$; the question is whether there is a set packing of size ${\displaystyle k}$ or more. In the set packing optimization problem, the input is a pair ${\displaystyle ({\mathcal {U}},{\mathcal {S}})}$, and the task is to find a set packing that uses the most sets.

The problem is clearly in NP since, given k subsets, we can easily verify that they are pairwise disjoint in polynomial time.

The optimization version of the problem, maximum set packing, asks for the maximum number of pairwise disjoint sets in the list. It is a maximization problem that can be formulated naturally as an integer linear program, belonging to the class of packing problems.

## Integer linear program formulation

The maximum set packing problem can be formulated as the following integer linear program.

 maximize ${\displaystyle \sum _{S\in {\mathcal {S}}}x_{S}}$ (maximize the total number of subsets) subject to ${\displaystyle \sum _{S\colon e\in S}x_{S}\leqslant 1}$ for all ${\displaystyle e\in {\mathcal {U}}}$ (selected sets have to be pairwise disjoint) ${\displaystyle x_{S}\in \{0,1\}}$ for all ${\displaystyle S\in {\mathcal {S}}}$ . (every set is either in the set packing or not)

## Complexity

The set packing problem is not only NP-complete, but its optimization version (general maximum set packing problem) has been proven as difficult to approximate as the maximum clique problem; in particular, it cannot be approximated within any constant factor.[1] The best known algorithm approximates it within a factor of ${\displaystyle O({\sqrt {|U|}})}$ .[2] The weighted variant can also be approximated as well.[3]

However, the problem does have a variant which is more tractable: if we assume no subset exceeds k≥3 elements, the answer can be approximated within a factor of k/2 + ε for any ε > 0; in particular, the problem with 3-element sets can be approximated within about 50%. In another more tractable variant, if no element occurs in more than k of the subsets, the answer can be approximated within a factor of k. This is also true for the weighted version.

## Equivalent problems

There is a one-to-one polynomial-time reduction between the independent set problem and the set packing problem:

• Given a set packing problem on a collection ${\displaystyle {\mathcal {S}}}$ , create a graph where for each set ${\displaystyle S\in {\mathcal {S}}}$  there is a vertex ${\displaystyle v_{S}}$ , and there is an edge between ${\displaystyle v_{S}}$  and ${\displaystyle v_{T}}$  if ${\displaystyle S\cap T\neq \varnothing }$ . Now every independent set of vertices in the generated graph corresponds to a set packing in ${\displaystyle {\mathcal {S}}}$ .
• Given an independent vertex set problem on a graph ${\displaystyle G(V,E)}$ , create a collection of sets where for each vertex ${\displaystyle v}$  there is a set ${\displaystyle S_{v}}$  containing all edges adjacent to ${\displaystyle v}$ . Now every set packing in the generated collection corresponds to an independent vertex set in ${\displaystyle G(V,E)}$ .

This is also a bidirectional PTAS reduction, and it shows that the two problems are equally difficult to approximate.

## Special cases

Matching and 3-dimensional matching are special cases of set packing. A maximum-size matching can be found in polynomial time, but finding a largest 3-dimensional matching or a largest independent set is NP-hard.

## Other related problems

Set packing is one among a family of problems related to covering or partitioning the elements of a set. One closely related problem is the set cover problem. Here, we are also given a set S and a list of sets, but the goal is to determine whether we can choose k sets that together contain every element of S. These sets may overlap. The optimization version finds the minimum number of such sets. The maximum set packing need not cover every possible element.

The NP-complete exact cover problem, on the other hand, requires every element to be contained in exactly one of the subsets. Finding such an exact cover at all, regardless of size, is an NP-complete problem. However, if we create a singleton set for each element of S and add these to the list, the resulting problem is about as easy as set packing.

Karp originally showed set packing NP-complete via a reduction from the clique problem.

1. ^ Hazan, Elad; Safra, Shmuel; Schwartz, Oded (2006), "On the complexity of approximating k-set packing", Computational Complexity, 15 (1): 20–39, CiteSeerX 10.1.1.352.5754, doi:10.1007/s00037-006-0205-6, MR 2226068. See in particular p. 21: "Maximum clique (and therefore also maximum independent set and maximum set packing) cannot be approximated to within ${\displaystyle O(n^{1-\epsilon })}$  unless NP ⊂ ZPP."