In computer science, a disjoint-set data structure is a data structure used to handle the merging and querying of disjoint sets (a series of sets with no duplicate elements).
Structure
A non-intersecting forest represents each set as a tree (deepth with 1), where each node is an element. A node holds a reference to its parent node, and the root node of the tree holds an empty reference or a reference to itself or some other invalid value to indicate itself as the root node. This data structure was first proposed by Bernard A. Galler and Michael J. Fischer in 1964,but it took several years before a precise analysis was completed. As shown:
New Approach
There are many different types of variations of this data structure that we can change to suit different needs. I used two HashMap to store the roots and parent (parent
and roots
). For parent
, it stores the roots of every element, for roots
, it stores the root of all the elements.parent
can be seen as {node: root}
, roots
can be seen as {root: [node1, node2, ...]}
.
Search
In my method, I use Hashmap to store the tree. So, the search complexity is $O(n)$.
1 | return parent[x] |
Union
For union, we need to set the root of such element to be a new root. And merge the childs of such root.
1 | def union(x, y): |
Application
It can be used for subgraph pattern matching decompistion, bipartite graph and commuity.
Code
1 | class UF: |