This is an old revision of the document!


4.6. Implementing Kruskal's Algorithm: The Union-Find Data Structure


  • The basic idea behind the algorithm is that the graph in question has a fixed population of nodes
  • And the graphs grows over time by inserting edges between certain pairs of nodes.
  • Goal: Maintain a set of connected components of the graph being processed throughout the whole operation
  • With this algorithm, we develop a data structure called the Union-Find structure
  • The Union-Find data structure stores a representation of the components in a way that supports fast searching and updating
  • This data structure is used to efficiently implement Kruskal's Algorithm
As each edge e = (u,w) is considered
Find the connected components containing v and w.

If these components are different:
Then there is no path from v to w, so e should be added to the structure

Elif they're the same:

Then there's a path from v to w, so e should be omitted.


The Problem

  • Operations:
    • Find(u) : returns name of set containing u
      • It can be used to find if two nodes u and v are in the same set by simply checking if Find(u) = Finf(v)
      • Goal implementation: O(log n)
    • Union(A, B) : merges sets A and B into one set
      • Goal implementation: O(log n)
  • The sets obtained will be the connected components of the graph
  • So, for a given node u, Find(u) returns the name of the component containing u
  • To add an edge (u,v) we first test if Find(u) = Find(v)
  • If Find(u) ≠ Find(v), then Union(Find(u), Find(v)) merges the two components into one
  • The Union-Find data structure maintains components of a graph only when we add edges, not when we delete them
courses/cs211/winter2012/journals/jeanpaul/chapterfour_sectionvi.1330570086.txt.gz · Last modified: 2012/03/01 02:48 by mugabej
CC Attribution-Noncommercial-Share Alike 4.0 International
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0