Priority Queues
CS240E: Data Structures and Data Management (Enriched)
2019 Winter, David Duan

Contents:
- Abstract Data Types: ADT, Stack ADT, Queue ADT, Priority Queue, Using a Priority Queue to Sort
- Realizations of Priority Queues: Unsorted Arrays / Unsorted Linked Lists, Sorted Arrays / Sorted Linked Lists
- Heaps: Representation, Operations on Heaps (Insertion, Delete Max), Priority Queue Realization Using Heaps
- Sorting using Heaps: HeapSort, Building Heaps by Bubble-up, Building Heaps by Bubble-down, HeapSort Implementation
- Other Heap Operations: GetMax, ChangePriority, Delete
- Heap Merge/Join: Worst-Case Heap Joining, Expected Merge (Pseudocode, Graphical Illustration, Expected Runtime Analysis)
- Binomial Heap: Binomial Tree, Binomial Heap, Binomial Heap Operations, Amortized Analysis (via Potential Function), Improvement on Worst-Case Performance, Magic Trick
Abstract Data Type: A description of information and a collection of operations on that information.
The information is accessed only through the operations.
We can have various realizations of an ADT, each specifying how the information is stored and how the operations are implemented.
Stack: An ADT consisting of a collection of items with operations:
- push: inserting an item
- pop: removing the most recently inserted item
- size, isEmpty, top

Items are removed in Last-In First-Out order.
Applications: procedure calls, web browser back button.
Realizations: arrays or linked lists.
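A minimal Python sketch of the array realization; the class and method names simply mirror the ADT above:

class Stack:
    """LIFO stack on a dynamic array (a minimal sketch)."""
    def __init__(self):
        self._items = []

    def push(self, x):
        self._items.append(x)        # insert at the top

    def pop(self):
        return self._items.pop()     # remove the most recently inserted item

    def top(self):
        return self._items[-1]

    def size(self):
        return len(self._items)

    def isEmpty(self):
        return not self._items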
Queue: An ADT consisting of a collection of items with operations:
- enqueue: inserting an item
- dequeue: removing the least recently inserted item
- size, isEmpty, front

Items are removed in First-In First-Out order.
Items enter the queue at the rear and are removed from the front.
Applications: CPU scheduling, disk scheduling.
Realizations: (circular) arrays, linked lists.
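A minimal Python sketch of the circular-array realization; the fixed capacity and the error handling are illustrative choices, not part of the ADT:

class CircularQueue:
    """FIFO queue on a fixed-size circular array (a minimal sketch)."""
    def __init__(self, capacity):
        self.data = [None] * capacity
        self.front = 0      # index of the least recently inserted item
        self.size = 0

    def enqueue(self, x):
        if self.size == len(self.data):
            raise OverflowError("queue is full")
        rear = (self.front + self.size) % len(self.data)   # first free slot
        self.data[rear] = x
        self.size += 1

    def dequeue(self):
        if self.size == 0:
            raise IndexError("queue is empty")
        x = self.data[self.front]
        self.front = (self.front + 1) % len(self.data)     # wrap around
        self.size -= 1
        return x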
PQ: An ADT consisting of a collection of items (each having a priority or key) with operations:
- insert: inserting an item tagged with a priority
- deleteMax (or extractMax, getMax): removing the item of highest priority

The above definition is for a maximum-oriented priority queue. A minimum-oriented priority queue is defined in the natural way, by replacing the operation deleteMax by deleteMin.
Applications: to-do list, simulation systems, sorting.
PQ-Sort(A[0..n-1])

1. initialize PQ to an empty priority queue
2. for k <- 0 to n-1 do
3.     PQ.insert(A[k])
4. for k <- n-1 down to 0 do
5.     A[k] <- PQ.deleteMax()
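A runnable Python version of PQ-Sort; it uses the standard-library heapq module (a min-heap), negating keys to simulate deleteMax:

import heapq

def pq_sort(A):
    """Sort A in place: n inserts followed by n deleteMaxes."""
    pq = []
    for x in A:
        heapq.heappush(pq, -x)       # heapq is a min-heap; negate keys for max-behaviour
    for k in range(len(A) - 1, -1, -1):
        A[k] = -heapq.heappop(pq)    # largest remaining item goes to position k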
Unsorted arrays / unsorted linked lists: O(1) insert and O(n) deleteMax.
Sorted arrays / sorted linked lists: O(n) insert and O(1) deleteMax.
A heap is a tree-based data structure satisfying two properties:
1. Structural property: all levels are completely filled, except possibly the last, which is filled from left to right.
2. Heap-order property: the key of every node is greater than or equal to the keys of its children (for a max-heap).
Remarks
- We usually do not store heaps as linked binary trees but as arrays, listing the nodes level by level.
- We should hide implementation details using helper functions such as root(), parent(i), last(n), hasLeftChild(i), etc.
Place the new key at the first free leaf, then perform a fix-up:

fix-up(A, k)
k: an index corresponding to a node of the heap

1. while parent(k) exists and A[parent(k)] < A[k] do
2.     swap A[k] and A[parent(k)]
3.     k <- parent(k)
The new item bubbles up until it reaches its correct place in the heap. Insertion has runtime O(log n).
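A Python version of fix-up for the array representation, assuming the usual 0-indexed layout where root() = 0 and parent(k) = (k - 1) // 2:

def fix_up(A, k):
    """Bubble A[k] up until the heap-order property is restored."""
    while k > 0 and A[(k - 1) // 2] < A[k]:            # parent exists and is smaller
        A[(k - 1) // 2], A[k] = A[k], A[(k - 1) // 2]  # swap with parent
        k = (k - 1) // 2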
We replace the root (the maximum item of the heap) with the last leaf, remove the last leaf, then perform a fix-down:
fix-down(A, n, k)
A: an array that stores a heap of size n
k: an index corresponding to a node of the heap

1. while k is not a leaf do
2.     // Find the child with the larger key
3.     j <- left child of k
4.     if (j is not last(n) and A[j+1] > A[j])
5.         j <- j + 1
6.     if A[k] >= A[j] break
7.     swap A[j] and A[k]
8.     k <- j
The new root bubbles down until it reaches its correct place in the heap. DeleteMax has runtime O(log n).
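A matching Python version of fix-down; in the same layout, the children of k are at indices 2k + 1 and 2k + 2:

def fix_down(A, n, k):
    """Bubble A[k] down in a heap of size n until the heap-order property is restored."""
    while 2 * k + 1 < n:                    # while k has at least a left child
        j = 2 * k + 1                       # left child of k
        if j + 1 < n and A[j + 1] > A[j]:   # pick the child with the larger key
            j = j + 1
        if A[k] >= A[j]:
            break
        A[j], A[k] = A[k], A[j]
        k = j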
We can store the items of a PQ in an array and keep track of its size:

insert(x)

1. increase size
2. l <- last(size)
3. A[l] <- x
4. fix-up(A, l)
def last(n):
    """Return the index of the last item of an n-item heap."""
    return n - 1

def insert(x):
    """Insert item x into the heap."""
    global size            # size and A are module-level state in this sketch
    size += 1              # increase size
    idx = last(size)       # index of the first free leaf, which is size - 1
    A[idx] = x             # place x at this position
    fix_up(A, idx)         # bubble x up to its correct position
deleteMax()

1. l <- last(size)
2. swap A[root()] and A[l]
3. decrease size
4. fix-down(A, size, root())
5. return A[l]
def root():
    """Return the index of the root of a heap."""
    return 0

def deleteMax():
    """Remove and return the maximum of the heap."""
    global size                            # size and A are module-level state in this sketch
    idx = last(size)                       # position of the last element
    A[root()], A[idx] = A[idx], A[root()]  # swap last leaf with root
    size -= 1                              # update size
    fix_down(A, size, root())              # bubble the new root down to its correct position
    return A[idx]                          # the old root, i.e. the max
Recall that any PQ can be used to sort in time O(n · (insert + deleteMax)), i.e., the cost of n insertions plus n deleteMax operations. Using the binary-heaps implementation of PQs, we obtain:
PQ-SortWithHeaps(A)

1. initialize H to an empty heap
2. for k <- 0 to n-1 do
3.     H.insert(A[k])
4. for k <- n-1 down to 0 do
5.     A[k] <- H.deleteMax()
Since both operations run in O(log n) time for heaps, PQ-Sort using heaps takes O(n log n) time.
We can improve this with two simple tricks, which together give HeapSort: building the heap in O(n) time with fix-downs, and using the input array itself to store the heap.
Problem: Given n items in A[0..n-1], build a heap containing all of them.
Solution 1: Start with an empty heap and insert the items one at a time. This corresponds to doing n fix-ups, so the worst case is O(n log n).
Problem: Given n items in A[0..n-1], build a heap containing all of them.
Solution 2: Use fix-downs instead:
heapify(A)
A: an array

1. n <- A.size()
2. for i <- parent(last(n)) downto 0 do
3.     fix-down(A, n, i)
A careful analysis yields a worst-case complexity of O(n): a node at height h costs O(h) to fix down, and there are at most about n / 2^(h+1) nodes at height h, so the total work is on the order of the sum of h · n / 2^(h+1) over all h, which is O(n). This means a heap can be built in linear time.
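In Python, reusing fix_down from above (in the 0-indexed layout, parent(last(n)) = (n - 2) // 2):

def heapify(A):
    """Build a max-heap from an arbitrary array in O(n) time using fix-downs."""
    n = len(A)
    for i in range((n - 2) // 2, -1, -1):   # parent(last(n)) down to the root
        fix_down(A, n, i)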
Idea: run PQ-Sort with heaps.
Improvement: use the same input array for storing the heap.
HeapSort(A, n)

1. // heapify
2. n <- A.size()
3. for i <- parent(last(n)) downto 0 do
4.     fix-down(A, n, i)
5. // repeatedly find maximum
6. while n > 1
7.     // do deleteMax
8.     swap items at A[root()] and A[last(n)]
9.     decrease n
10.    fix-down(A, n, root())
The for-loop takes O(n) time and the while-loop takes O(n log n) time, so HeapSort runs in O(n log n) time in place.
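A Python version of HeapSort, reusing fix_down from above; like the pseudocode, it sorts in place:

def heap_sort(A):
    """Sort A in place: heapify, then repeatedly move the max to the end."""
    n = len(A)
    for i in range((n - 2) // 2, -1, -1):   # heapify in O(n)
        fix_down(A, n, i)
    while n > 1:
        A[0], A[n - 1] = A[n - 1], A[0]     # deleteMax: the max goes to its final position
        n -= 1
        fix_down(A, n, 0)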
changePriority(item, newKey)
- Change the priority of <item> to <newKey>. O(log n) time.
- Helper: fix-up(item): if the item's priority is greater than its parent's, swap them. In the worst case we need log(n) comparisons, so the runtime is O(log n).
- Helper: fix-down(item): if the item's priority is less than one of its children's, swap. For the same reason, the runtime is O(log n).

1. key(item) <- newKey   // Set item's key to newKey
2. fix-up(item)          // Bubble the item up/down until it is at the
3. fix-down(item)        // ... correct place
delete(item)
- Delete the item from an addressable max-heap. O(log n) time.
- As a remark, `infinity` here represents an arbitrarily large value.
- Helper: changePriority - see above, O(log n)
- Helper: deleteMax - see above, O(log n)

1. changePriority(item, infinity)  // Set item's key to a large number
2. deleteMax()                     // Since changePriority makes sure that the
                                   // ... item is at the correct place, which,
                                   // ... given its new priority is infinity,
                                   // ... must be the root of the heap,
                                   // ... deleteMax() removes it.
Problem join(H1, H2): create a heap that stores the items of both H1 and H2.
The task is easy for arrays/lists: O(n) by merging/concatenation (Khan Academy: Linear Time Merging).
It is also easy for binary heaps: O(n), by concatenating the arrays and running heapify.
Solution We present three ideas:
- An algorithm for the tree-based implementation with worst-case time O(log^3 n).
- A randomized algorithm (which destroys the structural property of heaps) with expected time O(log n).
- An algorithm using higher-degree heaps (binomial heaps) with amortized time O(log n).
Given heaps H1, H2 with heights h1, h2, we want to create a heap combining both heaps.
Case I. Both heaps are full and they have the same height:
Create a new root with key infinity whose subtrees are H1 and H2, then call deleteMax() to remove it. Since deleteMax() also takes care of the structural property (the last leaf replaces the root, followed by a fix-down), we are done. The runtime is dominated by deleteMax(), which takes O(log n) time.
Case II. Both heaps are full and H2 has a smaller height:
Phase I: Merge
Let h2 represent H2's height. Since H1 is full and h2 < h1, we can find a subheap of H1 with the same height as H2. In the left diagram, four such subheaps are marked.
Since the chosen subheap S (rooted at v, whose parent is p) and H2 are both full with the same height, we use Case I's strategy to merge H2 into S: a new node w with key infinity takes v's place as a child of p, with S and H2 as its subtrees. We arrive at the right diagram.
Phase II: Adjust
To get rid of node w, consider the following three nodes: p (w's parent), v (the root of S), and r2 (the root of H2).
We have the following observations:
- S and H2 are valid heaps by construction, hence both satisfy the heap-order property.
- By the heap-order property of H1, p was v's parent previously, which implies key(p) >= key(v), so subheap S (and its root v) will not violate the heap property.
- The only problem is we don't know the relationship between key(p) and key(r2).
Unlike Case I, w is not at the root, so a single deleteMax() cannot remove it. To fix the problem (when key(r2) might exceed key(p)):
- For all ancestors a of w (from p up to the root), call fix-down(a). Each call moves w up one level and repairs the subtree below it. Each fix-down(a) requires O(log n) comparisons, and there are O(log n) ancestors, so overall this step takes O(log^2 n).
- Call deleteMax(): after those fix-downs, w (with key infinity) will be at the root position, so calling deleteMax() effectively removes w.
Overall, the runtime is dominated by those fix-downs and hence is O(log^2 n).
Case III. H1 is full but H2 is not.
This also takes O(log^2 n) time; we omit the details of this case.
Case IV. Both are not full.
We split H2 into subheaps that are full:

              A
            /   \
          B       K
         / \     / \
        C   D   k   K
       / \ / \ / \ / \
      C  C E F K  K K K
     / \ / \ /
    C  C C  C G

Each group of nodes with the same label (e.g., the subheap rooted at k) is a full heap, and the node F by itself is a heap (because we have nothing to group it with); the groups arise by walking down the path A -> B -> D -> E -> F. For each full subheap, merge it into H1 using Case III. Since each merge takes O(log^2 n) and there are O(log n) full subheaps, we end up with O(log^3 n) time.
Conclusion Overall, the runtime is dominated by Case IV: O(log^3 n) time.
heapMerge(r1, r2)
r1, r2: roots of two heaps, possibly NIL
return: root of merged heap

1. if r1 is NIL return r2
2. if r2 is NIL return r1
3. if key(r2) > key(r1)
4.     randomly pick one child d of r2
5.     replace subheap at d by heapMerge(r1, d)
6.     return r2
7. else
8.     randomly pick one child c of r1
9.     replace subheap at c by heapMerge(c, r2)
10.    return r1
Suppose we want to merge two heaps, with r1 the root of the left heap and r2 the root of the right heap.
- We see that the root of the left heap is greater than the root of the right heap (see line 7).
- We randomly pick one child of r1 and recurse on it (see line 8).
- In the recursive call, we see key(r2) > key(r1) (see line 3).
- We randomly pick one child of r2 and recurse on it (see line 4).
- We keep doing this.
- Finally, we see that r1 is NIL and move the remaining subheap rooted at r2 into place (see line 1).
- Return from the call stack.
- Return from the call stack and compare keys: we move the appropriate subheap.
- Return from the call stack and compare keys: we move the appropriate subheap.
- We are done!
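A minimal Python sketch of the randomized merge on a pointer-based max-heap; the Node class is a hypothetical helper, and a coin flip picks the child to recurse into:

import random

class Node:
    """A pointer-based binary-tree node (hypothetical helper for this sketch)."""
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right

def heap_merge(r1, r2):
    """Return the root of the merged max-heap (mirrors heapMerge above)."""
    if r1 is None:
        return r2
    if r2 is None:
        return r1
    if r2.key > r1.key:                           # r2 has the larger key, so it stays on top
        if random.random() < 0.5:                 # randomly pick one child d of r2
            r2.left = heap_merge(r1, r2.left)
        else:
            r2.right = heap_merge(r1, r2.right)
        return r2
    else:                                         # r1 stays on top
        if random.random() < 0.5:                 # randomly pick one child c of r1
            r1.left = heap_merge(r1.left, r2)
        else:
            r1.right = heap_merge(r1.right, r2)
        return r1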
Recall that the expected runtime of a randomized algorithm on an instance I can be expressed as
T_exp(I) = E[T(I, R)] = Σ_R T(I, R) · Pr[R],
and T_exp(n) is the maximum of T_exp(I) over all instances I of size n.
We want to design the randomization such that all instances of size n have equally good performance, because the expected runtime for size n depends on the expected runtime of every instance.
Before we analyze the expected runtime of heap merge, let us first consider a simpler problem:
Problem Given a binary tree, you do a random walk down the tree (at each node, moving to a uniformly random child) until you reach NIL. What is the expected runtime? We make no assumption on the tree, i.e., it may be very unbalanced.
Lemma The expected runtime T(n) on a tree of n nodes is O(log n).
Proof. Assume that the time to go from a node to a randomly chosen child is at most c. We claim that T(n) <= c · log2(n + 1). We prove this by induction.
Base: n = 0. The tree is empty, so we hit NIL immediately and T(0) = 0.
Step: n >= 1. Let nL, nR represent the number of nodes in the left and right subtrees, respectively, so nL + nR = n - 1.
The total runtime can be expressed as:
T(n) <= c + (1/2) T(nL) + (1/2) T(nR) <= c + (c/2) log2(nL + 1) + (c/2) log2(nR + 1).
Recall concave functions f have the property that (1/2) f(x) + (1/2) f(y) <= f((x + y) / 2).
Thus,
T(n) <= c + c · log2(((nL + 1) + (nR + 1)) / 2) = c + c · log2((n + 1) / 2) = c · log2(n + 1).
Back to the heap merge analysis: we can view the merge as going down in both heaps randomly until one hits a NIL child. Then the expected runtime is O(log2(n1 + 1) + log2(n2 + 1)), which is O(log n).
Recall that merge is easy for lists: O(1) for concatenation. To achieve our goal of fast worst-case merging, we can store a list of trees that each satisfy the heap-order property.
A binomial tree is defined recursively as follows: a binomial tree of order 0 is a single node, and a binomial tree of order k has a root whose children are the roots of binomial trees of orders k-1, k-2, ..., 1, 0.
Because of its unique structure, a binomial tree of order k can be constructed from two trees of order k-1 trivially by attaching one of them as the left-most child of the root of the other tree. This feature is central to the merge operation of a binomial heap, which is its major advantage over conventional heaps.
Remark. The name comes from the shape: a binomial tree of order k has C(k, d) nodes at depth d.
Claim. A binomial tree of order k has 2^k nodes and height k.
Proof. We prove by induction. The base case (k = 0) is trivial. Suppose the claim is true for order k-1. A binomial tree of order k contains two copies of the order-(k-1) tree, so it has 2^(k-1) + 2^(k-1) = 2^k nodes. Because the max depth of a node in the order-k tree is one greater than the max depth in the order-(k-1) tree, by the inductive hypothesis the maximum depth is (k-1) + 1 = k.
A binomial heap is implemented as a set (or list, as presented in our class) of binomial trees that satisfy the binomial heap properties:
1. Each binomial tree in the heap obeys the (max-)heap-order property.
2. There is at most one binomial tree of each order.
The first property ensures that the root of each binomial tree contains the maximum in that tree, so the overall maximum of the heap is the largest root.
The second property implies that a binomial heap with n nodes consists of at most log2(n) + 1 binomial trees. In fact, the orders of these trees are uniquely determined by the number of nodes n: each binomial tree corresponds to a set bit in the binary representation of n. For example, if a heap contains 13 nodes (binary 1101), then this binomial heap will consist of three binomial trees, of orders 3, 2, and 0.
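A small Python illustration of this bits-to-orders correspondence (the function name tree_orders is ours):

def tree_orders(n):
    """Orders of the binomial trees in a binomial heap with n nodes: the set bits of n."""
    return [k for k in range(n.bit_length()) if (n >> k) & 1]

print(tree_orders(13))   # [0, 2, 3]: trees of order 0, 2 and 3, since 13 = 0b1101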
Let R be the list of trees and let d_T be the degree of the root of tree T (take the maximum over all trees).
Merge: O(1), as we are basically concatenating lists.
Insert: O(1), for the same reason. In fact, insertion can be seen as merging with a single-node heap.
DeleteMax: O(log n) amortized (analysis below).
- Locate the max: search among all roots, |R| of them, so O(|R|) for searching.
- Remove the max and merge its subtrees into the heap: suppose the tree has d children; then we need to perform d insertions, hence O(d) time for merging.
- Cleanup: we want to merge trees together so that after this step, no two trees in the list have the same root degree.
binomialHeapCleanup(R, n)
R: stack of trees (of a binomial heap of size n)

1. C <- array of length (log n + 1), initialized NIL
2. while R is non-empty:
3.     T <- R.pop()                      // extract a tree to work with
4.     d_T <- degree of root of T        // check how many children root of T has
5.     if C[d_T] is NIL                  // if we haven't found another tree of the same degree
6.         C[d_T] <- T                   // we put a pointer to T in C[d_T]
7.     else                              // not NIL means we have seen another tree of the same degree
8.         T' <- C[d_T]; C[d_T] <- NIL   // extract the previous result
9.         if key(root(T)) > key(root(T'))
10.            make root(T') a child of root(T)
11.            R.push(T)
12.        else
13.            make root(T) a child of root(T')
14.            R.push(T')
15. for i <- 0 to log n:                 // finally, we collect the remaining trees
16.    if C[i] != NIL
17.        R.push(C[i])
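A runnable Python sketch of this cleanup, assuming a hypothetical BTree helper (a root key plus a list of child subtrees) and a Python list R used as the stack:

import math

class BTree:
    """A tree with a root key and a list of child subtrees (hypothetical helper)."""
    def __init__(self, key):
        self.key = key
        self.children = []                 # degree of the root = len(self.children)

def binomial_heap_cleanup(R, n):
    """Combine trees of equal root degree until all degrees in R are distinct."""
    # Assumes the trees are binomial, so every degree stays at most log2(n) + 1.
    C = [None] * (int(math.log2(n)) + 2)   # C[d] remembers a tree whose root has degree d
    while R:
        T = R.pop()                        # extract a tree to work with
        d = len(T.children)                # degree of the root of T
        if C[d] is None:                   # no other tree of this degree seen so far
            C[d] = T
        else:                              # found a partner of the same degree
            Tp, C[d] = C[d], None
            if T.key > Tp.key:             # the root with the larger key wins
                T.children.append(Tp)
                R.append(T)                # its degree went up; it may collide again
            else:
                Tp.children.append(T)
                R.append(Tp)
    for t in C:                            # finally, collect the remaining trees
        if t is not None:
            R.append(t)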
Lemma. If all trees were built with these operations, then a tree whose root has degree d contains exactly 2^d nodes, i.e., it is a binomial tree.
Proof. We only ever attach the root of a tree to the root of another tree of equal degree, which doubles the node count and increases the root degree by one, so the invariant is maintained. This also implies d <= log2(n) for every tree in a heap with n items.
Define Φ_i := c · (number of trees in the list at time stamp i), where c is a constant bigger than all constants in all other terms. Recall the amortized runtime of step i is defined to be T_amort(i) = T_actual(i) + Φ_i - Φ_{i-1}.
To show merge(R1, R2) has amortized time O(1): the actual cost is O(1) (list concatenation), and the potential is unchanged, since the merged list contains |R1| + |R2| trees, exactly as many as before. Hence the amortized cost is O(1).
Time to analyze deleteMax():
- Actual time: O(|R| + log n), covering the search among the roots, the reinsertion of at most log n subtrees, and the cleanup.
- Potential difference: c · (|R_after| - |R_before|), the number of trees after minus the number of trees before.
Putting them together:
T_amort = O(|R_before| + log n) + c · (|R_after| - |R_before|), which is O(log n),
since |R_after| <= log2(n) + 1, where the log2(n) bounds the max degree of all trees that could be in a binomial heap with n items (and c is chosen large enough to cancel the O(|R_before|) term).
So far we have achieved amortized runtime O(log n) for deleteMax(). If we are willing to trade some runtime on merge/insert for deleteMax(), we can achieve worst-case O(log n) for all three operations.
By calling clean-up after every operation (insert/merge), we can keep |R| in O(log n) at all times. Recall clean-up takes O(|R| + log n), which is O(log n) when |R| is already O(log n). Enforcing this would make insert/merge/deleteMax all achieve worst-case O(log n) runtime.
We can use a binary tree to store a tree with an arbitrary degree!
The new binary tree T' has the same nodes as T. For a node v in T, its left child in T' is v's left-most child in T, and its right child in T' is v's next sibling in T.
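A sketch of this encoding in Python; MultiNode and BinNode are hypothetical helper classes, not from the course code:

class MultiNode:
    """A node in a tree of arbitrary degree (hypothetical helper)."""
    def __init__(self, key, children=None):
        self.key = key
        self.children = children or []

class BinNode:
    """Binary encoding: left = left-most child in T, right = next sibling in T."""
    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None

def encode(node):
    """Convert a tree of arbitrary degree into its binary representation."""
    if node is None:
        return None
    b = BinNode(node.key)
    prev = None
    for child in node.children:   # chain the children left to right
        enc = encode(child)
        if prev is None:
            b.left = enc          # the left-most child becomes the left child
        else:
            prev.right = enc      # each later child becomes the previous one's right child
        prev = enc
    return b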