Lecture 1 - Course Introduction

Asymptotic Notation

We consider two functions $f(n), g(n)$ with values in $\mathbb R_{> 0}$.

Big-O

We say $f(n) \in O(g(n))$ if there exist $C > 0$ and $n_0$ such that for $n \geq n_0$, $f(n) \leq Cg(n)$

Big-$\Omega$

We say $f(n) \in \Omega(g(n))$ if there exist $C > 0$ and $n_0$ such that for $n \geq n_0$, $f(n) \geq Cg(n)$

$\Theta$

We say $f(n) \in \Theta(g(n))$ if $f(n) \in O(g(n))$ and $f(n) \in \Omega(g(n))$

little-o

We say $f(n) \in o(g(n))$ if for every $C > 0$ there exists $n_0$ such that for $n \geq n_0$, $f(n) \leq Cg(n)$

little-$\omega$

We say $f(n) \in \omega(g(n))$ if for every $C > 0$ there exists $n_0$ such that for $n \geq n_0$, $f(n) \geq Cg(n)$

Some side notes

Maximum subarray problem

Lecture 2: Solving Recurrences

Recursion Tree Method

The Master Method

Suppose that $a \geq 1$ and $b > 1$. Consider the recurrence $$T(n) = a T\left( \frac{n}{b} \right) + \Theta(n^y)$$ in sloppy or exact form. Denote $x = \log_b a$. Then: $$T(n) \in \begin{cases} \Theta(n^x) & \text{if } y < x \\ \Theta(n^y \log n) & \text{if } y = x \\ \Theta(n^y) & \text{if } y > x \end{cases} $$

The three cases can be read off the recursion tree, where $r = a/b^y$ is the ratio between the total costs of consecutive levels:

| Case | $r$ | $y$ vs. $x$ | Complexity of $T(n)$ |
| --- | --- | --- | --- |
| heavy leaves | $r > 1$ | $y < x$ | $T(n) \in \Theta(n^x)$ |
| balanced | $r = 1$ | $y = x$ | $T(n) \in \Theta(n^y \log n)$ |
| heavy top | $r < 1$ | $y > x$ | $T(n) \in \Theta(n^y)$ |

"Heavy leaves" means that cost of the recursion tree is dominated by the cost of the leaf nodes.

"Heavy top" means that cost of the recursion tree is dominated by the cost of the root node.

Lecture 3 - Divide and Conquer

A general algorithmic paradigm (strategy):

Counting Inversions

Easy algorithm (two nested loops) in $\Theta (n^2)$.

Option 1

Take each $i \leq n/2$ and binary-search its position in the right-hand side.

Option 2

Enhanced Mergesort
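The enhanced-mergesort pseudocode is in the notes; below is a minimal Python sketch of the idea (the function name and list-slicing style are my own). During the merge, each element taken from the right half counts how many left-half elements it precedes.

```python
def count_inversions(a):
    """Return (sorted copy of a, number of inversions) in O(n log n)."""
    if len(a) <= 1:
        return a, 0
    mid = len(a) // 2
    left, inv_left = count_inversions(a[:mid])
    right, inv_right = count_inversions(a[mid:])
    merged, inv = [], inv_left + inv_right
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            # left[i:] are all larger than right[j]: each forms an inversion with it
            inv += len(left) - i
            merged.append(right[j]); j += 1
    merged += left[i:] + right[j:]
    return merged, inv
```

For example, `count_inversions([2, 4, 1, 3, 5])` returns `([1, 2, 3, 4, 5], 3)`.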

Multiplying Polynomials

Divide and Conquer

Karatsuba's algorithm
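The lecture's setting is polynomials, but the same three-half-size-products trick is easiest to show on integers; a sketch (the base case and splitting threshold are my own choices):

```python
def karatsuba(x, y):
    """Multiply non-negative integers with 3 recursive half-size products."""
    if x < 10 or y < 10:
        return x * y
    m = max(x.bit_length(), y.bit_length()) // 2
    xh, xl = x >> m, x & ((1 << m) - 1)        # x = xh * 2^m + xl
    yh, yl = y >> m, y & ((1 << m) - 1)
    a = karatsuba(xh, yh)
    b = karatsuba(xl, yl)
    c = karatsuba(xh + xl, yh + yl) - a - b    # = xh*yl + xl*yh
    return (a << (2 * m)) + (c << m) + b
```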

Toom-Cook and FFT

Multiplying Matrices

Divide and Conquer

Strassen's algorithm

Lecture 4: Divide and Conquer (Part 2)

Closest pairs

Quick Select
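A sketch of quick select with a random pivot, giving expected $O(n)$ time (median of medians, below, makes the pivot choice deterministic); names are my own:

```python
import random

def quick_select(a, k):
    """Return the k-th smallest element of a, for 1 <= k <= len(a)."""
    pivot = random.choice(a)
    less = [x for x in a if x < pivot]
    equal = [x for x in a if x == pivot]
    greater = [x for x in a if x > pivot]
    if k <= len(less):
        return quick_select(less, k)
    if k <= len(less) + len(equal):
        return pivot
    return quick_select(greater, k - len(less) - len(equal))
```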

Median of medians

Lec 05: Breadth First Search

Undirected graphs

Definition

A graph $G$ is a pair $(V, E)$:

Convention: $n$ is the number of vertices, $m$ is the number of edges.

Data structures

Connected graphs, path, cycles, trees

Definition

Subgraphs, connected components

Definition

Let $G_i = (V_i, E_i)$, $i = 1, ..., s$ be the connected components of $G = (V, E)$.

Breadth-first search

Complexity

Analysis:

Total: $O(n+m)$
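A sketch matching this analysis, with the graph given as a dictionary of adjacency lists (the representation and names are my own):

```python
from collections import deque

def bfs(adj, s):
    """BFS from s. Returns (visited, parent, level); O(n + m)."""
    visited, parent, level = {s}, {s: None}, {s: 0}
    q = deque([s])
    while q:
        v = q.popleft()
        for w in adj[v]:
            if w not in visited:
                visited.add(w)
                parent[w] = v              # {v, w} becomes a BFS tree edge
                level[w] = level[v] + 1
                q.append(w)
    return visited, parent, level
```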

Correctness

Claim

For all vertices $v$, if visited$[v]$ is true at the end, there is a path $s \leadsto v$ in $G$

Lemma

For all vertices $v$, there is a path $s \leadsto v$ in $G$ if and only if visited$[v]$ is true at the end

BFS tree

Definition

the BFS tree $T$ is the subgraph made of: the visited vertices, together with the edges $\{$parent$[v], v\}$ for every visited vertex $v \neq s$

Claim

The BFS tree $T$ is a tree

Shortest paths from the BFS tree

Claim

For every visited vertex $v$, level$[v]$ is the length of a shortest path $s \leadsto v$, and following parent pointers from $v$ back to $s$ in $T$ traces such a shortest path

Bipartite graphs

Definition

A graph $G = (V, E)$ is bipartite if there is a partition $V = V_1 \cup V_2$ such that all edges have one end in $V_1$ and one end in $V_2$.

Using BFS to test bipartite-ness

Claim

Suppose $G$ is connected, run BFS from any $s$, and set $V_1 = \{v : \text{level}[v] \text{ even}\}$, $V_2 = \{v : \text{level}[v] \text{ odd}\}$.

Then $G$ is bipartite if and only if all edges have one end in $V_1$ and one end in $V_2$ (testable in $O(m)$)
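A sketch of the test, reusing the bfs function from above (again assuming a connected graph given as adjacency lists):

```python
def is_bipartite(adj, s):
    """2-color by BFS level parity, then check every edge; O(n + m)."""
    _, _, level = bfs(adj, s)
    for v in adj:
        for w in adj[v]:
            if level[v] % 2 == level[w] % 2:   # edge inside V1 or inside V2
                return False
    return True
```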

Computing the connected components

Idea: add an outer loop that runs BFS on successive vertices

Lec 06: Depth First Search

Going depth-first

The idea:

DFS implementations are based on stacks, either implicitly (recursion) or explicitly (as with queues for BFS).
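A recursive sketch recording the DFS forest and the start/finish times used below (graph representation and names are my own):

```python
def dfs(adj):
    """DFS over the whole graph; O(n + m)."""
    visited, parent, start, finish = set(), {}, {}, {}
    clock = [0]

    def explore(v):
        visited.add(v)
        clock[0] += 1; start[v] = clock[0]
        for w in adj[v]:
            if w not in visited:
                parent[w] = v              # tree edge
                explore(w)
        clock[0] += 1; finish[v] = clock[0]

    for v in adj:                          # one rooted tree per root vertex
        if v not in visited:
            parent[v] = None
            explore(v)
    return parent, start, finish
```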

The white path lemma

Claim ("white path lemma")

When we start exploring a vertex $v$, any $w$ that can be connected to $v$ by a path of unvisited vertices will be visited during explore($v$).

Another basic property

Claim

If $w$ is visited during explore$(v)$, there is a path $v \leadsto w$.

Consequences

Trees, forest, ancestors and descendants

Previous observation

DFS($G$) gives a partition of the vertices of $G$ into vertex-disjoint rooted trees $T_1, ..., T_k$ (DFS forest)

Definition

Suppose the DFS forest is $T_1, ... , T_k$ and let $u$, $v$ be two vertices

Key property

Claim

All edges in $G$ connect a vertex to one of its descendants or ancestors.

Back edges

Definition

a back edge is an edge in $G$ that connects an ancestor to a descendant and is not a tree edge.

Observation

All edges are either tree edges or back edges (key property).

Start and finish times

Observation

if start$[u]$ $<$ start$[v]$, then either finish$[u]$ < start$[v]$ or finish$[v]$ < finish$[u]$.

Cut vertices

Definition

for $G$ connected, a vertex $v$ in $G$ is a cut vertex if removing $v$ (and all edges that contain it) makes $G$ disconnected. Also called articulation points.

Finding the cut vertices (G connected)

The root $s$ is a cut vertex if and only if it has more than one child.

Definition

for a vertex $v$, let $a(v)$ be the minimum of level$[v]$ and of the levels of the endpoints of the back edges starting at $v$, and let $m(v) = \min\{a(w) : w$ in the subtree rooted at $v\}$

Using the values $m(v)$

For any $v$ (except the root), $v$ is a cut vertex if and only if it has a child $w$ with $m(w) \geq$ level$[v]$.

Computing the values $m(v)$

if $v$ has children $w_1, \ldots, w_k$, then $m(v) = \min\{a(v), m(w_1), \ldots, m(w_k)\}$
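A sketch for a connected graph, combining the recursive DFS, the $m(v)$ recurrence, and the two cut-vertex tests (names my own). The minimum over neighbours folds $a(v)$ and the children's $m$ values together; including the tree edge back to the parent is harmless, since it only contributes the parent's level, which never changes the outcome of the test $m(w) \geq$ level$[v]$:

```python
def cut_vertices(adj, s):
    """Cut vertices of a connected graph via the m(v) recurrence."""
    level, cuts = {s: 0}, set()

    def explore(v):
        m = level[v]                       # first term of a(v)
        children = 0
        for w in adj[v]:
            if w not in level:             # tree edge: w is a new child
                level[w] = level[v] + 1
                children += 1
                mw = explore(w)
                if level[v] > 0 and mw >= level[v]:
                    cuts.add(v)            # w's subtree cannot escape above v
                m = min(m, mw)
            else:                          # back edge (or edge to the parent)
                m = min(m, level[w])
        if level[v] == 0 and children > 1:
            cuts.add(v)                    # root with more than one child
        return m

    explore(s)
    return cuts
```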

Lec 07: Directed Graphs

Directed graphs basics

Definition

BFS and DFS for directed graphs

The algorithms work without any change.

Classification of edges

Suppose we have a DFS forest. Edges of $G$ are one of the following: tree edges, back edges (from a vertex to an ancestor), forward edges (from a vertex to a descendant, not a tree edge), and cross edges (all remaining edges).

If $w$ was already visited when the edge $(v, w)$ is examined, $(v, w)$ is a back, forward, or cross edge; otherwise it becomes a tree edge.

Testing acyclicity

$G$ has a cycle if and only if there is a back edge in the DFS forest

Topological ordering

Definition

Suppose $G = (V, E)$ is a DAG. A topological order is an ordering $<$ of $V$ such that for any edge $(v, w)$, we have $v < w$. No such order if there are cycles.

From a DFS forest

Suppose that $V$ is ordered using the reverse of the finishing order: $v < w \Leftrightarrow$ finish$[w]$ $<$ finish$[v]$. This is a topological order.

Consequence: topological order in $O(n + m)$.
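A sketch of the resulting algorithm: append each vertex when it finishes, then reverse (assumes the input really is a DAG):

```python
def topological_order(adj):
    """Topological order of a DAG by reverse finishing order; O(n + m)."""
    visited, order = set(), []

    def explore(v):
        visited.add(v)
        for w in adj[v]:
            if w not in visited:
                explore(w)
        order.append(v)                # v finishes now

    for v in adj:
        if v not in visited:
            explore(v)
    order.reverse()                    # reverse of the finishing order
    return order
```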

Strong connectivity

Definition

A directed graph $G$ is strongly connected if for all $v, w$ in $G$, there is a path $v \leadsto w$ (and thus a path $w \leadsto v$).

Observation

$G$ is strongly connected iff there exists $s$ such that for all $w$, there are paths $s \leadsto w$ and $w \leadsto s$.

Testing strong connectivity

Algorithm

Pick any $s$. Run BFS/DFS from $s$ in $G$, then from $s$ in the reverse graph $G^{\text{rev}}$ (all edges reversed); $G$ is strongly connected if and only if both searches visit all vertices.

Consequence

test in $O(n + m)$

Structure of directed graphs

Definition

a strongly connected component of $G$ is a maximal set of vertices $S \subseteq V$ such that for all $v, w$ in $S$, there are paths $v \leadsto w$ and $w \leadsto v$

Exercise

The vertices of strongly connected components form a partition of $V$.

Exercise

$v$ and $w$ are in the same strongly connected component if and only if there are paths $v \leadsto w$ and $w \leadsto v$.

Structure of directed graphs

A directed graph $G$ can be seen as a DAG of disjoint strongly connected components.

Kosaraju's algorithm for strongly connected components
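The algorithm's details are in the notes; a sketch of the two-pass structure (first pass: finishing order on $G$; second pass: DFS on the reverse graph in decreasing finish order, one component per tree):

```python
def kosaraju(adj):
    """Strongly connected components of a directed graph; O(n + m)."""
    visited, order = set(), []

    def explore(v, graph, out):
        visited.add(v)
        for w in graph[v]:
            if w not in visited:
                explore(w, graph, out)
        out.append(v)

    for v in adj:                          # pass 1: finishing order on G
        if v not in visited:
            explore(v, adj, order)

    rev = {v: [] for v in adj}             # build the reverse graph
    for v in adj:
        for w in adj[v]:
            rev[w].append(v)

    visited.clear()
    components = []
    for v in reversed(order):              # pass 2: decreasing finish time
        if v not in visited:
            comp = []
            explore(v, rev, comp)
            components.append(comp)
    return components
```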

Lec 08: Greedy Algorithms

Greedy algorithms

Context: we are trying to solve a combinatorial optimization problem:

Greedy strategy

Interval Scheduling

Interval Scheduling Problem

Input: \(n\) intervals \([s_1, f_1], [s_2, f_2], ... , [s_n, f_n]\).

Output: a maximum-size subset of pairwise disjoint intervals.

Algorithm

  1. \(S = \{\}\)
  2. Sort the intervals such that \(f_1 ≤ f_2 ≤ ··· ≤ f_n\)
  3. For \(i\) from 1 to \(n\) do
      if interval \(i\), \([s_i, f_i]\), has no conflicts with intervals in \(S\)
       add \(i\) to \(S\)
  4. return \(S\)
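A runnable version of this pseudocode (intervals as (s, f) pairs, closed on both ends, so two intervals conflict even when they only share an endpoint):

```python
def interval_scheduling(intervals):
    """Greedy by earliest finish time; returns a maximum-size disjoint subset."""
    chosen, last_finish = [], float("-inf")
    for s, f in sorted(intervals, key=lambda iv: iv[1]):
        if s > last_finish:            # no conflict with the chosen intervals
            chosen.append((s, f))
            last_finish = f
    return chosen
```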

Correctness: The Greedy Algorithm Stays Ahead

Assume \(O\) is an optimal solution. Our goal is to show \(|S| = |O|\). Write \(S = [i_1, ..., i_k]\) and \(O = [j_1, ..., j_m]\), both listed in increasing order of finish times.

Lemma: For all indices \(r ≤ k\) we have \(f(i_r) ≤ f(j_r)\).

Proof: We use induction. For \(r = 1\) the statement is true. Suppose \(r > 1\) and the statement is true for \(r − 1\). We will show that the statement is true for \(r\). By induction hypothesis we have \(f (i_{r−1}) ≤ f (j_{r−1})\). By the order on \(O\) we have \(f (j_{r−1}) < s(j_r)\). Hence we have \(f (i_{r−1}) < s(j_r)\). Thus at the time the greedy algorithm chose \(i_r\), the interval \(j_r\) was a possible choice. The greedy algorithm chooses an interval with the smallest finish time. So, \(f (i_r) ≤ f (j_r)\).

Theorem: The greedy algorithm returns an optimal solution. Proof by contradiction: if the output \(S\) is not optimal, then \(|S| < |O|\). \(i_k\) is the last interval in \(S\), and \(O\) must contain an interval \(j_{k+1}\). Apply the previous lemma with \(r = k\) to get \(f(i_k) ≤ f(j_k)\). Then \(f(i_k) ≤ f(j_k) < s(j_{k+1})\), so \(j_{k+1}\) was a possible choice for the greedy algorithm to add to \(S\). This is a contradiction.

Interval Coloring

Interval Coloring Problem

Input: \(n\) intervals \([s_1, f_1], [s_2, f_2], ..., [s_n, f_n]\)

Output: use the minimum number of colors to color the intervals, so that each interval gets one color and two overlapping intervals get two different colors.

Algorithm

  1. Sort the intervals such that \(s_1 ≤ s_2 ≤ ··· ≤ s_n\)
  2. For \(i\) from 1 to \(n\) do
      Use the minimum available color \(c_i\) to color the interval \(i\). (i.e. use the minimum number to color the interval \(i\) so that it doesn't conflict with the colors of the intervals that are already colored.)
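A sketch using a min-heap of finish times to decide whether an existing color has been freed. This variant reuses the earliest-finishing color rather than the minimum-numbered one, but it opens a new color only when every active interval contains the current start time, so it uses the same number of colors (the maximum overlap depth):

```python
import heapq

def interval_coloring(intervals):
    """Greedy coloring by start time; intervals = list of closed (s, f) pairs.
    Returns color[i] for each interval i."""
    order = sorted(range(len(intervals)), key=lambda i: intervals[i][0])
    color = [0] * len(intervals)
    active = []                        # min-heap of (finish, color) in use
    colors_used = 0
    for i in order:
        s, f = intervals[i]
        if active and active[0][0] < s:        # that color is free again
            _, c = heapq.heappop(active)
        else:
            colors_used += 1                   # open a new color
            c = colors_used
        color[i] = c
        heapq.heappush(active, (f, c))
    return color
```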

Correctness

Assume the greedy algorithm uses \(k\) colors. To prove correctness, we show that the problem cannot be solved with \(k − 1\) or fewer colors.

Proof of correctness: Suppose interval \(ℓ\) is the first interval to use color \(k\). At that moment, interval \(ℓ\) overlaps intervals of colors \(1, ..., k − 1\); call them \([s_{i_{1}}, f_{i_{1}}], [s_{i_{2}}, f_{i_{2}}], ..., [s_{i_{k−1}}, f_{i_{k−1}}]\). Since intervals are processed in order of start times, \(s_{i_{j}} ≤ s_{ℓ}\) for \(1 ≤ j ≤ k − 1\). Since all these intervals overlap \([s_{ℓ}, f_{ℓ}]\), we also have \(s_{ℓ} ≤ f_{i_{j}}\) for \(1 ≤ j ≤ k − 1\). Hence the time \(s_{ℓ}\) is contained in \(k\) intervals, so there is no \((k − 1)\)-coloring.

Minimizing Total Completion Time

The problem

Input: \(n\) jobs, job \(i\) requiring processing time \(p_{i}\)

Output: An ordering of the jobs such that the total completion time is minimized.

Note: The completion time of a job is defined as the time when it is finished.

Example: n = 5, processing times [2, 8, 1, 10, 5]
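Working this example out: the given order has completion times 2, 10, 11, 21, 26, for a total of 70; sorting by processing time gives the order [1, 2, 5, 8, 10], with completion times 1, 3, 8, 16, 26 and total 54.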

Algorithm

order the jobs in non-decreasing order of processing time

Correctness: Exchange Argument

Let \(L = [e_{1}, ..., e_{n}]\) be an optimal solution (a permutation of \([1, ..., n]\)). Suppose \(L\) is not in non-decreasing order of processing times. Then there exists \(i\) such that \(p(e_{i}) > p(e_{i+1})\). The sum of the completion times of \(L\) is \(T(L) = n\,p(e_{1}) + (n − 1)\,p(e_{2}) + ··· + p(e_{n})\). The contribution of \(e_{i}\) and \(e_{i+1}\) is \((n − i + 1)\,p(e_{i}) + (n − i)\,p(e_{i+1})\). Now switch \(e_{i}\) and \(e_{i+1}\) to get a permutation \(L'\); their contribution becomes \((n − i + 1)\,p(e_{i+1}) + (n − i)\,p(e_{i})\). Nothing else changes, so \(T(L') − T(L) = p(e_{i+1}) − p(e_{i}) < 0\), contradicting the optimality of \(L\).

Lec 09: Dijkstra's Algorithm

Shortest paths exist in any directed weighted graph, under the assumption that \(G\) has no negative-weight cycles.

The shortest path weight from \(u\) to \(v\): \[\delta(u, v) = \begin{cases} \min\{w(P) : P \text{ a path } u \leadsto v\} & \text{if there exists a path from } u \text{ to } v \\ \infty & \text{otherwise} \end{cases} \]

Dijkstra's algorithm: Explanation

Dijkstra's algorithm is a greedy algorithm

Input: A weighted directed graph with non-negative edge weights

For all vertices, maintain quantities

Initialize \(C = ∅\), repeat the following until \(C = V\):

Which Abstract Data Type (ADT) should we use for the vertices? A priority queue, implemented as a binary min-heap.

Dijkstra's algorithm: Pseudocode

Check the course notes.
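Since the pseudocode lives in the notes, a sketch using Python's heapq; instead of a decrease-key operation it pushes duplicate entries and skips stale ones when popped ("lazy deletion"), which keeps the same asymptotics:

```python
import heapq

def dijkstra(adj, s):
    """adj[v] = list of (w, weight) pairs with non-negative weights.
    Returns d with d[v] = delta(s, v) for every reachable v."""
    d = {s: 0}
    C = set()                              # finalized vertices
    heap = [(0, s)]
    while heap:
        dist, v = heapq.heappop(heap)
        if v in C:
            continue                       # stale heap entry, skip
        C.add(v)
        for w, weight in adj[v]:
            if w not in C and dist + weight < d.get(w, float("inf")):
                d[w] = dist + weight       # relax edge (v, w)
                heapq.heappush(heap, (d[w], w))
    return d
```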

Dijkstra's algorithm: Complexity analysis

With a binary min-heap, each of the \(O(m)\) insertions and \(O(n)\) extractions costs \(O(\log n)\), for \(O((n + m) \log n)\) in total.

Dijkstra's algorithm: Proof of Correctness

Check the notes for a complete proof.

Claim: For each vertex \(v ∈ V\), we have \(d[v] = δ(s, v)\) at the time when \(v\) is added to the set \(C\). To prove the claim, we follow these steps:

Lec 10: Minimum Spanning Trees

Spanning trees

Definition

Now, suppose the edges have weights \(w(e_i)\)

Goal

A spanning tree with minimum total weight

Kruskal's algorithm

For pseudocode, check the notes.
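A sketch in lieu of the notes' pseudocode; the union-find here uses union by size and path compression (a stronger heuristic than the simple one analyzed below), but any disjoint-set structure fits:

```python
def kruskal(n, edges):
    """edges = list of (weight, u, v), vertices 0..n-1.
    Returns the edges of a minimum spanning tree."""
    parent = list(range(n))
    size = [1] * n

    def find(x):                       # representative of x's component
        while parent[x] != x:
            parent[x] = parent[parent[x]]      # path compression
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(edges):      # by increasing weight
        ru, rv = find(u), find(v)
        if ru != rv:                   # adding {u, v} creates no cycle
            if size[ru] < size[rv]:
                ru, rv = rv, ru
            parent[rv] = ru            # union by size
            size[ru] += size[rv]
            tree.append((u, v, w))
    return tree
```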

Augmenting sets without cycles

Claim

Let \(G\) be a connected graph, and let \(A\) be a subset of the edges of \(G\). If \((V, A)\) has no cycle and \(|A| < n − 1\), then one can find an edge \(e\) not in \(A\) such that \(A ∪ \{e\}\) still has no cycle.

Proof

In any graph, #vertices − #con. comp. ≤ #edges. For \((V, A)\), this gives \(n − c < n − 1\) so \(c > 1\). Take any edge on a path that connects two components.

Properties of the output

Claim

If the output is \(A = [e_{1}, ..., e_{r}]\), then \((V, A)\) is a spanning tree (and \(r = n − 1\))

Proof

Of course, \((V, A)\) has no cycle. Suppose \((V, A)\) is not a spanning tree. Then, there exists an edge \(e\) not in \(A\), such that \((V, A ∪ \{e\})\) still has no cycle.

  1. Case 1: \(w(e) < w(e_{1})\). Impossible, since \(e_{1}\) is the edge with the smallest weight.
  2. Case 2: \(w(e_{i}) < w(e) < w(e_{i+1})\). Impossible: at the moment we inserted \(e_{i+1}\), we had already considered and rejected \(e\), meaning \(e\) created a cycle with \(e_{1}, ..., e_{i}\); this contradicts \((V, A ∪ \{e\})\) having no cycle.
  3. Case 3: \(w(e_{r}) < w(e)\). Impossible: we would have included \(e\) in \(A\), since there is no cycle in \(A ∪ \{e\}\).

Exchanging edges

Claim

Let \((V, A)\) and \((V, T)\) be two spanning trees, and let \(e\) be an edge in \(T\) but not in \(A\).
Then there exists an edge \(e'\) in \(A\) but not in \(T\) such that \((V, T + e' − e)\) is still a spanning tree.
Bonus: \(e'\) is on the cycle that \(e\) creates in \(A\).

Proof

Write \(e = \{v, w\}\). \((V, A + e)\) contains a cycle \(c = v, w, ..., v\). Removing \(e\) from \(T\) splits \((V, T − e)\) into two connected components \(T_1\), \(T_2\). \(c\) starts in \(T_1\), crosses over to \(T_2\), so it contains another edge \(e'\) between \(T_2\) and \(T_1\). \(e'\) is in \(A\), but not in \(T\). \((V, T + e' − e)\) is a spanning tree.

Proof of correctness

Let \(A\) be the output of the algorithm. Let \((V, T)\) be any spanning tree. If \(T \neq A\), let \(e\) be an edge in \(T\) but not in \(A\). So there is an edge \(e'\) in \(A\) but not in \(T\) such that \((V, T + e' − e)\) is a spanning tree, and \(e'\) is on the cycle that \(e\) creates in \(A\). During the algorithm, we considered \(e\) but rejected it, because it created a cycle in \(A\). All other elements in this cycle have smaller (or equal) weight so \(w(e') ≤ w(e)\). So \(T' = T + e' − e\) has weight \(≤ w(T)\), and one more common element with \(A\).

Keep going: repeating the exchange yields spanning trees of weight \(≤ w(T)\) with ever more edges in common with \(A\), until we reach \(A\) itself; hence \(w(A) ≤ w(T)\), and \(A\) is a minimum spanning tree.

Data structures

Operations on disjoint sets of vertices:

Check the course notes for an updated pseudocode.

An OK solution

Using an array of linked lists. To support find, add an array of indices: \(X[i]\) = the set that contains \(i\).

Worst case

Kruskal's algorithm

Worst case \(O(m \log(m) + n^2)\)

A simple heuristics for Union

Modified Union

Key observation

Worst case for one union is still \(\Theta(n)\), but the total time improves.

Total cost of union per vertex is \(O(\log n)\)

Conclusion

\(O(n \log(n))\) for all unions and \(O(m \log(m))\) total

Lec 11: Dynamic Programming

A slow recursive algorithm

The naive Fibonacci algorithm runs in \(T(n) \in \Theta(\varphi^n)\), where \(\varphi = \frac{1 + \sqrt{5}}{2}\) is the golden ratio.

A better algorithm

Top-down memoization

Bottom-up tabulation

Bottom-up with two integers (constant space)
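A sketch of the top-down memoized version and the two-integer bottom-up version (names are mine; the memoized one leans on functools.lru_cache):

```python
from functools import lru_cache

@lru_cache(maxsize=None)               # top-down memoization
def fib_memo(n):
    return n if n <= 1 else fib_memo(n - 1) + fib_memo(n - 2)

def fib_two_ints(n):
    """Bottom-up with two integers: O(n) additions, O(1) extra space."""
    a, b = 0, 1                        # invariant: a = F(i), b = F(i+1)
    for _ in range(n):
        a, b = b, a + b
    return a
```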

A Recipe for Designing a D. P. Algorithm

Dynamic programming

Key features

Dynamic programming vs divide-and-conquer

The Interval scheduling Problem

Input

Output

Sketch of the algorithm

Basic idea

either we choose \(I_{n}\) or not.

The indices \(p_{j}\)

Assume \(I_{1}, ..., I_{n}\) sorted by increasing end time: \(f_{i} ≤ f_{i+1}\)

Claim

For all \(j\), the set of intervals \(I_{k}\), \(k < j\), that do not overlap \(I_{j}\) is of the form \(I_{1}, ..., I_{p_{j}}\) for some \(0 ≤ p_{j} < j\) (\(p_{j} = 0\) if there is no such interval)

The algorithm will need the \(p_{i}\)'s.

The algorithm of FindPj is available in the notes.

Runtime

\(O(n \log(n))\) for sorting by the \(f_i\), plus \(O(n)\) for the loops

Main procedure

Definition

\(M[i]\) is the maximal weight we can get with intervals \(I_{1}, ..., I_{i}\)

Recurrence

\(M [0] = 0\) and for \(i ≥ 1\) \[M [i] = \max(M [i − 1], M [p_{i}] + w_{i})\]

Runtime

\(O(n \log(n))\) (sorting twice) and \(O(n)\) (finding the \(M [i]\)'s)
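A sketch combining FindPj (here, one binary search per interval) with the main recurrence; intervals are closed \((s, f, w)\) triples, and the representation is my own:

```python
import bisect

def weighted_interval_scheduling(intervals):
    """intervals = list of (s, f, w) triples. Sort by finish time, find each
    p_i by binary search, then apply M[i] = max(M[i-1], M[p_i] + w_i)."""
    intervals = sorted(intervals, key=lambda iv: iv[1])
    finishes = [f for _, f, _ in intervals]
    n = len(intervals)
    # p[i] = number of intervals finishing strictly before interval i starts
    p = [bisect.bisect_left(finishes, s) for s, _, _ in intervals]
    M = [0] * (n + 1)                  # M[i]: best value using intervals 1..i
    for i in range(1, n + 1):
        _, _, w = intervals[i - 1]
        M[i] = max(M[i - 1], M[p[i - 1]] + w)
    return M[n]
```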

Exercise

recover the optimum set for an extra \(O(n)\)

The 0/1 Knapsack Problem

Input

Output

Fractional Knapsack

Can be solved with a greedy algorithm.

Setting up the recurrence

Basic idea

Define \(O[w, i]\) := the maximum value achievable using a knapsack of capacity \(w\) and items \(1, ..., i\). Either we choose item \(n\) or not, so the optimum \(O[W, n]\) is the max of two values: \(v_{n} + O[W − w_{n}, n − 1]\) if we choose \(n\) (possible only when \(w_{n} ≤ W\)), and \(O[W, n − 1]\) if we don't choose \(n\).

Initial conditions

Algorithm

Pseudocode is available in the notes.
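In place of the notes' pseudocode, a sketch of the table-filling (0-indexed lists; \(O[w][i]\) as defined above):

```python
def knapsack_01(values, weights, W):
    """Θ(nW) dynamic program for 0/1 knapsack; returns the optimal value."""
    n = len(values)
    O = [[0] * (n + 1) for _ in range(W + 1)]
    for w in range(W + 1):
        for i in range(1, n + 1):
            O[w][i] = O[w][i - 1]                      # don't choose item i
            if weights[i - 1] <= w:                    # or choose it
                O[w][i] = max(O[w][i],
                              values[i - 1] + O[w - weights[i - 1]][i - 1])
    return O[W][n]
```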

Runtime

\(\Theta(nW)\)

Discussion

This is called a pseudo-polynomial algorithm. In our word RAM model, we have been assuming all \(v_{i}\)'s and \(w_{i}\)'s fit in a word, so the input size is \(Θ(n)\) words. But the runtime also depends on the values of the inputs.

0/1 knapsack is NP-complete, so we don't really expect to do much better

Lec 12: Dynamic Programming - Part 2

The Longest Increasing Subsequence Problem

Input

An array \(A[1..n]\) of integers

Output

A longest increasing subsequence of \(A\) (or just its length) (does not need to be contiguous)

Recurrence Relation

Let \(L[i]\) be the length of the longest increasing subsequence of \(A[1..i]\) that ends with \(A[i]\)

Then \(L[1] = 1\).

A longest increasing subsequence \(S\) ending at \(A[i]\) looks like \[S = [..., A[j], A[i]] = S' \text{ concatenated with } [A[i]],\] where \(S'\) is a longest increasing subsequence ending at \(A[j]\) (or \(S'\) is empty). We don't know \(j\), but we can try all \(j < i\) for which \(A[j] < A[i]\): \[L[i] = 1 + \max\{L[j] : j < i,\ A[j] < A[i]\},\] with \(L[i] = 1\) if no such \(j\) exists.

Iterative algorithm

Check the notes for pseudocode.
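A sketch of the \(\Theta(n^2)\) iterative algorithm described by the recurrence:

```python
def longest_increasing_subsequence(A):
    """L[i] = length of the longest increasing subsequence ending at A[i]."""
    n = len(A)
    L = [1] * n
    for i in range(1, n):
        for j in range(i):
            if A[j] < A[i]:                # A[i] can extend a LIS ending at A[j]
                L[i] = max(L[i], L[j] + 1)
    return max(L, default=0)
```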

Runtime

\(\Theta(n^2)\)

The Longest Common Subsequence Problem

Input

Arrays \(A[1..n]\) and \(B[1..m]\) of characters

Output

The maximum length \(k\) of a common subsequence to \(A\) and \(B\) (subsequences do not need to be contiguous)

A bivariate recurrence

let \(M[i, j]\) be the length of a longest common subsequence of \(A[1..i]\) and \(B[1..j]\). Then \(M[i, 0] = M[0, j] = 0\), and for \(i, j ≥ 1\): \[M[i, j] = \begin{cases} M[i − 1, j − 1] + 1 & \text{if } A[i] = B[j] \\ \max(M[i − 1, j], M[i, j − 1]) & \text{otherwise} \end{cases}\]

The algorithm computes all \(M [i, j]\), using two nested loops, so runtime \(\Theta(mn)\)
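A sketch of the table-filling (0-indexed strings, table padded with a zero row and column):

```python
def lcs_length(A, B):
    """M[i][j] = length of an LCS of A[1..i] and B[1..j]; Θ(mn)."""
    n, m = len(A), len(B)
    M = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if A[i - 1] == B[j - 1]:
                M[i][j] = M[i - 1][j - 1] + 1
            else:
                M[i][j] = max(M[i - 1][j], M[i][j - 1])
    return M[n][m]
```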

Lec 13: Dynamic Programming - Part 3

Edit Distance

Input

arrays \(A[1..n]\) and \(B[1..m]\) of characters

Output

The minimum number of operations {add, delete, change} to transform \(A\) into \(B\)

Recurrence

Let \(D[i, j]\) be the edit distance between \(A[1..i]\) and \(B[1..j]\). Then \(D[i, 0] = i\), \(D[0, j] = j\), and for \(i, j ≥ 1\): \[D[i, j] = \begin{cases} D[i − 1, j − 1] & \text{if } A[i] = B[j] \\ 1 + \min(D[i − 1, j − 1], D[i − 1, j], D[i, j − 1]) & \text{otherwise} \end{cases}\] (the three options correspond to change, delete, add).

The algorithm computes all \(D[i, j]\), using two nested loops, so runtime \(\Theta(mn)\)
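A sketch following that recurrence:

```python
def edit_distance(A, B):
    """D[i][j] = edit distance between A[1..i] and B[1..j]; Θ(mn)."""
    n, m = len(A), len(B)
    D = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        D[i][0] = i                    # delete all of A[1..i]
    for j in range(m + 1):
        D[0][j] = j                    # add all of B[1..j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if A[i - 1] == B[j - 1]:
                D[i][j] = D[i - 1][j - 1]
            else:
                D[i][j] = 1 + min(D[i - 1][j - 1],   # change
                                  D[i - 1][j],       # delete
                                  D[i][j - 1])       # add
    return D[n][m]
```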

Optimal binary search trees

Input

Output

Recurrence

Definition

define \(M[i, j]\) as the minimum, over binary search trees on the keys \(i, ..., j\), of the expected search cost \(\sum_{ℓ=i}^{j} p_ℓ \,(\text{depth}(ℓ) + 1)\), with \(M[i, j] = 0\) when \(i > j\)

Recurrence

\begin{align*} M [i, j] & = \min_{i≤k≤j} \left(M [i, k − 1] + \sum^{k−1}_{ℓ=i} p_ℓ + p_k + M [k + 1, j] + \sum^j_{ℓ=k+1} p_ℓ\right) \\ & = \min_{i≤k≤j} \left(M [i, k − 1] + M [k + 1, j] \right) + \sum^j_{ℓ=i} p_ℓ \end{align*}

Algorithm

Optimization

Prefix sums

Check the notes for pseudocode

Runtime

\(O(n^3)\)
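A sketch with half-open indexing (C[i][j] covers keys i..j−1, so empty ranges need no special casing) and prefix sums for the probability totals:

```python
def optimal_bst(p):
    """p[k] = access probability of key k (0-indexed); returns the minimum
    expected search cost over all BSTs on the keys. O(n^3)."""
    n = len(p)
    prefix = [0] * (n + 1)
    for k in range(n):
        prefix[k + 1] = prefix[k] + p[k]       # prefix[j] = p[0] + ... + p[j-1]

    C = [[0.0] * (n + 1) for _ in range(n + 1)]
    for length in range(1, n + 1):             # solve intervals by length
        for i in range(n - length + 1):
            j = i + length
            C[i][j] = (min(C[i][k] + C[k + 1][j] for k in range(i, j))
                       + prefix[j] - prefix[i])
    return C[0][n]
```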

Independent Sets in Trees

An independent set of a graph \(G = (V, E)\) is a subset \(S ⊆ V\) such that there are no edges between elements of \(S\).

The maximum independent set problem (for a general graph):

Input: \(G(V, E)\)
Output: An independent set of maximum cardinality.

Algorithm

\(I(v)\) := size of largest independent set of subtree rooted at \(v\) \[I(v) = \max \left\{1 + \sum_{u \in S(v)} I(u), \sum_{u \in C(v)} I(u)\right\}\] where \(C(v)\) is the set of children of \(v\) and \(S(v)\) is the set of grandchildren of \(v\)
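A memoized sketch of this recurrence (the tree is given as a children map, with an empty list for leaves; lru_cache ensures each \(I(v)\) is computed once, for \(O(n)\) total):

```python
from functools import lru_cache

def max_independent_set_tree(children, root):
    """Size of a maximum independent set of a rooted tree."""

    @lru_cache(maxsize=None)
    def I(v):
        # take v, plus the best sets for its grandchildren ...
        take = 1 + sum(I(g) for u in children[v] for g in children[u])
        # ... or skip v and take the best sets for its children
        skip = sum(I(u) for u in children[v])
        return max(take, skip)

    return I(root)
```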

Lec 14: Dynamic Programming - Part 4

The Bellman-Ford algorithm

Dynamic programming for shortest paths

Definition

For \(i = 0, ..., n - 1\), set \(\delta_i(s, v)\) to be the length of a shortest \(s \leadsto v\) path with at most \(i\) edges. If no such path exists, set \(\delta_i(s, v) = \infty\).

Easy observations

Pseudo-code

Check the course notes for the pseudo-code
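A sketch of the \(n − 1\) rounds of relaxations, plus the extra round (see below) that detects a negative cycle:

```python
def bellman_ford(n, edges, s):
    """edges = list of (u, v, w) for edge u -> v of weight w; vertices 0..n-1.
    Returns (d, has_negative_cycle); O(nm)."""
    INF = float("inf")
    d = [INF] * n
    d[s] = 0
    for _ in range(n - 1):             # iterations i = 1 .. n-1
        for u, v, w in edges:
            if d[u] + w < d[v]:
                d[v] = d[u] + w        # relaxation
    # one extra round: any further improvement reveals a negative cycle
    has_negative_cycle = any(d[u] + w < d[v] for u, v, w in edges)
    return d, has_negative_cycle
```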

Runtime

\(O(nm)\)

Correctness

Claim

For all \(i\), after iteration \(i\), we have \(d[v] ≤ δ_{i}(s, v)\) for all \(v\).

Proof

By induction on \(i\)

Relaxations

The operation \(d[v] ← \min(d[v], d[u] + w(u, v))\) is called a relaxation

Claim

\(d[v]\) can only decrease through a relaxation, and if \(δ(s, u) ≤ d[u]\) and \(δ(s, v) ≤ d[v]\) before relaxation, then \(δ(s, v) ≤ d[v]\) post-relaxation.

Proof

First part is obvious. For the second part, we have \begin{align*} δ(s, v) & ≤ δ(s, u) + w(u, v) \\ & ≤ d[u] + w(u, v) \\ \end{align*}

Note that \(d[v]\) gets the value \(\min\{d[v], d[u] + w(u, v)\}\), and both terms are at least \(δ(s, v)\); hence \(d[v] \geq \delta(s, v)\) after the relaxation.

Correctness, part 2

Claim

For \(i = 0, ..., n − 1\), after iteration \(i\), \(δ(s, v) ≤ d[v] ≤ δ_{i}(s, v)\) for all \(v\).

Proof

This follows from the previous two parts

Summary

If there is no negative cycle

If there is a negative cycle

\(v_{1}, v_{2}, ..., v_{k}\), with \(v_{k} = v_{1}\).

Conclusion

for extra \(O(m)\), can check the presence of a negative cycle

The Floyd-Warshall algorithm

Floyd-Warshall

Remark

doing Bellman-Ford from all \(u\) takes \(O(mn^2)\)

Looking at subsets of vertices

Subproblems for dynamic programming

Definition

Pseudo-code

Claim

\(D_{i}(v_{j}, v_{k}) = \min(D_{i−1}(v_{j}, v_{k}),\ D_{i−1}(v_{j}, v_{i}) + D_{i−1}(v_{i}, v_{k}))\)

Check the course notes for the pseudo-code
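A sketch of the triple loop (updating \(D\) in place is safe: row and column \(i\) do not change during round \(i\), since \(D[i][i] = 0\)):

```python
def floyd_warshall(n, weight):
    """weight[(u, v)] = weight of edge u -> v; vertices 0..n-1.
    Returns the matrix of all shortest-path weights. Θ(n^3)."""
    INF = float("inf")
    D = [[0 if j == k else weight.get((j, k), INF) for k in range(n)]
         for j in range(n)]
    for i in range(n):                 # allow vertex i as an intermediate
        for j in range(n):
            for k in range(n):
                if D[j][i] + D[i][k] < D[j][k]:
                    D[j][k] = D[j][i] + D[i][k]
    return D
```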

Runtime and memory

\(\Theta(n^3)\)

Lec 15: Polynomial Time Reduction

Decision Problems

Given a problem instance \(I\), answer a certain question “yes” or “no”.

Problem Instance

Input for the specified problem.

Problem Solution

Correct answer (“yes” or “no”) for the specified problem instance. \(I\) is a yes-instance if the correct answer for the instance \(I\) is “yes”. \(I\) is a no-instance if the correct answer for the instance \(I\) is “no”.

Size of a problem instance

Size(\(I\)) is the number of bits required to specify (or encode) the instance \(I\).

Polynomial Reduction

A (decision) problem \(A\) is polynomial time reducible to a decision problem \(B\), if there is a polynomial time algorithm \(F\) that transforms any instance \(I _{A}\) of \(A\) to an instance \(I _{B}\) of \(B\) such that \[I_{A} \text{ is a yes-instance of } A \iff I_{B} \text{ is a yes-instance of } B\]

We write \(A ≤_{P} B\) if such a polynomial time reduction exists. We also write \(A =_{P} B\) if \(A ≤_{P} B\) and \(B ≤_{P} A\).

Transitivity Of Polynomial time Reductions

If \(A ≤_{P} B\) and \(B ≤_{P} C\), then \(A ≤_{P} C\).

Simple Reductions

Maximum Clique (Clique)

\(S ⊆ V\) is a clique if \(\{u, v\} ∈ E\) for all distinct \(u, v ∈ S\).

Input: \(G = (V, E)\) and an integer \(k\)

Output: (the answer to the question) is there a clique in \(G\) with at least \(k\) vertices?

Maximum Independent Set (IS)

\(S ⊆ V\) is an independent set if \(\{u, v\} ∉ E\) for all \(u, v ∈ S\).

Input: \(G = (V, E)\) and an integer \(k\)

Output: (the answer to the question) is there an independent set in \(G\) with at least \(k\) vertices?

Minimum Vertex Cover (VC)

\(S ⊆ V\) is a vertex cover if \(\{u, v\} \cap S \neq \emptyset\) for all \(\{u, v\} ∈ E\).

Input: \(G = (V, E)\) and an integer \(k\)

Output: (the answer to the question) is there a vertex cover in \(G\) with at most \(k\) vertices?

Proposition

Clique \(=_{P}\) IS
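Both directions use the same transformation: \(S\) is a clique in \(G\) exactly when \(S\) is an independent set in the complement graph. A sketch of the instance map \((G, k) \mapsto (\bar{G}, k)\):

```python
def complement_edges(V, E):
    """Edges of the complement graph: all non-adjacent pairs of distinct
    vertices. (G, k) is a yes-instance of Clique iff (complement of G, k)
    is a yes-instance of IS."""
    E = {frozenset(e) for e in E}
    return {frozenset((u, v)) for u in V for v in V
            if u != v and frozenset((u, v)) not in E}
```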

Lemma

Assume \(G = (V, E)\) is a graph. \(S ⊆ V\) is a vertex cover \(⇐⇒\) \(V −S\) is an independent set in \(G\).

Proposition

VC \(=_{P}\) IS

More Simple Reductions

Hamiltonian Cycle (HC)

A cycle is a Hamiltonian Cycle if it touches every vertex exactly once.

Input: Undirected graph \(G = (V, E)\)

Output: (the answer to the question) Does \(G\) have a Hamiltonian Cycle?

Hamiltonian Path (HP)

A path is a Hamiltonian Path if it touches every vertex exactly once.

Input: Undirected graph \(G = (V, E)\)

Output: (the answer to the question) Does \(G\) have a Hamiltonian Path?

Proposition

HC \(=_{P}\) HP

An Important Problem

3-SAT

Input: A CNF-fomula in which each clause has at most three literals.

Output: (the answer to the question) is there a truth assignment to the variables that satisfies all the clauses?

A More Complicated Reduction

Theorem

3-SAT \(≤_{P}\) IS

Lec 16: NP-completeness

NP Class

For a problem \(X\), we represent an instance of \(X\) as a binary string \(S\).

Definition: NP

A problem \(X\) is in NP, if there is a polynomial time verification algorithm \(ALG_{X}\) such that the input \(S\) is a yes-instance iff there is a proof (certificate) \(t\) which is a binary string of length \(\operatorname{poly}(|S|)\) so that \(ALG_{X} (S, t)\) returns yes.

Examples

Example 1: Vertex Cover

\(Alg_{v}(S, t)\): go through all of \(E\) and check that \(t\) covers every edge and \(|t| ≤ k\).

Example 2: 3-SAT

\(Alg_{v}(S, t)\): check whether \(t\) satisfies all the clauses.

Note: Clique, IS, HC, HP, Subset-Sum are all in NP.

Remarks

  1. Are all problems in NP? No.

    Is a given graph non-Hamiltonian (i.e. no Hamiltonian Cycle)?

    Definition: Co-NP

    co-NP is the set of decision problems whose no-instances can be certified in polynomial time.

  2. Every polynomial time solvable decision problem is in NP.

    Definition

    P is the set of decision problems that can be solved in polynomial time.

    \(P \subseteq NP\)

  3. The most important open problem in theoretical computer science: is P equal to NP?
  4. The name NP comes from Non-deterministic Polynomial time. A non-deterministic machine, roughly speaking, has the power to correctly guess a solution.

NP-Completeness

Informally, we say a problem is NP-complete if it is among the hardest problems in NP.

Definition

A problem \(X ∈ NP\) is NP-complete if \(Y ≤_P X\) for all \(Y ∈ NP\).

Fact

P = NP \(⇐⇒\) an NP-complete problem can be solved in polynomial time.

Theorem (Cook-Levin)

3-SAT is NP-complete.

Consequences

If we can prove 3-SAT \(≤_P X\), then \(X\) is NP-complete. For example IS \(∈\) NPC, since 3-SAT \(≤_P\) IS.

To prove that a problem \(X ∈\) NP is NP-complete, we just need to find an NP-complete problem \(Y\) and prove that \(Y ≤_{P} X\).

Plan to prove the Cook–Levin theorem

Actual proof omitted. Check the course notes for details.

Some NP-complete problems

Lec 17: NP-completeness Part 2