CS62 - Spring 2010 - Lecture 38

CS62 - Spring 2011 - Lecture 38

QUIZ! Problem 16.11

Cycle detection
   - How can we determine if an undirected graph has a cycle?
      - run DFS started at some node marking nodes as visited as we go
      - if we try and visit a node that is already marked, we've found a cycle
      - what if the graph isn't connected?
         - we must make sure to run DFS on all of the different parts
      - how can we make sure this happens?
         - traverse all of the nodes, if we find a node that hasn't been visited, run DFS from there

   - look at hasCycles in graph_algorithms.cpp code
      - why do we need to pass the parent?
      - why do we need two methods?
      - what is the parameter passing mechanism used to pass the graph? why?

   - what is the running time of our cycle detection algorithm?
      - how many times do we call dfsCycle on each vertex?
         - exactly once,
         - the first thing we do is set visited to true for that vertex
         - and will never revisit a visited vertex
      - what is the cost of each call to dfsCycle?
         - depends on the representation
      - adjacency matrix:
         - we need to traverse all V entries to get the neighbors
         - O(|V|^2) overall
      - adjacency list:
         - a little trickier
         - how many times do we process each edge?
            - once
         - O(|V| + |E|), which for a connected graph is O(|E|)

connectedness: given an undirected graph, is the graph connected?
   - how might we do this?
      - run depth first search (or breadth first search)
      - keep track of which nodes we visit
      - if we visit all of the nodes, then it's connected

   - first, let's take a look at dfs in graph_algorithms.cpp
      - what are the parameters?
         - the current vertex v
         - the set of visited vertices
         - the graph itself
      - what does it do?
         - just like before, visits the current node
         - then recursively visits the unvisited neighbors
   - what does grop_isConnected do?
      - runs depth first search
         - what is adjMap.begin()->first?
            - adjMap.begin() is an iterator to the first element of the map/graph
            - ->first will give us the key, i.e. the vertex
            - basically, gets any vertex from the graph
      - what does the for loop check?
         - checks to see if there are any unvisited nodes
         - if there are, then it's NOT connected
      - notice again that we need to use a const_iterator
   - what is the running time?
      - Again, we'd like to ask it with respect to |V| and |E|
      - It's going to depend on the graph... what is the worst case running time?
      - what is the running time of grop_isConnected (without the dfs call)?
         - O(|V|)
      - how many times do we call dfs on for each vertex?
         - exactly once
      - what is the cost of each call to dfs?
         - again, will depend on the graph representation
      - adjacency matrix:
         - we need to traverse all V entries to get the neighbors
         - O(|V|^2) overall
      - adjacency list:
         - again, how many times do we process each edge?
            - once
         - O(|V| + |E|), which for a connected graph is O(|E|)
      - is this surprising?
         - NO, it's very similar to cycle detection, utilizing DFS

connected components: given an undirected graph, return the largest subgraphs that are connected
   - how does it differ from just asking about connectedness?
      - similar idea as connectedness, but like cycle detection, we need to make sure we examine all of the vertices in the graph
   - how might we do this?
      - pick a node
      - run depth first search on that node keeping track of both the nodes we visited
      - all nodes visited during THAT dfs are one connected subgraph
      - pick a node that is unvisited and repeat
      - stop when all nodes have been visited
   - we can code this in psudocode:

      void dfs_get_component(vertex v, list of vertices: component) {
         set v as visited
         add v to component
         for (all neighbors u of v){
            if (we have NOT visited u ){
               dfs_get_component(u, component)
         }       }
      (list of lists of vertices) grop_connected_components(const map<int, list<int> >& adjMap){
         list of lists of vertices connected_components;
         for (all vertices v){
            if (we have NOT visited v) {                create a new empty list of vertices: current
               dfs_get_component(v, current)
               add current to connected_components
            }
         }

         return connected_components
      }

   - dfs_get_component looks is like DFS except we also keep track of the component
      - why doesn't it return anything?
         - the work that is actually done is modifying component
         - what must that mean for component?
            - either it must be a pointer or
            - it must be passed by reference (for our graph assignment, we'll do this option so you don't have to worry about memory management)
   - what does grop_connected_components do?
      - connected_components keeps track of the different subgraphs
      - we traverse each of the vertices, if we haven't visited it, call dfs_get_component
   - if we're representing vertices as ints, what are the types of:
      - component?
         - list<int>
      - connected_componenents?
         - list<list<int> >
   - how are we keeping track of visited?
      - for each call to grop_connected_components, we'll need to keep track of whether or not each node has been visited
      - will need to communicate that to dfs_get_component and make sure it's updated there
   - what is the running time?
      - using similar analysis to the last two approaches O(|V|^2) for adjacency matrix and O(|V| + |E|) for adjacency list

random C++ for the day: sizeof
   - the sizeof command allows us to get the number of bytes of a type (built-in types, classes, etc.)
   - it's used by the compiler in a number of places, including for knowing how far to increment a pointer
   - look at sizeof.cpp code
      - we can ask for the size of built-in types like char, int, float and double
      - as well for classes like vector, string and map
      - notice that we can also ask the size of particular variables
   - what do the results tell us?
      - char, int, float and double are all what we would expect
      - for map and vector must use pointers (which we'd expect) since they're the same size, regardless of type
      - arrays are sizeof(arrayType) * number of elements
      - strings must also be implemented with pointers
         - though literal strings are character arrays