CS 10: Spring 2014

Lecture 17, May 2

Code discussed in lecture

Binary search trees, continued

Last time, we saw how to query a binary search tree and how to insert into one. We started looking at how to remove a node from a binary search tree. Let's restart our look at removing a node.

Removing a node from a binary search tree

The overall strategy for removing a node z from a binary search tree has three basic cases but, as we shall see, one of the cases is a bit tricky.

A word about the case in which z has two children. Many books, including our textbook and the first two editions of Introduction to Algorithms, treated this case differently and, at first blush, more simply. Instead of moving z's successor into z's position in the binary search tree, they took the information in z's successor (here, the key and value instance variables), copied them into z, and then removed this successor. Because z's successor cannot have two children (think about why), removing it will fall into one of the first two cases.

That approach turns out to be quite a bit simpler than what we're going to do. So why are we adding all the extra complexity? Because the code that uses the binary search tree might keep references to nodes within the tree. If the code to remove a node changes what's in a node, and if it removes a different node than the one requested, all heck can break loose in the rest of the program. So, instead, we will guarantee that if you ask to remove node z, then it is node z and only node z that is actually removed, and all references to all other nodes remain valid.

The code for removing a given node z from a binary search tree organizes its cases a bit differently from the three cases outlined previously by considering the four cases shown here:

In order to move subtrees around within the binary search tree, we use a helper method, transplant, which replaces one subtree as a child of its parent with another subtree. When transplant replaces the subtree rooted at node u with the subtree rooted at node v, node u's parent becomes node v's parent, and u's parent ends up having v as its appropriate child.

  protected void transplant(Node u, Node v) {
    if (u.parent == sentinel)     // was u the root?
      root = v;                   // if so, now v is the root
    else if (u == u.parent.left)  // otherwise adjust the child of u's parent
      u.parent.left = v;
    else
      u.parent.right = v;

    if (v != sentinel)      // if v wasn't the sentinel ...
      v.parent = u.parent;  // ... update its parent
  }

This method is protected because subclasses might need to call it, but code using the BST class should not need to know about it.

The first two lines handle the case in which u is the root. Otherwise, u is either a left child or a right child of its parent. Yhe last two cases of the if-ladder take care of updating u.parent.left if u is a left child or u.parent.right if u is a right child. We allow v to be the sentinel, and the last two lines update v.parent if v is not the sentinel. Note that transplant does not attempt to update v.left and v.right; doing so, or not doing so, is the responsibility of transplant's caller.

With the transplant method in hand, here is the method that removes node z from a binary search tree:

  public void remove(Node z) {
    if (z.left == sentinel)       // no left child?
      transplant(z, z.right);     // then just replace z by its right child
    else if (z.right == sentinel) // no right child?
      transplant(z, z.left);      // then just replace z by its left child
    else {
      // Node z has two children.
      Node y = minimum(z.right);  // y is in z's right subtree, and y has no left
                                  // child

      // Splice y out of its current location, and have it replace z in the BST.
      if (y.parent != z) {
        // If y is not z's right child, replace y as a child of its parent by
        // y's right child and turn z's right child into y's right child.
        transplant(y, y.right);
        y.right = z.right;
        y.right.parent = y;
      }

      // Regardless of whether we found that y was z's right child, replace z as
      // a child of its parent by y and replace y's left child by z's left
      // child.
      transplant(z, y);
      y.left = z.left;
      y.left.parent = y;
    }
  }

The remove method executes the four cases as follows. The first two lines handle the case in which node z has no left child, and the next two handle the case in which z has a left child but no right child.

The remainder of the method deals with the remaining two cases, in which z has two children. We first find node y, which is the successor of z. Because z has a nonempty right subtree, its successor must be the node in that subtree with the smallest key; hence the call to minimum(z.right). As we noted before, y has no left child. We want to splice y out of its current location, and it should replace z in the tree. If y is z's right child, then the last three lines replace z as a child of its parent by y and replace y's left child by z's left child. If y is not z's right child, the lines

      // Splice y out of its current location, and have it replace z in the BST.
      if (y.parent != z) {
        // If y is not z's right child, replace y as a child of its parent by
        // y's right child and turn z's right child into y's right child.
        transplant(y, y.right);
        y.right = z.right;
        y.right.parent = y;
      }

replace y as a child of its parent by y's right child and turn z's right child r into y's right child, and then the last three replace z as a child of its parent by y and replace y's left child by z's left child.

Each line of remove, including the calls to transplant, takes constant time, except for the call to minimum. Thus, remove runs in O(h) time on a tree of height h.

Testing the BST class

The code in BSTTest.java tests the BST class. We have not discussed the toString method, but it is similar enough to how the toString method in BinaryTree.java operates that you should be able to figure out how it works.

Traversing a rooted tree

When we visit the nodes in a rooted tree, we are traversing the tree. Of the many possible orders for visiting nodes, three are most common. We think of them recursively, in terms of when to visit a node and when to visit its subtrees. We'll show examples in terms of this binary search tree, where we imagine visiting a node as printing out its key:

Going back to our binary tree (not binary search tree) code in BinaryTree.java, we see methods preorder, inorder, and postorder. Each appends data from a subtree to a List. But List is an interface, and in the driver, you can see that it's actually implemented by a Java LinkedList. Here's the preorder method:

  public void preorder(List<E> dataList) {
    dataList.add(data);
    if (this.hasLeft())
      left.preorder(dataList);  // recurse on left child
    if (this.hasRight())
      right.preorder(dataList); // recurse on right child
  }

The other two methods just move the call to add to different locations in the code:

  public void inorder(List<E> dataList) {
    if (this.hasLeft())
      left.inorder(dataList);   // recurse on left child
    dataList.add(data);
    if (this.hasRight())
      right.inorder(dataList);  // recurse on right child
  }

  public void postorder(List<E> dataList) {
    if (this.hasLeft())
      left.postorder(dataList);   // recurse on left child
    if (this.hasRight())
      right.postorder(dataList);  // recurse on right child
    dataList.add(data);
  }

Traversing rooted trees comes up quite often. For example, the TreeSet implementation of the Set interface stores information in a binary search tree, and the iterator returned by its iterator method is guaranteed to iterate over the set elements in increasing order. Not surprisingly, this iterator traverses the binary search tree using an inorder traversal.

There's another interesting use of traversals in binary trees. If all keys are distinct and you have both a preorder and an inorder traversal, you can reconstruct the entire binary tree. The method reconstructTree in the BinaryTree class does so. How does this method work? The first node in a preorder traversal is the root. The root is somewhere in the middle of an inorder traversal, with everything in its left subtree before it and everything in its right subtree after it. In the preorder traversal, all of the nodes in the left subtree are visited before anything in the right subtree. So we can build the tree recursively as follows:

  1. Save the root's key.
  2. Build lists of the keys in the left subtree, one list from the preorder traversal (leftPre in the code) and one list from the inorder traversal (leftIn in the code). Do so by iterating through the preorder and inorder lists until getting to the root in the inorder list.
  3. Having constructed the leftPre and leftIn lists, use them to recursively build the left subtree.
  4. Build lists of the keys in the right subtree, one list from the preorder traversal (rightPre in the code) and one list from the inorder traversal (rightIn in the code). Do so by iterating through the remainder of the preorder and inorder lists, starting from where we left off in step 1 and jumping over the root in the inorder list.
  5. Having constructed the rightPre and rightIn lists, use them to recursively build the right subtree.
  6. Create a new node for the root with the root's key, and link the reconstructed left and right subtrees to the root. Return the root.

How long does it take to traverse a rooted tree with n nodes? The answer is Θ(n) time. Here's an easy way to see it. We have to consider each edge twice. The first time is when we traverse the edge from the parent down to a child. The second time is when we traverse the edge from the child back to the parent. Conceptually, there is also an edge down into the root, representing the work performed in the initial call of the method. How many edges are there? It is a fact that in any tree—whether or not we think of it as a rooted tree—the number of edges is n − 1. Add in one more for the edge down to the root, and conceptually, there are n edges, each traversed twice. That makes the total time Θ(n).

Expression trees

We can think of expressions as being represented by trees. To keep things simple so that we don't have to think about operator precedence, let's consider only fully parenthesized expressions. Take the expression ((17.5 + (5.0 / x)) * (y - 4.0)). We can represent it by this binary tree:

Although we could build an expression tree on top of the binary tree code, we will write a new set of classes to represent expression trees. Each node will be its own type, so that we'll have nodes for variables, constants, and each of the binary operators. We won't include unary operators (such as unary minus) or ternary operators (such as the ? : operator of Java), but we could—and we could not include unary operators if all expression trees had to be binary trees.

The code for expression trees is in this zip file. (I will show you individual classes, but this is the easy way to download the whole thing.) First, look at Expression.java. It is an interface that says that an Expression must provide two methods: eval and deriv. The eval method evaluates an expression and returns its value in a double. The deriv method gives returns an expression that is the symbolic version of the derivative with respect to a particular variable. An Expression should also override toString.

Next, look at Constant.java. To use a constant, you call its define method, just giving a double with the value of the constant. The define method is static and calls the constructor, which is private, so that the only way to define a constant is by calling define. The constructor just saves the value of the constant in the instance variable myValue, and then the define method returns a reference to the Constant object. Because Constant implements Expression, anywhere you have declared a reference to an Expression, the reference may actually be to a Constant. The eval method just returns the value in myValue, and the deriv method always returns a constant whose value is 0.0, since the derivative of a constant with respect to any variable is 0. The toString method just returns the String representation of myValue.

This scheme, where the constructor is private and there is a static method that calls the constructor is a variant of a design pattern called the factory pattern. Instead of constructing objects directly, users of the class have to go through a method to create an object. One use is to allow the factory to decide to create a subclass object when asked to create a superclass object. Another is to give the user options on what parameters to pass to create the object. For example, complex numbers can be represented using polar or rectangular coordinates. Either way you present two double values, and so you cannot overload the constructor to accept either representation. You can create two factories, however, one of which expects polar coordinates and the other of which expects rectangular coordinates. For example:

class Complex {
  public static Complex fromCartesian(double real, double imag) {
    return new Complex(real, imag);
  }

  public static Complex fromPolar(double mag, double phase) {
    return new Complex(mag * cos(phase), mag * sin(phase));
  }

  private Complex(double a, double b) {
    // Code to store into private instance variables goes here.
  }
}

Complex c = Complex.fromPolar(1, pi);

It is less clear why we would want to use this pattern in the Constant class,, but it will become clearer later.

The Variable class in Variable.java class is similar, but it has a symbol table to save variable values. These are saved in a Map that is implemented using a HashMap. Again, there's a define method, but this time it is overloaded. The two-parameter form takes the name of a variable as a String and a value for the variable, storing the name and value in the symbol table. The one-parameter form takes just the name of the variable, and it stores the name in the symbol table with null for the value. A variable whose value is null is defined but uninitialized. The constructor, called by both define methods, stores the variable's name in the instance variable myName. There is an assign method that stores the value in the symbol table entry for the variable. The eval method looks up the variable in the symbol table, using myName as the key, and it returns the value found in the symbol table for the key given by myName. The deriv method determines whether the variable v given as the parameter matches myName. If it does, then the derivative is 1; otherwise, the derivative is 0. The correct Constant is defined and returned in either case. The toString method just returns myName.

Some errors can occur when using variables, and instead of just printing error messages, we throw exceptions. We have exceptions for three types of errors:

The classes in Sum.java, Difference.java, Product.java, and Quotient.java all perform eval by evaluating their operands and performing an operation on them. The only difference is the operator. Therefore, we have an abstract class in BinaryOp.java that has the template for evaluating a binary expression and another template for toString. These templates call abstract functions doOperation and getOperation, repectively. (The latter returns a String representation of the operator). It also has accessor methods to get the first or second expression.

Notice that evaluating an expression is really a postorder traversal of the expression tree. To evaluate a BinaryOp, first evaluate its two subtrees, and then apply the appropriate binary operator to the results of evaluating the subtrees.

Let's look at the Sum class more carefully. It provides the necessary two methods, which are fairly trivial. It supplies its deriv method, which adds the derivatives of its operands. But the interesting thing is Sum.make. Here we see the power of a factory method. It tries to simplify the expression. If the two expressions it is adding are constants, it adds them to get a new constant. If either operand is 0 it returns the other. So in three of the four cases it does not even create a Sum object! The other three operations are similar in how they try to simplify the resulting expression in their make methods.

Finally, look at the driver in ExpressionDriver.java. After the main method, we have several static wrapper methods. They alleviate the need to call the static methods in the above classes with the name of the class. We also use method names plus, minus, times, and over instead of the class names Sum, Difference, Product, and Quotient. In that way, we can construct the expression tree above using lines of code such as

Variable xVar = define("x", 2.0);
Variable yVar = define("y", 6.0);

Expression first = plus(constant(17.5), over(constant(5.0), xVar));
Expression second = minus(yVar, constant(4.0));
Expression third = times(first, second);

Or we can replace the last three lines by

Expression fourth = times(plus(constant(17.5), over(constant(5.0), xVar)),
    minus(yVar, constant(4.0)));

The driver also verifies that the three exceptions work correctly.