Bounds on Edge Costs

Next: Pruning Rules Up: Finding the Edge Cover Previous: An Edge-wise Cost Function

Bounds on Edge Costs

Although Lemma 5.1 suggests a method of distributing the cost of an annotated edge cover (and thus an edit script) over the component edges, the cost of each edge depends on the other edges present in the edge cover, and is thus not directly useful for computing a minimum-cost edge cover. However, we use that distribution scheme to derive upper and lower bounds on the fair cost of an edge e over all minimal edge covers K.

Intuitively, given that the cost of any UPD annotation on an edge is charged to that edge (by Equation 1), a simple choice for the lower bound on the cost of an edge [m, n] is simply the cost of updating the label m to that of n. However, we can do a little better. In some cases, selecting an edge [m, n] (as part of the edge cover being constructed) may force some of the children m' of m to be moved to n. In particular, this happens for those children of m' for which there is no edge that could possibly match m' to a child of n. We call such moves forced moves. In cases where we can determine a forced move exists, the cost of a MOV is added to the lower bound cost. However, according to Equation 1 not all the cost of a forced move goes to edge [m, n]. In the worst case, the number of edges incident on m, |E(m)|, is large, leaving [m, n] with an insignificant contribution. However, if |E(m)| is greater than 1, we know by Lemma 4.1 that |E(n)|=1, so forced moves on the n side would contribute to [m, n]. Thus, we may add the minimum of the second and the third terms in Equation 1 to the lower bound function.

Formally, let E be the set of edges in the induced graph of and . We define the forced move cost, of a node with respect to another node as follows: , if such that , and 0 otherwise. The cost is defined analogously. We then define the lower bound fair cost, , of an edge as follows:

To help us compute the upper bound, let us now define a conditional move cost, . Intuitively, costs one MOV cost unless there is a partner of m' that is a child of n. Formally, , if such that , and otherwise. The cost is defined analogously. Furthermore, define if m and n are regular nodes, 0 if , if , and if .

Using reasoning similar to that used for deriving the lower bound cost above, we arrive at the following definition for the upper bound fair cost, , of an edge:

Note that both and can be computed by MH-DIFF without knowing the target cover. Furthermore, the following lemma, proved in [&make_named_href('', "node21.html#bbdiffExtended","[CGM97]")], states that the above definitions of and , are upper and lower bounds, respectively, on the fair cost contribution of edge e to any minimal edge cover K that contains e.

Next: Pruning Rules Up: Finding the Edge Cover Previous: An Edge-wise Cost Function

Sudarshan S. Chawathe
Sat Feb 22 12:28:02 PST 1997