A number of simple utilities for manipulating HTML and XML files
http://www.w3.org/Tools/HTML-XML-utils/