dev-java/htmlcleaner

HTML parser written in Java that can be used as a tool, library or Ant task

https://htmlcleaner.sourceforge.net/

Good job! There are no bugs.

Think something is missing here?
Start by filing a new bug.

Gentoo Bugzilla is where we track bugs of Gentoo and its packages; you are welcome to report, confirm and resolve bugs: