Perl modules for parsing HTML:

  HTML::Entities    - Encode or decode strings with HTML entities
  HTML::Filter      - Filter HTML text through the parser
  HTML::HeadParser  - Parse the <HEAD> section of an HTML document
  HTML::LinkExtor   - Extract links from an HTML document
  HTML::Parser      - HTML parser class
  HTML::PullParser  - Alternative HTML::Parser interface
  HTML::TokeParser  - Alternative HTML::Parser interface

See https://metacpan.org/pod/HTML::Parser
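
As a rough illustration of how two of these modules might be used together, the sketch below encodes and decodes a string with HTML::Entities and then collects link targets with HTML::TokeParser. The sample strings, the example.org URL, and the variable names are assumptions made up for this example; they do not come from the distribution.

```perl
use strict;
use warnings;
use HTML::Entities qw(encode_entities decode_entities);
use HTML::TokeParser;

# Escape markup-significant characters before embedding text in HTML.
my $encoded = encode_entities('a < b & c');

# Turn entity references back into plain characters.
my $decoded = decode_entities('&lt;p&gt; &amp; &quot;q&quot;');

# Hypothetical sample document, used only for this illustration.
my $html = '<p><a href="https://example.org/">Example</a> and '
         . '<a href="/docs">Docs</a></p>';

# Walk the <a> start tags and collect [href, link text] pairs.
my @links;
my $parser = HTML::TokeParser->new(\$html);
while (my $tag = $parser->get_tag('a')) {
    my $href = $tag->[1]{href};    # element 1 is the attribute hash
    next unless defined $href;
    push @links, [$href, $parser->get_trimmed_text('/a')];
}

print "$encoded\n$decoded\n";
print "$_->[0] => $_->[1]\n" for @links;
```

HTML::TokeParser is used here only because its pull-style loop keeps the example short. HTML::LinkExtor or a callback-based HTML::Parser subclass would probably serve equally well for link extraction.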