Gary King: Since I couldn't find it out there already yesterday (many years ago!), I packaged up Franz's open source HTML parser as CL-HTML-Parse, a simple HTML parser. It converts HTML into a Lispy representation. It's also the inverse of lml2. Spiffy?! You decide.
Download
The code is now on github: https://github.com/hraban/cl-html-parseComments
Marco Baringer points out that pxmlutils already has Franz's stuff ported! Cool.Max Mouratov says that pxmlutils also includes an XML parser and requires ACL-COMPAT. CL-HTML-Parse is a lighter package which includes just an HTML parser and doesn't require ACL-COMPAT.
Quick assessment (2023-10-07)
- Line 48 of cl-html-parse.lisp: Note that :MCL is in the features of MCL and OpenMCL, but not Clozure CL.
- Lines 59–61 of cl-html-parse.lisp: Wrong because *current-case-mode* is different from *print-case*.
- Lines 869–873 of cl-html-parse.lisp: Could be problematic to case-fold (upcase or downcase) a partial match.
MIT-LICENSE