closure-html
Closure-html is a portable HTML parsing library that understands malformed HTML. Info Project name: chtml, Closure HTML

Original author: Gilbert Baumann
Current maintainer: David Lichteblau

Categories: HTML, HTML parser

Homepage: common-lisp.net

Download (tarball): Yes, see below

Download (git clone): git://repo.or.cz/closure-html.git (gitweb)

This project is asdf-installable (release tarball) and available in clbuild (git version).

Project An HTML parser written in Common Lisp.

Feature overview

  • Supports HTML 4 and understands malformed HTML.
  • Serialization implemented using an event API called HAX, inspired by SAX.
  • Can be used with XML representations designed for cxml

In-memory representations that can be used with Closure HTML:

  • Lisp lists in "LHTML" format
  • Closure's internal PT structures
  • any representation supported by its sister project cxml, including DOM Core and STP

Relation to Closure XML

chtml is an error-correcting HTML 4 parser; cxml an XML parser

Etymology

chtml is short for Closure HTML, in reference to the Closure web browser it was originally written for.

Note that the Closure web browser is not related to the Clozure implementation of Common Lisp and works on various Common Lisp implementations.

Tarball

ASDF-install package (obsolete) http://www.common-lisp.net/project/closure/download/closure-html.tar.gz