text
Text processing software - string splitters, parsers, and so on
- Francis Leboutte's functions to compute
frequencies of characters, digrams and trigrams from a text file (function
names and comments in French - some comments in English).
- Pierre Mai's PARTITION, and Christophe's (revised) specification.
- A slightly adapted version of the above PARTITION was accepted as a community standard. The name was initially PARTITION, but later it was agreed that SPLIT-SEQUENCE is a better name. Part of Common Lisp Utilities.
-
Kevin Rosenberg's LVG InterfacePackage.
This package provides an interface to the National Library of Medicine's Lexical Variant Generation API.
- RegEx-CLAWK-Lexer by Michael Parker is a package with regular expression, AWK, and lexical analyzer sub-packages.
- Zebu is a parser generator.
- LALR is another one.
- Yet another one is CL-Yacc.
- Meta is a language on top of Lisp for building recursive-descent parsers.
- Pragmatic Parsing in Common Lisp by Henry Baker has a good discussion on parsing from the Computer Science perspective. But it's not just "theory". With code. (This is the method that Meta implements.)
broken, MultiLingual Lisp HyperText Generation
System (or read-only) "Documents want to speak Lisp (...) Many Documents in One
You may be reading this document in English, mainland or taiwanese style Chinese or some other language. You may be reading it on a computer screen with a web browser that supports all linking possibilities of modern hypertext, or you may be reading a paper-oriented version that instead operates with page references, content tables and footnotes. Regardless of the language, medium or otherwise defined version, in which you are reading this document, the source text of it is always the same. This is so, because this document (...) is in fact a computer program.This document is written in Lisp."
- CL-BibTeX is a replacement for the BibTeX program.
- parse-number is a library to parse strings into any Common Lisp number type
- fare-csv helps you import CSV files from PC spreadsheets and databases.
- csv-parser
is an alternative parser to read (and write) CSV files.
- cl-interpol modifies the Lisp reader so you can have string interpolation like in Perl or shell scripts
- Questions are often asked about Unicode support in Common Lisp implementations. The page on Unicode and Lisp provides some commentary.
- Parse dates & times into Universal Times with the tiny library at
http://lisp-p.org/pdl/.
- cl-ppcre Perl5-Compatible Regular Expression Library
- terse-ppcre - macros to make using CL-PPCRE parse trees easier.
- CL-ISBN - library for parsing, validating, and hyphenating International Standard Book Numbers (ISBNs)
- cl-typesetting - A typesetting library to create PDF documents with the help of CL-PDF
- CL-PDF - A standalone PDF generation library
- ODS4CL - Produces spreadsheets in OpenDocument format
- CL-RFC2047 - Encoding and decoding according to CL-RFC2047
See also the pages for Regular Expression, XML libraries, Lisp Markup Languages.
Pages in this topic: Cedilla cl-interpol CL-ISBN cl-markdown cl-peg CL-Yacc Common String diff dso-lex fare-csv guess html-encode LALR metapeg PARSE-NUMBER parser PCRE RegEx-CLAWK-Lexer sequence-search-replace terse-ppcre text-template Texticl xindy yacc Zebu
Also linked from: index
CLiki pages can be edited by anyone at any time. Imagine a fearsomely comprehensive disclaimer of liability. Now fear, comprehensively