Generalities about the Common-Lisp-POSIX API
Note: This page gathers some remarks made by people on news:comp.lang.lisp
- cl-posix-dirent - Common-Lisp-POSIX API - SUSv3 - dirent
Principles
We're not merging Common-Lisp and POSIX!
We want to have access to POSIX primitives from Common-Lisp applications as easily as from a C application. The POSIX API being the Operating System Interface for the applications.
In particular, we won't convert between PATHNAMES and strings containing unix paths, and we won't convert between unix file descriptors and Common-Lisp streams. (There may already exist such a conversion function in the Common-Lisp implementation extension packages).
Packages
The main API
This API is defined by the Single UNIX Specification version 3.
It is recommended that the various implementations reside in packages named like:
COM.INFORMATIMAGO.COMMON-LISP.SUSV3 NET.SOURCEFORGE.CLISP.SUSV3 COM.SMART-LISP-CORP.SUSV3
The user can then choose one implementation or the other, and
re-nickname it to the reserved package name SUSV3
.
Note: The package name POSIX
is already taken in various
implementations. Moreover, it's not descriptive enough.
Alternative:
- Perhaps it should be named:
SUS3
.Extensions
SUSv3 defines various optional or extension API. (See codes).
These extensions will be defined in additional packages nicknamed:
Nickname: Example of package name: ------------- ------------------------------- SUSV3-ADV COM.INFORMATIMAGO.COMMON-LISP.SUSV3.ADV SUSV3-AIO COM.INFORMATIMAGO.COMMON-LISP.SUSV3.AIO SUSV3-BAR COM.INFORMATIMAGO.COMMON-LISP.SUSV3.BAR SUSV3-CPT etc... SUSV3-CS SUSV3-FSC SUSV3-IP6 SUSV3-MC1 (See: abreviation of a combination of options) SUSV3-MC2 (See: abreviation of a combination of options) SUSV3-MC3 (See: abreviation of a combination of options) SUSV3-MF SUSV3-MF-SHM (A combination) SUSV3-ML SUSV3-MR SUSV3-MON SUSV3-MPR SUSV3-MSG SUSV3-MX SUSV3-PIO SUSV3-PS SUSV3-RS SUSV3-RTS SUSV3-REM SUSV3-SHM SUSV3-SIO SUSV3-SPI SUSV3-SPN SUSV3-SS SUSV3-TCT SUSV3-TEF SUSV3-THR SUSV3-TMO SUSV3-TMR SUSV3-TPI SUSV3-TPP SUSV3-TPS SUSV3-TRC SUSV3-TRI SUSV3-TRL SUSV3-TSA SUSV3-TSF SUSV3-TSH SUSV3-TSP SUSV3-TSS SUSV3-TYM SUSV3-XSI SUSV3-XSR
Some code may not be relevant to a Common-Lisp API.
Alternatives:
- Use the *FEATURES* system. I don't like this option, leading to the use of #+ and #-.
Note: When defining lisp structures, we put all the fields, even the optional ones that will be used only when the optional feature is available.
[***SEE***: We need a mean to determine if a given extension is available. Should each package export an AVAILABLE-P predicate? ]
The Symbols
A symbol name is derived from the C binding's name, by:
- uppercasing, then
- replacing underscore (#\_) characters with the hyphen (#\-)
- where the fields of a C structure all have a common prefix (for example, "st_"), we omit it.
- constant names get prefix and suffix '+'s.
creat()
becomes CREAT
, not CREATE
.Parameters
Lisp constants, types, and structures are defined corresponding to the constants, types and structures in the C API.
The API is defined in terms of Common-Lisp.
- length arguments are omitted or optional where the sensible value is obvious. For example:
(read fd buffer &optional (length (length buffer))) => bytes-read
Note: We won't allow passing pathnames as unix path string parameters. Common-Lisp pathnames are messy, no two implementation can agree on their exact semantic, so it would defeat portability.
The above statement defeats the whole purpose of having a CL POSIX API.
Pathnames are part of CL and personally I expect them to work in any extension. The fact that there are rough edges in the spec does not condone falling back to a string based approach as the above suggests. Marco Antoniotti 20040513
Return values
The return value is usually the same as for the C binding, except in error cases: where the C function is defined as returning some sentinel value and setting "errno" on error, we instead signal a condition of type SYSCALL-ERROR. The actual error value ("errno") is stored in this condition and can be accessed with SYSCALL-ERRNO. [***SEE***: some interface to strerror, to get the user-readable translation of the error number]
We do not automatically translate the returned value into "Lispy" objects - for example, POSIX:OPEN returns a small integer, not a stream.
However, we do convert the returned values into Common-Lisp values.
Rationale: This is an interface to POSIX, not a high-level interface that uses POSIX, and many people using it will actually want to mess with the file descriptors directly. People needing Lispy interfaces can implement them atop this - or indeed, use the existing COMMON-LISP package, which already has many high-level constructs built on top of the operating system.
Issues
- Constants may be replaced by keywords (for example: error codes).
- What to do with variable-length functions
like execvp()? Do they take a list, or vector or either?
Don't forget that execlp etc. IMHO have no place in a foreign language
binding. These are just C wrappers to execv(), execve() and execv().
- there are functions whose semantics are to initialize a C object,
like sigemptyset(). You probably want to keep it that way, rather
than turn sigemptyset() into a constructor that returns a a brand
new, empty signal set. Or it could have both behaviors: the set
could be returned as a second value, which is identical to the
optional argument if it is present, or a new object if the
argument is missing.
- (Seems to be more a FFI issue): Similarly, interfacing to [f]printf() is superfluous. vfprintf() is mandatory when coming from a foreign language. Which is *not* to say that there should be no POSIX:PRINTF. I'm all in favour of it! But it is more likely to expand to a call to vprintf(), especially in some implementations which create calls at run-time, instead of generating compile-time native-code.
(defun posix:fprintf (fp format &rest args) (posix:vfprintf fp format (coerce args 'vector)))A compiler-macro optimization might handle the case when posix:printf is called literally, i.e. with a known number of arguments, on those systems which generate native-code. I have yet to see non-wrapper FFI definitions for variable-arg functions.
Which leads to another question: how to deal with [vsnf]printf and its sisters' arguments which are all polymorphic??
(typecase ((unsigned-int 32) ... (signed-int 32) ... single-float ... string ... (array fixnum) (array ... ... FFI-pointer
References
-
Allegro CL -- OS Interface
(Would be a nice base to implement a SUSV3 package on Allegro CL!).
Commentary
I would like to second Marco's comment earlier on this page. A CL binding for POSIX should integrate well with Lisp. I agree that the rules should be simple, so that a CL programmer looking at POSIX documentation could easily extrapolate the corresponding CL interfaces. So keep the rules simple, but make them lispy. It would be a great surprise to any CL programmer if a function expecting a file or directory name wouldn't accept a CL pathname. -- DanMuller 2004-10-05
standards