Another implementation may be found here, which discards the first 3072 bytes of the crypto-stream as recommended. It also provides a general-purpose seedable PRNG (pseudo-random number generator) as part of the API (inherently used as part of any stream-cipher implementation but not always exported).
Ironclad also includes an implementation.