[Squeakfoundation] Harvesting infrastrucure (was: Don't kill
Bert's workstation)
Bert Freudenberg
bert at isg.cs.uni-magdeburg.de
Fri Jun 6 00:08:55 CEST 2003
Am Donnerstag, 05.06.03 um 19:26 Uhr schrieb Brent Vukmer:
>> Please remember these are not UIDs, just a count from 1 to
>> number-of-messages-in-archive. If something (someone?) (me??)
>> messes up
>> my mbox file, these numbers will change.
>>
>
> Right, I know they're not actually UIDs. And in the scenario I
> propose,
> I guess they *still* won't be UIDs -- the process will parse and export
> individual email .txt files, numbered in the order they're extracted.
> But there is no safeguard against a hiccup in the parsing process.
> Ideally someone should be able to download the mbox file from
> lists.squeakfoundation.org, parse it, and get the same output as my
> (theoretical) server.
>
> Anybody got suggestions for a good algorithm generating the UID? How
> about a hash just using the subject line and the date?
Why don't you make the individual numbered .txt files the main
repository? Easily served by Apache, easily updated by a procmail
script or ftp upload or whatever. KISS ;-)
-- Bert
More information about the Squeakfoundation
mailing list