Author: Dave Gomboc
Date: 22:47:40 08/09/99
On August 10, 1999 at 01:32:46, Tim Mirabile wrote:

>Thanks for posting this here. I like the idea of saving disk space, but I would
>like to get some feedback from archive users before I jump into it.
>
>One thing I would consider is concatenating each day's worth of files into a
>single file before zipping them. I could automate this in the same perl script
>which creates the day's archive. Then the monthly archive would contain 31
>files or less. The only problem with this is that people would never get a
>chance to download the messages as separate files. This may not be a concern
>for anyone, and if it is it should be possible to format the combined message
>file so that it could be easily split.

It's handy to have each message separable so that it can be archived into a database or something. On the other hand, I tried to download a bunch and gave up... with a cluster size of 16K, disk space gets chewed awfully quickly!

How about using SGML or XML? e.g.

<YEAR 1999>
  <MONTH January>
    <DATE 16>
      <MESSAGE 933503>
        <AUTHOR "Dave Gomboc">
        <SUBJECT "crafty 84.3">
        blah blah whatever I ranted about :-)
      </MESSAGE>
      <MESSAGE 933504>
        <AUTHOR "Someone else">
        <SUBJECT "blah">
        blah blah
      </MESSAGE>
    </DATE>
    <DATE 17>

you get the idea.

One big text file is perhaps a little unwieldy :-), but maybe done by month this would work out well. I'm not sure how difficult this would be to do, but I imagine that a perl script could slam through the messages pretty quickly. The tags would be a big help to anyone who wants to post-process the messages later for their own needs.

Dave
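
For anyone who wants to try the tagged monthly file idea, here is a rough sketch of how it might look. It is written in Python rather than perl, and it is only an illustration under some assumptions: the function names (build_month, split_month), the element and attribute names, and the dict layout for a message are all made up for this example. Note also that a tag like <YEAR 1999> is not well-formed XML, so the sketch uses named attributes (for instance id="933503") instead.

```python
import xml.etree.ElementTree as ET


def build_month(year, month, messages):
    """Combine messages into one XML string for a month.

    `messages` is an iterable of dicts with the (hypothetical) keys
    day, id, author, subject and body.
    """
    root = ET.Element("month", year=str(year), name=month)
    days = {}  # day number -> <date> element, created on first use
    for msg in sorted(messages, key=lambda m: (m["day"], m["id"])):
        day = days.get(msg["day"])
        if day is None:
            day = ET.SubElement(root, "date", day=str(msg["day"]))
            days[msg["day"]] = day
        node = ET.SubElement(day, "message", id=str(msg["id"]))
        ET.SubElement(node, "author").text = msg["author"]
        ET.SubElement(node, "subject").text = msg["subject"]
        # ElementTree escapes <, > and & in the body when serializing.
        ET.SubElement(node, "body").text = msg["body"]
    return ET.tostring(root, encoding="unicode")


def split_month(xml_text):
    """Yield each message from a monthly file as a separate dict."""
    root = ET.fromstring(xml_text)
    for day in root.iter("date"):
        for node in day.iter("message"):
            yield {
                "day": int(day.get("day")),
                "id": int(node.get("id")),
                "author": node.findtext("author"),
                "subject": node.findtext("subject"),
                "body": node.findtext("body"),
            }
```

Because every message keeps its own element, a reader could call split_month on a downloaded monthly file and write each message back out separately or load it into a database, which covers the concern about messages no longer being available as individual files.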
Last modified: Thu, 15 Apr 21 08:11:13 -0700