Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: New version of junkbase

Author: Dann Corbit

Date: 13:30:42 06/20/05

Go up one level in this thread


On June 20, 2005 at 16:08:51, Dieter Buerssner wrote:

>Thanks, Dann!
>
>From a former discussion I remember that your junkbase cutted long games. I
>remember a 300+ moves game of Yace vs. Arasan (or reverse), that only showed 300
>moves in the database. Is this issue resolved now? If yes - only for newer added
>games, or are also the older games repaired, to show the real length now?
>Regarding this issue - will there be a difference when I download Scid database
>compared to the PGNs?

I was not aware of the issue.
There are many ways that a game could become damaged.
Previously, I did some filtering with the CDB cleaner, which caused some
problems.

I see that many games have been truncated to 300 fullmoves and I do not know the
culprit.  There are games in the database bigger than that.

>There are really many individual PGNs. Perhaps you could consider, to have
>a-openings.pgn.bz2, etc. All in all, this might give you less traffic. The files
>are not so big at the moment, so the overhead to start each individual download
>takes some time. I guess, only people with rather fast connections to the
>internet will try to download such a large database. To me, it seems connections
>are very reliably nowaday. Ftp clients have the ability to restart a download at
>the correct point, when they got disconnected.

I get emails all the time for requests about how to divide the database in other
ways.

Usually, the big files cause the most problems, because people will fail on the
download 2/3 of the way through (sometimes my fault because of a reboot) and
then they get very hot under the collar.
So if they want them all, I just tell them to use mget.


>The advantage of the small files may be, when somebody is really only interested
>in games for one or a few ECO codes. I cannot really judge, if there will be
>many people who want to do that. I guess almost none.
>
>I remember from the past, that in my environment commandline space had
>overflown, when I tried to decompress all the bz2. I had to use some shell
>programming, to do it. I am sure, I will be able to do it again, but many people
>downloading it, might not be able, to find out, how to do it.
>
>No big problem, just some suggestions. Thanks again,

To collect all the games, the fastest way is probably the junkbase scid files.
But I am sure that tomorrow morning I will have a dozen complaints about
problems downloading them.



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.