Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: New iteration of junkbase

Author: William Penn

Date: 03:23:51 11/07/04

Go up one level in this thread


On November 05, 2004 at 20:08:31, Dann Corbit wrote:

>The whole thing, in one big SCID file set (about 300 MB):
>ftp://cap.connx.com/pub/Scid/junkbase.sg3.bz2
>ftp://cap.connx.com/pub/Scid/junkbase.si3.bz2
>ftp://cap.connx.com/pub/Scid/junkbase.sn3.bz2
>
>The individual compressed PGN files are here (sorted by ECO):
>ftp://cap.connx.com/pub/a-openings/
>ftp://cap.connx.com/pub/b-openings/
>ftp://cap.connx.com/pub/c-openings/
>ftp://cap.connx.com/pub/d-openings/
>ftp://cap.connx.com/pub/e-openings/
>
>A bit over 3.5 million games.  Filtered for Duplicates by Pgn-extract, then
>Chess Assistant, then Scid (two cycles each program filtering).
>
>Still a lot of crappy stuff in there.  But hey, free chess games.

So what if I download them? I wouldn't know what to do with those extensions.
Presumably they're decompressible but I'm not sure. I'm running Windows XP Home
(SP2), not linux.

FYI: I recently bought ChessliB's big database containing over 3.6 million chess
games and it's pretty good quality. It doesn't return significant doubles with
ordinary settings in Chessbase 8. However I was able to find some special
settings which find about 1% doubles. Otherwise it seems pretty good, limited to
games with 10 moves or more, no fragments, and the integrity checks OK with CB8.

I suppose my main question would be: What do you use for ECO classifications?
Neither CB8 nor SCID are perfect in that regard. I suppose Chess Assistant may
have an ECO classification routine too, but I don't have CA.
WP




This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.