Computer Chess Club Archives


Subject: Re: How to build a ChessBase database from a large PGN file..

Author: Dann Corbit

Date: 22:42:15 07/24/01


On July 25, 2001 at 01:25:28, John Hatcher wrote:
>On July 25, 2001 at 00:51:07, Dann Corbit wrote:
>>On July 25, 2001 at 00:36:00, Dann Corbit wrote:
>>
>>>On July 24, 2001 at 23:06:58, John Hatcher wrote:
>>>
>>>>On July 24, 2001 at 20:57:16, Dann Corbit wrote:
>>>>
>>>>>Not for me, it was asked in this message:
>>>>>http://www.icdchess.com/forums/1/message.shtml?180953
>>>>>
>>>>>Since the header of that message is not descriptive of the actual problem, I
>>>>>thought I would start a new thread so that the OP might find the answer.
>>>>>
>>>>>I'm pretty well ignorant when it comes to CB.
>>>>
>>>>In all seriousness, why would anyone want to build an opening book from 1.5
>>>>million games?
>>>
>>>That's a pipsqueak compared to some database files I know of.
>>>I know of one collection with 7.1 million games between rated players.
>>>
>>>>Surely, 1.3 million of the games would be between Joe Blow and
>>>>Norm Nobody.  Who cares what they played in the opening?  I would be very
>>>>surprised if all the recorded games between International Masters and
>>>>Grandmasters totaled more than 300,000 games.
>>>
>>>Prepare to be surprised.  I have 380K in my tiny (highly filtered) set of 2.5
>>>million games.  I throw out any games with the same move sequence.  There are
>>>lots of non-duplicate games that get clubbed from that.
>>
>>Actually, your 300K number is probably pretty good.  Some of the games in the
>>collection are from computer/computer matches and somehow got high ELO ratings
>>attached.
>>
>>[snip]
>
>There is nothing wrong with having a database with 1.5 million games, or 15
>million games for that matter.  When Chessbase programs annotate a game for you,
>they will search the entire huge database and extract relevant games to insert into
>the annotation.  That could include lots of garbage games between weak players.
>The games in this big database can be analyzed and filtered and searched to
>your heart's content.
>
>That is different from having an opening BOOK with 1.5 million games.  Who wants
>an opening book with 1.3 million poorly played openings?

Actually, I do.  I have a book like that with 2.5 million games in it.  I call
it my "human" book, because it plays like a person.  It doesn't just play stupid
openings, because it knows the won/loss/draw statistics and it also discards
rarely played openings.
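
As a rough illustration of the idea (not the actual code or format behind the book described here), a statistics-driven book could be sketched like this in Python. Everything in it is an assumption made for the example: the names build_book and best_move, the max_ply and min_games parameters, games given as (list of SAN moves, result string) pairs, and positions keyed by the move sequence so far, which ignores transpositions.

```python
from collections import defaultdict

# Score from White's point of view for each PGN result string.
RESULT_FOR_WHITE = {"1-0": 1.0, "1/2-1/2": 0.5, "0-1": 0.0}


def build_book(games, max_ply=20, min_games=2):
    """Build a simple opening book from (moves, result) pairs.

    Hypothetical layout: book[prefix][move] = (games_played, points),
    where prefix is the tuple of moves played so far and points are
    counted for the side that played the move.
    """
    raw = defaultdict(lambda: defaultdict(lambda: [0, 0.0]))
    for moves, result in games:
        white_score = RESULT_FOR_WHITE.get(result)
        if white_score is None:
            continue  # skip unfinished games or unknown results
        for ply, move in enumerate(moves[:max_ply]):
            prefix = tuple(moves[:ply])
            # Even plies are White's moves, odd plies are Black's.
            mover_score = white_score if ply % 2 == 0 else 1.0 - white_score
            entry = raw[prefix][move]
            entry[0] += 1
            entry[1] += mover_score

    # Discard rarely played moves so one-off oddities never enter the book.
    book = {}
    for prefix, candidates in raw.items():
        kept = {m: tuple(e) for m, e in candidates.items() if e[0] >= min_games}
        if kept:
            book[prefix] = kept
    return book


def best_move(book, moves_so_far):
    """Return the book move with the best score rate, or None if out of book."""
    candidates = book.get(tuple(moves_so_far))
    if not candidates:
        return None
    return max(candidates, key=lambda m: candidates[m][1] / candidates[m][0])


if __name__ == "__main__":
    sample_games = [
        (["e4", "e5"], "1-0"),
        (["e4", "c5"], "1/2-1/2"),
        (["d4", "d5"], "0-1"),
        (["d4", "Nf6"], "1/2-1/2"),
        (["a3", "e5"], "1-0"),
    ]
    sample_book = build_book(sample_games, min_games=2)
    print(best_move(sample_book, []))
```

With a real collection, min_games would be set much higher, and a position hash would replace the move-sequence key so that transpositions share statistics.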

>My point is that I think the original poster misunderstands how chessbase uses
>databases to annotate games, and the much smaller selection of games that is
>used to build opening books containing (hopefully) good moves.

You might be right.  I have no idea what the OP wants.


