Computer Chess Club Archives

Search

Terms
Messages

Subject: Re: Let's back off for a minute from Rc6

Author: Christophe Theron
Date: 19:19:01 10/21/00
On October 21, 2000 at 02:59:53, Ratko V Tomic wrote:

>>That was my point.  If we both make errors at the same frequency, but yours
>>cost you 10 times as much as mine, you go broke first, assuming we start with
>>the same amount of money.
>>
>
>The problem with optimum bet size is a bit trickier. If gambler's odds of
>winning single game (e.g. a coin toss or dice roll) are P and if P>0.5, then the
>optimum bet he should place is C*(2P-1), where C is his current total capital,
>i.e. he should always bet 2P-1 fraction of his current net worth. If P<0.5 then
>player should bet 0 (i.e. if odds are against you, don't bet at all).
>
>Now suppose GT has specialized, very finely tuned king-attack tactics
>algorithms. It can compute in these situations faster, thus deeper and more
>accurately than a program without this specialized code. Since the truncated
>minimax will make errors, the more the shallower its search, the situation is
>similar to gambling. The deeper program will make errors, too, but its odds of
>error are smaller than of the shallower program.
>
>So, if in a double-edged king-side attack GT, due to its specialized king-side
>tactics code, has odds of 55% of outcomputing program X in each step, its
>optimum bet would be about 2*0.55-1=0.10, i.e. it should bet 1/10th of its
>current value. The total (approx) material value GT initially has is, say, (8*1
>+ 4*3 + 2*5 + 10)=40. If some exchanges took place, say 2 pieces and 4 pawns per
>side, the total material capital is around 30 per side. Now if GT can get at
>this point into the kind of position where its odds of outwitting X are 55%, it
>should start betting about 3 pawns, a full piece. The greater its odds of
>outcomputing X on each step, the more it should bet. For example if its odds are
>60% instead of 55%, it should bet 2*0.6-1=20%, i.e. with total capital 30, it
>should bet 6 pawns or 2 pieces worth.
>
>The trigger condition for a program to start gambling profitably is that its
>odds of outcomputing its opponent in a given type of position are better than
>even. In practice one doesn't know what are the odds of outcomputing opponent in
>a given position. So the way one would tune it in practice is to set the risk
>aversion to increase against the better tactician.
>
>Since in computer chess the rating is fairly well correlated with tactical
>abilities of the programs, in absence of any other information besides rating,
>GT should bet less against the higher rated programs. If more specific
>information is available on its next round opponent, it should bet less against
>programs which are good tacticians in king-attack positions. I suspect
>Christophe is already tuning GT's gambling affinity along these lines.
>
>For longer matches against the same opponent, GT could even automate this
>adjustment as a part of its regular learning, e.g. by increasing or decreasing
>the P after each game based on accumulated results so far. Unlike learning of
>the opening lines, where all lines learned are stored in a single data-base,
>this one (the value P or the optimum bet size) has to be opponent specific.
>
>A special case of the match here is GT playing against regular Tiger. Since both
>probably have the same specialized king-attack search code, the odds P are 0.5,
>hence GT should not gamble at all against Tiger, and if it does, it will not
>fare too well. Since it does appear (from reports here) that GT is weaker than
>Tiger in matches against it, it would follow that Christophe doesn't have the
>opponent specific learning (or even hand tuning) of GT's risk aversion (optimum
>bet size) for matches with a fixed opponent.
>
>Although the above dealt with king-side attack and risk taking, the relation of
>risk taking and existence of some kind of edge holds for any other type of edge.
>For example a program which has some special/unique endgame heuristics should
>take risks to steer the game into such endgames, and the greater the edge the
>greater bets it should place. Any other aspect of middlegame would work, too.
>For example some program may have special heuristics for blocked positions, the
>notoriously weak area of most programs. I think Junior has something of this
>sort, since it seems to like to get into blocked positions, more than other
>programs, so I suspect Amir has some special secret heuristics for these
>positions. Now, he only needs to up Junior's risk-taking drive to get into such
>positions (if playing against programs, not against humans). I haven't noticed
>in Junior any risk taking behavior, though.




Once again a smart and very much to the point message.

Thanks Ratko.



    Christophe
Re: Let's back off for a minute from Rc6 Ratko V Tomic 11:15:33 10/22/00
- Re: Let's back off for a minute from Rc6 Christophe Theron 21:32:31 10/22/00
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.