Author: Alessandro Damiani
Date: 02:31:18 09/11/99
Go up one level in this thread
On September 10, 1999 at 21:49:16, Bas Hamstra wrote: >On September 10, 1999 at 17:57:38, Alessandro Damiani wrote: > >>On September 10, 1999 at 17:38:31, Robert Hyatt wrote: >> >>>On September 10, 1999 at 17:05:33, Alessandro Damiani wrote: >>> >>>>[..] >>>>>>>For one moment forget about alpha and beta, you are on the wrong track as >>>>>>>alpha and beta are not a part at all of the code. You need an extra stack >>>>>>>that is set to -INF at each ply. Then before you do A/B you do the bestmove >>>>>>>calculation for that ply. Involved variables: SCORE and STACK, no alpha beta. >>>>>>> >>>>>>>Ed >>>>>> >>>>>>I think the best way to explain is to write a small piece of code in pseudo C, >>>>>>else we talk around the point. >>>>>> >>>>>>Alessandro >>>>> >>>>> >>>>>OK... here is what I did: >>>>> >>>>>Normal alpha/beta first: >>>>> >>>>>int Search(int alpha, int beta, etc...) { >>>>> best=-infinity; >>>>> bestmove=0; >>>>> foreach (move in movelist) { >>>>> MakeMove(); >>>>> value=-Search(-beta,-alpha,etc.) >>>>> if (value > best) { >>>>> best=value; >>>>> bestmove=current.move; >>>>> } >>>>> if (value > alpha) { >>>>> if (value >= beta) { >>>>> return(value); >>>>> } >>>>> alpha=value; >>>>> } >>>>> } >>>>> HashStore(bestmove,alpha, etc...) >>>>>} >>>>> >>>>> >>>>> >>>>>So what I did was to simply take the score for each search made after trying >>>>>one of the moves at this ply, and remember the 'best' score and associated move. >>>>> >>>>>All I am saying is "this does not work". It is a characteristic of the alpha/ >>>>>beta search. It isn't a "it might work if ..." it simply won't work. Because >>>>>the searches below this node (again, assuming this is a fail-low node where _no_ >>>>>move produces a score > alpha, which is the case where I claim there is never a >>>>>best move to try here) return a bound on the value for that move. And I have no >>>>>idea how to choose between a move with a bound <=200 and another move with a >>>>>bound <= 150. Because the first could have a score well below 200. I simply >>>>>don't know as I told the search below this node to stop whenever you find a >>>>>score > X, where X is my negated alpha bound. >>>>> >>>>>Now, we have code. Did I misunderstand what you are saying? If not, then I >>>>>can certainly explain further why that 'best' and 'bestmove' above are no good >>>>>in this context. You can think of "best" as a random number that is <= alpha, >>>>>nothing more. Which means "bestmove" is a random move chosen from the moves >>>>>searched at this ply. And that is _not_ the move we want to try first when we >>>>>get back to this position and it is suddenly not a fail-low position where all >>>>>moves have to be tried, but rather it is a fail high position where the best >>>>>move will let us cut off quickly... >>>> >>>>I never said something against that. When I read Ed's answers I see what you >>>>say: fail-soft AlphaBeta. What you say is what I think all the time: >>>> >>>>best: score of the position after all moves are searched. >>>> >>>>best<=alpha => the real score of the position is <=best. There is no information >>>>about the best move. >>>> >>>>bestmove==0 => best==-INF (after searching all moves) >>>> >>>>I think, Ed should write down a small piece of code. >>>> >>>>Alessandro >>> >>> >>>There are only three choices here. >>> >>>(1) best <= alpha. We don't know anything about any of the moves. Yet I >>>believe Ed (and I thought you as well) said that if you take this 'best' >>>move and store it in the hash, it works well. In my test, it didn't. >>> >>>(2) alpha < best < beta. This is a move with an _exact_ score that is >>>correct. Also known as a PV-candidate move normally. >>> >>>(3) alpha < beta < best. This is a fail high move. It is better than the >>>best that is allowable. It is _not_ the best move at this level however, it is >>>just good enough to produce a score >= beta and terminate the search early. >>>This is the case that feeds (1) above at the previous ply. If we try _another_ >>>move here we might get an even _bigger_ score here, and back up an even worse >>>score to the previous ply. But alpha/beta makes us stop _now_. >> >>This is the postcondition of AlphaBeta. In all three cases I store the best move >>in the transposition table. BUT (I wrote that somewhere in an answer!) I don't >>use that move in the case the score belonging to that move was an upper bound. >>The code in Fortress: >> >>[[ >>precondition: depth>0 && (height, bound, wasSingular) is the information from >> the transposition table. >> >> // there is no best move when the score was an upper bound. >> // But we need the move for Singular Extensions, if it was singular >> if (height>0 && bound==UPPER && !wasSingular) HashMove[ply]= 0; >> >> here: code for move ordering. >>]] >> >>Conclusion: I do the same thing as you do, Bob. >> >>I store the best move to have an additional check to avoid hash errors, when the >>hashtable is looked up (move must be pseudo-legal). >> >>I don't understand what Ed says. >> >>Alessandro > >So you *do* retrieve a LOWERBOUND type of move from the hashtable and use that >as a bestmove? Then you do it not exactly like Bob, I think, who uses only EXACT >moves as bestmoves. Does it work? > >Most informative discussion. I see I have to change things. > > >Regards, > >Bas Hamstra. Yes, I store a random move at UPPER positions, but I don't use this move for move ordering. I need it for two reasons: 1. Singular Extensions. This is very important! 2. To check for hash errors (I am able to store only 52 bit of the hashcode, so I use the check for legality of the move to "add some bits") Alessandro
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.