Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: RE: more information...

Author: Uri Blass

Date: 11:11:16 08/29/01

Go up one level in this thread


On August 29, 2001 at 13:20:07, Robert Hyatt wrote:

>On August 29, 2001 at 11:41:56, Uri Blass wrote:
>
>>On August 29, 2001 at 10:41:45, Robert Hyatt wrote:
>>
>>>
>>>
>>>If you _really_ believe that DF/DJ are better than DB of 1997, you are
>>>going to somehow have to overcome the following problems with that stand:
>>>
>>>1.  We _know_ that DT played at a 2650+ level.  Here is an excerpt from an
>>>email sent to me by Hsu:
>>>This is in the book draft that you have.  The best 25-game performance was over
>>>2655.  The USCF rating was much lower, because the first 20 games in DT's
>>>career, which include the disasters in early phase of the US OPen, had four
>>>times the weighting of the last 35 or so games in DT's career.  This is
>>>so because DT's first official rating after 20 games was well over 2400,
>>>and the subsequent games therefore had only 1/4 weight.  This is somewhat
>>>illogical, but DT was an unusual case to begin with.  If all the games had
>>>the same weighting, then DT's USCF rating would have been over 2600, even with
>>>the disasters included.
>>>                                                        --FH
>>>
>>>
>>>So the 1992 (roughly) version of the program, made using 3 micron ASICS,
>>>was at _least_ as strong as either deep fritz or deep junior on today's
>>>hardware.  I will be conservative and say they are "equal".
>>>
>>>We know that in 1992 they were running at around 1M-2M nodes per second,
>>>which is roughly what DF/DJ will do on an 8-way box.  We also know their
>>>actual results against a bunch of GM players in multiple tournaments and
>>>matches hit a best of 2655 over _25_ consecutive games (not 25 cherry-picked
>>>games).
>>>
>>>2.  The final DB machine was based on .6 micron ASICS.  That is, each chip
>>>contains 25X as many logic circuits as the original 3.0 micron processors (if
>>>you reduce the width and length by 5, area is decreased by 25.  So the final
>>>DB2 chips had 25X as many circuits.  And we also know that 2/3 of the final
>>>chip was in evaluation.  And being generous, 50% of the first chip was eval
>>>since he said "it is now up to 2/3 of the total chip area" which suggests that
>>>DT's chip was not nearly as sophisticated in the evaluation.
>>>
>>>Now let's do the math.  The new DB2 chip has about 50X as much evaluation
>>>circuitry as did deep thought.  The new DB2 machine was over 200X faster than
>>>deep thought.  The original deep thought was at _least_ as strong as today's
>>>best programs.
>>>
>>>Conclusion:  If DB 2 had 50X as much "smarts", plus was 200 times faster, do
>>>you _really_ conclude that DB2 is weaker than Frita or Junior?  _really_?  There
>>>is no way the math will support such a statement.
>>>
>>>A 200X faster DT would have been quite strong (that was basically what DB1 was
>>>all about in fact, more speed, but not a lot more smarts).  A 200X faster, and
>>>a 50X smarter program sure seems, at least to me, to be significantly beyond
>>>anything running today on PC hardware or even on supercomputers.
>>>
>>>How do you counter those arguments, each of which is technically accurate and
>>>given by Hsu's publications or email that I have supplied.
>>>
>>>To say that DF/DJ is stronger is absolutely nonsensical...  with nothing to
>>>support such a conclusion whatsoever...
>>
>>I believe that the rating of 1992 cannot be compared
>>with the rating of today so we cannot use the results
>>of deep thought against humans to learn that deep thought was at the
>>same level that Fritz is today.
>>
>
>What is this based on?  I would personally bet that 2655 in 1992 is
>better than 2655 today, with the inflation in ratings that has occurred.
>However, I left that out of the equation so that DT's 1992 rating would
>look as weak as possible when compared to today.

2655 in 1992 is better place in the world than 2655 today but it does
not prove that it is better level of play.

>
>
>
>>I believe that today humans know better how to play against computers
>>relative to their knowledge many years ago and I expect Deep Fritz
>>also to get more than 2650 against humans.
>>
>
>I don't believe this for one second.  Based on watching humans play programs
>like DF, DJ and Tiger in human events.  There is just nothing to suggest that
>this is the case at all.
>
>And I don't see anything at all that would suggest that DF would be better than
>2655 today.  But even if it is, what does the factor of 50 say about the DB2
>evaluation?  What does the factor of 200 say about DB2's speed?
>
>In one thread you said that Stobor at some ridiculous speed would whip all
>programs today.  And you think Stobor is even a small fraction of what DB2
>is all about?  Knowledge?  Search quality?  Etc?
>
>You have to pick one side of the speed argument and stay there.

Deeper blue had not the ridicilious speed that I suggested and I guess that the
search algorithm of it is better than deeper blue.

I also doubt if the 200M nodes per seconds were used efficiently and it is
possible that 200M nodes of deeper blue is eqvivalent only to 20M nodes of Deep
thought(one possible problem is searching the same node by more than one
processor and I can imagine that there were more problems,for example changing
the search algorithm to something slower that is more  simple to use).
>
>
>
>
>
>>I also doubt the claim that Deeper blue was
>>100 times faster and smarter than deep thought.
>
>There is no "claim". Chiptest searched 350K nodes per second or so.  Deep
>Thought used from 2 to 14 (14 hardly ever worked) of those chips.  Hsu claims
>that DT was doing about 2M in 1992.  He also has clearly stated that DB2 was
>doing 200M.  That is 100X faster.

this is the point that I am not sure that 200M is 100x faster
>
>He also stated that he went to a 25X as dense fabrication process, but stayed
>with the same size chip die.  He then said that for DB2 the evaluation had
>increased to about 2/3 of the total chip area.  That equates to at _least_
>25 times more evaluation and that only accounts for the denser circuitry.
>
>So what is there to doubt?
>
>
>
>
>
>>
>>My impression based on the some analysis of the logfiles is different.
>>I did not give programs hours to analyze every position
>>that Deeper blue pondered and I may change my opinion
>>if a comparison between the logfiles and analysis of top programs
>>can show me that deeper blue is really at least 100 times faster
>>than part of the top programs in significant number of positions.
>
>How can you tell?  DB2 didn't even _count_ nodes it searched.  It would
>take as long to count 'em as it would to search 'em (A direct quote that
>dates back to Ken Thompson).  You can't see the last 1/3 (or more) of its
>PV.  It is hard to compare knowledge unless you find positions that have a
>correct positional move that must be played and see whether everyone else
>can find the correct move or not.

I believe that if we analyze the games carefully we are going to find some non
trivial positinal move that must be played.

If we search the hundreds of positions and everyone for hours then I
expect to find at least 10 positions when it happens.

Uri



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.