Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: Will the Rebel Vs IM Russek match be counted in calculating Rebel's elo?

Author: Vicente Fernández

Date: 16:24:34 12/24/99

Go up one level in this thread


On December 24, 1999 at 18:26:19, John Warfield wrote:

>On December 24, 1999 at 15:19:22, Stephen A. Boak wrote:
>
>>On December 24, 1999 at 05:48:58, John Warfield wrote:
>>
>>>
>>>
>>>  Since these games are 40/2 Will this count in perdicting Rebels rating? I
>>>assume that It will count.
>>
>>Hi John,
>>
>>a bit of philosophy here (rhetorical questions, no need to answer)--
>>
>
>
>Yes you voiced alot of philosophy which I think is very unneccesary since my
>question was very simple, and not addressed to anyone in particular, but to the
>CCC Community at large.  There is not a clear consensus on what games should
>count on what hardware, some believe that the rebel game against hoffman should
>count and others believe that it shouldn't.  I think it is important to know
>that if rebel scores 6-0 against russek, will the CCC heavyweights like Robert
>Hyatt and the Rest Accept Rebel new elo which should be well over 2500. After
>all; these games would not have much meaning if the majority of CCC did not
>accept them as legitamate.  My question was designed to get a feel for how these
>games will be recieved. I see nothing at all rhetorical in the question, since
>some of your remarks shows that the answer is in no way clear, already there is
>a disagreement. You don't think they should count and I think they should.  My
>reason is this:  Based on some of the remarks I have read, it appears that many
>assume that money is the only real motivation to play good chess, I think Most
>chessplayers, especially titled players have alot of pride in their chess, Any
>match which is in public view would be enough incentive for anyone to play their
>hardest. Most USCF tournaments that I go to have very little money to offer,
>must people play for a rating, or for the opportunity to compare their abilities
>to other players. What shall we say about the ratings that are achieved at these
>tournaments?  They are not real because money was not involved?  Sometimes pride
>is an even stronger motivator then money.  So I think this match should by all
>means count. Just because ED Schoeder is not overseeing the match is no reason
>the match should not be counted.  Rebel Does not need Schoeder standing over it
>to be rebel.  A match is a Match, a Win is a Win, 40/2 is 40/2.
>
>
>>To whom do you address your question?
>>
>>Who can/should determine what will 'count'?
>>
>>Anybody can calculate (or predict) SSDF, FIDE, USCF, ELO-type, or TPR ratings,
>>using whatever data they have available, for any chess program.  Just get the
>>test data (including opponent ratings, conditions and results), grab a pencil
>>and some paper, or computer or calculator, and do the mathematics.
>>
>>The real issue, however, is not the math calculation itself.  It is the issue
>>you have raised--what should be counted (included) in the calculation.
>>
>>To be believable, the calculator should show the basis for the calculation, and
>>it is up to the reader to accept or reject, believe or disbelieve, doubt or not
>>doubt the published calculations.
>>
>>To be believable, the basis for the calculation should be believable
>>(trustworthy).  That is the underlying games, conditions and results should be
>>as free from error and controversy as possible.  Otherwise the published results
>>and hence the calculations of rating may be in doubt.
>>
>>Just like science--there are many experimenters, there are many published
>>results of experiments, and there are often many differing opinions on the
>>interpretation of results.
>>
>>When one interpreter points out a testing flaw, or uncontrolled variable that
>>might have affected the results of another person's published results of
>>experimenting, this may raise doubts about the meaning or usefulness or accuracy
>>of the published results.
>>
>>In the same way, there may be some doubts about some calculations of ratings,
>>but as long as the basis for the calculation is shown by the calculator, all are
>>free to interpret the calculation results for themselves.
>>
>>One can calculate the Russek match results in their own assessment of a Rebel
>>rating, if they so choose.
>>
>>My preference is to calculate ratings for Rebel products based on open testing
>>and verifiable results published by Ed Schroder and the Rebel Team.
>>
>>Ed has the highest motivation (Rebel programs are his business) to ensure his
>>testing is thorough and will stand up to public examination.
>>
>>He offers prize money to FIDE-rated GMs and IMs to ensure they are motivated to
>>give their best effort to play his program.
>>
>>Ed releases the testing conditions in advance, for comment by the world.
>>
>>His games are shown 'live' on ICC to the world, so all can see the moves of the
>>games in his testing.
>>
>>A trusted arbiter (good reputation) is at the scene of the human player to
>>ensure fair play.
>>
>>His titled opponents are free to make their remarks after the game, to the
>>world, about the game and the play of Rebel software.
>>
>>All viewers of the 'live' game on ICC may make observe and comments.
>>
>>Ed utilizes the fastest and best hardware he can afford and trust.
>>
>>He makes sure his settings are those he believes are best for the particular
>>test he undertakes (example--he determines if Anti-GM feature is switched on or
>>off, depending on his latest assessment of his own program).
>>
>>Although this (Anti-GM) may be a test variable that changes from game to game,
>>Ed choses which variables he wants to test in his experiments and which he
>>believes will allow his program to play the best it can play--I have no problem
>>with this, as long as it is disclosed and verifiable.
>>
>>The trusted arbiter has a copy of the program to verify all moves were made by
>>the software Ed claims he used to make the moves.
>>
>>Owners of the same software (or relatively recent, similar version) can verify
>>moves to a large degree using their copy of the Rebel software.
>>
>>As long as the Rebel Team continues to disclose its verifiable testing
>>conditions and as long as those conditions seem trustworthy to me, I will
>>continue to use those results in my calculations.
>>
>>Maybe I missed a posting, but I don't know what prize money, if any, is offered
>>to IM Russek, to ensure his motivation is there for the upcoming match with
>>Rebel.  In my mind, this aspect is an example of one of several things that
>>cause me to lean toward use of results published by Rebel Team and not others.
>>
>>Certainly we should not always trust every vendor of a product that makes claims
>>about their software--that goes to the trustworthiness issue and the old saying
>>'consider the source' and 'let the buyer beware'.  However, Ed and the Rebel
>>Team have not made any outlandish claims to my knowledge, they have not tried to
>>inflate their evidence to show Rebel is super-GM in strength (say, over 2600
>>FIDE ELO equivalent).  They have established a fine series of tests, under very
>>conrolled circumstances, verifiable to a highly reasonable degree, and have let
>>the chips fall where they may (win, lose or draw).  Their experiments seem to
>>show Rebel is about 2500 FIDE ELO strength at this time (my calculations using
>>their published results), although more games will perhaps strengthen both the
>>believability of the calculation and of the ELO estimate itself if it changes
>>after more games have been played.
>>
>>And I expect Ed to continue to improve his Rebel program, so it may even grow
>>stronger as he makes continuing programming additions or changes.  Time, and
>>good experiments, will tell.
>>
>>--Steve Boak

You made an important remark: Guil Russek is taking the match VERY seriously.
Although money is not involved, believe me, his reputation is. Before accepting
the match, the "live" broadcast and the ICC announcements, we did talk for long
about this subject. He accepted mainly because he wanted to match his skills
against a strong, serious opponent. He loves to play against the best humans
available because he learns from them. In this case, he is eager to learn from
Rebel, if so is the case. Moreover, he loves chess. Anyone could object that he
could play Rebel in the privacy of his home, without so much fuss, but Mr.
Russek, as most of the titled players do, rarely do play games against computers
at tournament times controls and, most important, he does loves challenges. I
can tell you that he will play his best. I can tell you that the rest of the
people involved will do our best too so that the match is fair. By the way: He
has never played Rebel Century. He has studied the games it played at the Rebel
Monthly Challenges and that's all. I hope everybody enjoys. Whatever the result
may be, you can take my word that every person involved will do his best.

Merry Christmas from Mexico City.
Vicente



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.