Nothing from the Young Talents CD.Sorry I pressed enter in my last message and never got to finish!
My question was, do you use the newest versions of all the engines in the BOC? Like the ones found on the Young Talents CD?
Nothing from the Young Talents CD.Sorry I pressed enter in my last message and never got to finish!
My question was, do you use the newest versions of all the engines in the BOC? Like the ones found on the Young Talents CD?
All the programs had to be original creations from their authors.
All the programs had to be freely available without fee (other than media cost, phone connect time, or whatever)
All the programs had to be available at the start date of the contest.
Programs that made illegal moves or crashed were not to be allowed (one slipped through, though)
No changes to the programs once the contest started.
TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
The 100% against weaker opponents but it is not logical to punish TSCP for getting 100% against weaker opponents.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
Uri
It's logical according to the ELO system considering the number of games played. The ELO will be more plausible when they've all meet each other. You're more than welcome to construct a better system in your spare time.The 100% against weaker opponents but it is not logical to punish TSCP for getting 100% against weaker opponents.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.
Uri
Sjeng73 seems to be doing great...BUT...Programs that made illegal moves or crashed were not to be allowed (one >slipped through, though)
No changes to the programs once the contest started.
It is completely logical. There is not a single game between the programs in different divisions. If you saw some sort of leveling, it would be an indication that I am cheating.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
I looked only at this division.It is completely logical. There is not a single game between the programs in different divisions. If you saw some sort of leveling, it would be an indication that I am cheating.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
This post shows something that I have been trying to achieve -- there is a broad misunderstanding about how the ELO system works.
I have mailed the binary to you. I am using the executable dated 6-25-00 from your site.Sjeng73 seems to be doing great...BUT...Programs that made illegal moves or crashed were not to be allowed (one >slipped through, though)
No changes to the programs once the contest started.
Just before releasing 7.3 final on my site I downloaded
the one from your site and to my horror it seemed to be
one of the broken versions....
Could you do me a favor and check which version you are using
for BOTC ?
It SHOULD NOT print full PV's in short algebraic notation.
(i.e. just display a score and certainly not something
like 'Nf3 Nf6 e4 e5 Bc4'.)
If it does, please check your mail to make sure you didn't
receive anything newer before the contest started.
I'm sorry for the hassle, but I'm frearing that it WILL
lock up in the middle of a game. It did so during testing
and I don't see why it wouldn't happen again...for all I
know I must have been lucky so far.
But the calculated ELO of its opponents was lower, and the opponents were not the same.I looked only at this division.It is completely logical. There is not a single game between the programs in different divisions. If you saw some sort of leveling, it would be an indication that I am cheating.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
This post shows something that I have been trying to achieve -- there is a broad misunderstanding about how the ELO system works.
TSCP drew against Monik and got 100% in the other games so rating that is lower than Monik based only on the games in this division is illogical.
I also think that not using previous data to decide about the elo is not a good decision but this was not my point in my post.
I understand that it is not a ranking list but the list should tell us which program is probably better based on the games.But the calculated ELO of its opponents was lower, and the opponents were not the same.I looked only at this division.It is completely logical. There is not a single game between the programs in different divisions. If you saw some sort of leveling, it would be an indication that I am cheating.TSCP did 50% against Monik and 100% against stronger opponents so giving TSCP lower elo than Monik based on your data is illogical.Program Elo + - Games Score Av.Op. Draws
1 LarsenVB : 2672 262 274 8 87.5 % 2334 25.0 %
2 Storm : 2579 310 249 8 62.5 % 2490 25.0 %
3 Ozwald : 2553 198 325 8 43.8 % 2597 37.5 %
4 Noonian : 2551 325 198 8 56.2 % 2507 37.5 %
5 Monik : 2398 214 247 12 62.5 % 2309 8.3 %
6 Zephyr : 2364 215 215 12 50.0 % 2364 16.7 %
7 TSCP : 2340 180 402 12 83.3 % 2060 0.0 %
8 SnailSCP : 2240 214 194 12 62.5 % 2152 25.0 %
9 Raffaela : 1941 297 170 12 8.3 % 2358 16.7 %
10 Golem01 : 1747 0 0 12 0.0 % 2347 0.0 %
This post shows something that I have been trying to achieve -- there is a broad misunderstanding about how the ELO system works.
TSCP drew against Monik and got 100% in the other games so rating that is lower than Monik based only on the games in this division is illogical.
I also think that not using previous data to decide about the elo is not a good decision but this was not my point in my post.
This is *NOT* a ranking list, but rather, a rating list.
I think it's a great decision. Look at how much awareness of the ELO system has resulted because of it.
Yes, exactly.I think that the rating should tell us which program is probably better.I think it's a great decision. Look at how much awareness of the ELO system has resulted because of it.I also think that not using previous data to decide about the elo is not a good decision but this was not my point in my post.
I think that only games against programs with known rating should be counted for the rating list.
Return to Archive (Old Parsimony Forum)
Users browsing this forum: No registered users and 47 guests