2011 Double Decathlon — LMI October Puzzle Test — 15th and 16th October62 posts • Page 1 of 3 • 1 2 3
What is your opinion of Instant Grading compared to other grading systems used here at LMI?
Please provide your specific feedback / suggestion about the grading system and/or the penalty system in the forum.
Option | Results
Instant Grading is a good system. Please use it again on other tests, with no changes. | 30 Votes - [76.92%]
Instant Grading is an okay system. Consider using it again, possibly with some changes or different penalty values. | 7 Votes - [17.95%]
Instant Grading is a bad system. Return to the more traditional format on future tests. | 2 Votes - [5.13%]

@ 2011-10-13 10:50 AM (#5786 - in reply to #5749) (#5786) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-13 10:50 AM

In response to helpful comments from play-testers, the graphical presentation of Almost Simple Loop will change from the original format in the instructions. Instead of black squares with white numbers and arrows, which can be harder to cross out when solving, the clues will now be in gray squares with black numbers and arrows as in the attached image. The word "black" in the puzzle instructions has also been replaced with the word "gray" to account for this presentation change.

A revised instruction booklet with new images for Almost Simple Loop and some other typographical fixes is now posted.



(AlmostSimpleLoop.png)



Attachments
----------------
Attachments AlmostSimpleLoop.png (2KB - 4 downloads)
@ 2011-10-13 11:54 AM (#5787 - in reply to #5749) (#5787) Top

debmohanty




Country : India

debmohanty posted @ 2011-10-13 11:54 AM

@ 2011-10-14 9:46 PM (#5788 - in reply to #5749) (#5788) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-14 9:46 PM

Password protected booklet uploaded. It has 10 pages, one page per puzzle type with the easy (20 point) and hard (50 point) puzzles on the same page. There is no cover page.

REMINDER: This test marks the debut of INSTANT GRADING, a new system of grading; if you have not yet practiced using this system, please go to this practice page before the contest using the instruction booklet answers for submission.

Edited by motris 2011-10-14 9:46 PM
@ 2011-10-15 3:44 AM (#5789 - in reply to #5749) (#5789) Top

ColinMacLeod



Posts: 3

Country : United States

ColinMacLeod posted @ 2011-10-15 3:44 AM

The end date for the contest displays as October 16/17, 2012 instead of 2011.

2011 Double Decathlon ends at 10/16/2012 5:00:01 PM local time | 10/17/2012 12:00:01 AM GMT
@ 2011-10-15 5:20 AM (#5790 - in reply to #5749) (#5790) Top

debmohanty




Country : India

debmohanty posted @ 2011-10-15 5:20 AM

changed to 2011.
@ 2011-10-15 8:18 AM (#5791 - in reply to #5749) (#5791) Top

figonometry



Posts: 30
Country : Canada

figonometry posted @ 2011-10-15 8:18 AM

I LOVE the instant scoring. Thanks!
@ 2011-10-15 8:21 AM (#5792 - in reply to #5791) (#5792) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-15 8:21 AM

figonometry - 2011-10-14 7:18 PM

I LOVE the instant scoring. Thanks!


Looks like it worked out for you exactly as intended (recovering points you might not otherwise have earned). We'll most likely run a poll right after the contest to gauge people's responses to the system. It obviously is not the easiest system to use for all puzzle tests, but seems to be an interesting compromise that online solving can allow.

EDIT: Actually, we've added the poll at the top of this thread now.

Edited by motris 2011-10-15 12:30 PM
@ 2011-10-15 2:33 PM (#5793 - in reply to #5749) (#5793) Top

neerajmehrotra



Posts: 329
Country : India

neerajmehrotra posted @ 2011-10-15 2:33 PM

Wonderful... thanks, Thomas, for such a nice puzzle test... of course it was much beyond my capacity...
@ 2011-10-15 10:06 PM (#5794 - in reply to #5749) (#5794) Top

mucha



Posts: 13

Country : Poland

mucha posted @ 2011-10-15 10:06 PM

Wow, either I'm out of shape or this test was really hard. Very nice puzzles, the ones I managed to crack at least. Also, really like instant scoring!
@ 2011-10-16 12:32 AM (#5795 - in reply to #5749) (#5795) Top

dave8mcrae



Posts: 2

Country : United States

dave8mcrae posted @ 2011-10-16 12:32 AM

So, I used the individual submit buttons, which kept updating a score on the left. But there was also something there that said "1 Correct, 0 Wrong" (or something like that). That figure didn't update. What was that supposed to tell me?
@ 2011-10-16 1:18 AM (#5796 - in reply to #5795) (#5796) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-16 1:18 AM

dave8mcrae - 2011-10-15 11:32 AM

So, I used the individual submit buttons, which kept updating a score on the left. But there was also something there that said "1 Correct, 0 Wrong" (or something like that). That figure didn't update. What was that supposed to tell me?


That figure was telling you what was true of your most recent submission. It will only ever show more information, like "3 Correct, 1 Wrong", if you submitted several answers at once using "submit all". This does seem like it could be slightly confusing, so we can review the report shown to those doing individual submits if we use this system again.

Edited by motris 2011-10-16 1:18 AM
@ 2011-10-16 5:01 AM (#5797 - in reply to #5749) (#5797) Top

forcolin




Posts: 172
Country : ITALY

forcolin posted @ 2011-10-16 5:01 AM

All contests on LMI are of good quality, but this one is well above the norm. Excellent stuff, I particularly liked the hard Loop the loop and battleship sudoku.
Also, the instant grading saved me a lot of points: two copying/typing errors and a genuine solving error which I could rectify. The negative effect is that I probably did not pay much attention when typing because I knew there was a second chance...
stefano
@ 2011-10-16 8:20 AM (#5798 - in reply to #5749) (#5798) Top

yureklis



Posts: 183
Country : Turkey

yureklis posted @ 2011-10-16 8:20 AM

First of all, I solved all the IB puzzles for preparation :) Normally I don't, but this time I tried to push myself to understand the puzzle rules and competition rules before the contest. Also I should say that the IB puzzles are really fun! After solving those I was looking forward to competing with the real ones.

Secondly, I am happy with my result, although I couldn't get the points that I should have, at least in my opinion. I solved one big puzzle in the last 5 minutes but did not have enough time to submit my solution. Also I had solved one puzzle of all types but I lost myself in some puzzles and of course it caused me to lose my strategy, and I couldn't. But I am happy with my performance.

Your puzzles are great! They are nice looking, with very satisfying solving paths; and of course the new point system is cool! You did a great job, thank you so much. Thanks to LMI and shining man Deb :)
@ 2011-10-16 11:23 AM (#5799 - in reply to #5749) (#5799) Top

joshuazucker



Posts: 31
Country : United States

joshuazucker posted @ 2011-10-16 11:23 AM

Thanks for a great test! I liked the scoring system, too, both the structure of the bonuses and the penalties with the instant grading. I enjoyed all the puzzles, but particularly the same two that forcolin mentioned, though I still need more time to finish the rest of the test to see if there are some gems there that I didn't want to attempt with time pressure.
@ 2011-10-16 2:29 PM (#5800 - in reply to #5749) (#5800) Top

rob



Posts: 170
Country : Germany

rob posted @ 2011-10-16 2:29 PM

Loved the test, and the scoring system. The instant grading might have made me a little more careless than usual. Three genuine mistakes in reading off the code feels like a lot for me. I'm amazed I was able to make the same mistake on both "Almost Simple Loop" puzzles!

It did seem the instant grading slightly affected my solving: On one or two puzzles, after I finished them up with some intuition, I used the submit button to verify the solution, instead of double checking by hand.
@ 2011-10-16 7:27 PM (#5801 - in reply to #5749) (#5801) Top

vopani



Posts: 739
Country : India

vopani posted @ 2011-10-16 7:27 PM

Very good set of puzzles. Thanks Thomas!

I liked everything about the scoring system. I just wanted to throw open a point that comes to my mind. Should we have different penalties for different puzzles? (High-point puzzles have greater penalty?) Maybe not very large, but at least some amount of distinction.
@ 2011-10-16 10:22 PM (#5802 - in reply to #5801) (#5802) Top

detuned



Posts: 152
Country : United Kingdom

detuned posted @ 2011-10-16 10:22 PM

So with this new system, I think I was more careful about entering keys than normal, conscious of the four point penalties. And no mistakes!! (at least mistakes I didn't catch, seems I'm a little rusty from not doing any LMI tests in ages). So yeah, thumbs up from me on this system. I'm sure it'd save me lots of future grief, however I'm not sure it should be implemented on every test. Instantly knowing when you have a puzzle right or wrong doesn't accurately match up with an offline solving experience, for instance...
@ 2011-10-16 11:57 PM (#5803 - in reply to #5802) (#5803) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-16 11:57 PM

Rohan Rao - 2011-10-16 6:27 AM
I liked everything about the scoring system. I just wanted to throw open a point that comes to my mind. Should we have different penalties for different puzzles? (High-point puzzles have greater penalty?) Maybe not very large, but at least some amount of distinction.


I thought a lot about different implementations. The existing typo standard of 80% would certainly suggest a larger penalty, but given the time put into solving a puzzle versus the time to enter the submission, I think that is excessively punitive (should it be 4 points and 10 points on this test, for example?). One change I would consider, looking over the results, is an escalating penalty for making many errors on the same puzzle. It also seems possible to use the time taken to fix an error to split the cases (typos are fixed quickly, puzzle errors most often take 2+ minutes), but this could also be risky for some types of errors.

Considering all these options, I actually prefer the simplicity used here, just one kind of penalty and it is the same everywhere.
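
A quick sketch of the penalty options weighed in this reply, for anyone who wants to compare them on this test's 20- and 50-point puzzles. The flat 4-point deduction and the 20% typo deduction come from the thread; the function names, the 4/8/12 escalation schedule, applying the 20% rule to every wrong entry, and flooring a puzzle's score at zero are all assumptions made for illustration, not the actual LMI scoring code. Time bonuses are left out.

```python
def flat_penalty(points: int, wrong_entries: int, penalty: int = 4) -> int:
    """Same fixed deduction per wrong entry, whatever the puzzle is worth."""
    # Assumption: a puzzle's score never drops below zero.
    return max(points - penalty * wrong_entries, 0)


def proportional_penalty(points: int, wrong_entries: int, percent: int = 20) -> int:
    """Deduct a fixed percentage of the puzzle's value per wrong entry."""
    # Assumption: the 80% typo standard is applied to every wrong entry.
    per_entry = points * percent // 100
    return max(points - per_entry * wrong_entries, 0)


def escalating_penalty(points: int, wrong_entries: int, first: int = 4, step: int = 4) -> int:
    """Hypothetical escalation: 4 for the first error, 8 for the second, 12 for the third."""
    total = sum(first + step * i for i in range(wrong_entries))
    return max(points - total, 0)


if __name__ == "__main__":
    for points in (20, 50):  # easy and hard puzzle values on this test
        for wrong in (1, 2, 3):
            print(
                points,
                wrong,
                flat_penalty(points, wrong),
                proportional_penalty(points, wrong),
                escalating_penalty(points, wrong),
            )
```

With one wrong entry, the flat rule leaves 16 of 20 and 46 of 50 points, while the proportional rule leaves 16 and 40. That 10-point hit on a hard puzzle is the case described above as excessively punitive.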

detuned - 2011-10-16 9:22 AM
So with this new system, I think I was more careful about entering keys than normal, conscious of the four point penalties. And no mistakes!! (at least mistakes I didn't catch, seems I'm a little rusty from not doing any LMI tests in ages). So yeah, thumbs up from me on this system. I'm sure it'd save me lots of future grief, however I'm not sure it should be implemented on every test. Instantly knowing when you have a puzzle right or wrong doesn't accurately match up with an offline solving experience, for instance...


I expect a few people to make this argument and it may be why several of the better solvers (uvo, melon, Para) have voted negative or neutral on this system. My view is offline contests are offline contests and online contests are online contests. They can borrow from each other at times and innovate and do new things at other times. I would compare this system to being in a playoff round at a WPC and turning in each puzzle as finished. After a fixed amount of time you get a signal if you are correct or not. So it is an offline test mode, just not one people have a lot of experience with. The penalty is set to act like the equivalent WPC penalty, which costs you a small amount of points/time, but puts you back in control of fixing whatever mistake you made.

The results in the stat page so far should reveal that a lot of the errors solvers make are "online only": their paper probably has a correct solution, but entering a particular piece of information doesn't come through with high fidelity. Since I'm not grading entire grids, I'm happy to experiment with a system that helps separate the "online only" errors from other errors. I think this has gone very well on this test, and Deb has done a very good job realizing the scoring system I wanted.

Edited by motris 2011-10-17 12:05 AM
@ 2011-10-17 2:46 AM (#5804 - in reply to #5749) (#5804) Top

detuned



Posts: 152
Country : United Kingdom

detuned posted @ 2011-10-17 2:46 AM

motris: I have a lot of time for your argument, and I don't think anyone can argue that this hasn't been one of the better LMI test scoring innovations (noting that as I've previously argued, these LMI tests are the perfect playground for these innovations). I'd definitely like to see this repeated in future tests. Just, I guess, not *all* of them.
@ 2011-10-17 2:48 AM (#5805 - in reply to #5749) (#5805) Top

jalbert



Posts: 6

Country : United States

jalbert posted @ 2011-10-17 2:48 AM

I got booted off the internet before I had a chance to enter my answers. I guess I should have been entering them as I solved them, but is there anything I can do now?
@ 2011-10-17 2:53 AM (#5806 - in reply to #5805) (#5806) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-17 2:53 AM

jalbert - 2011-10-16 1:48 PM
I got booted off the internet before I had a chance to enter my answers. I guess I should have been entering them as I solved them, but is there anything I can do now?


This is unfortunately a problem that comes up with online tests and there is nothing we can do to give you an "official score" after the two hour clock has run out. Submitting before the end of the test (sometimes putting in what you have done after 90 of 120 minutes) is probably a good approach for the future. I hope you had fun with some of the puzzles despite the answer entry frustration.
@ 2011-10-17 4:27 AM (#5807 - in reply to #5749) (#5807) Top

pvondrak



Posts: 3

Country : United States

pvondrak posted @ 2011-10-17 4:27 AM

I enjoyed the test and the immediate scoring. I did notice a bit of a difference (about 45 seconds?) between the timer and the submitted time. I ran out of time on the last one according to the countdown, and submitted it (and the resubmit after a typo), and it accepted it, showing as within the 120 minutes in the scoring details. Not sure if that's atypical, or there's a short amount of cushion or something?
@ 2011-10-17 5:30 AM (#5808 - in reply to #5807) (#5808) Top

debmohanty




Country : India

debmohanty posted @ 2011-10-17 5:30 AM

pvondrak - 2011-10-17 4:27 AM

I enjoyed the test and the immediate scoring. I did notice a bit of a difference (about 45 seconds?) between the timer and the submitted time. I ran out of time on the last one according to the countdown, and submitted it (and the resubmit after a typo), and it accepted it, showing as within the 120 minutes in the scoring details. Not sure if that's atypical, or there's a short amount of cushion or something?

The timer is certainly not designed to work that way, and this is the first time I'm hearing of it. Sometimes the timer could run fast or slow, but the difference between the countdown timer and server time should be 2-3 seconds at most.

We'll try to replicate this behavior at our end.
@ 2011-10-17 6:59 AM (#5809 - in reply to #5808) (#5809) Top

motris



Posts: 199
Country : United States

motris posted @ 2011-10-17 6:59 AM

Double Decathlon is over and results can be viewed here. I hope you enjoyed the puzzles as well as the challenge of the contest (whether your goal was completing just the easy puzzles, or going for larger goals).

Five people completed all twenty hurdles. The top three on the podium are MellowMelon (1182.5), deu (1133.1), and xevs (1094.9). Also finishing were ppeetteerr and uvo. Congratulations to them. Overall 237 players started the test and 192 had non-zero scores.

This test marked the debut of Instant grading which - from the administration side of things - seemed to have worked as planned with no technical problems despite being a very new system. Watching the solutions throughout the test, I'll say that the system served its purpose of helping solvers get points on puzzles they had solved, with a large majority of all incorrect entries eventually being corrected, many within just 30 seconds suggesting they were typos. We are very interested in hearing your comments, both good and bad, about instant grading, so if you have not yet voted in the poll at the top of this page, please do so, and leave other comments here in the forum.
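
As a rough illustration of reading the correction times this way: the sketch below labels a corrected entry by how long the fix took. The 30-second cutoff comes from the paragraph above and the two-minute cutoff from the earlier remark that puzzle errors most often take 2+ minutes to fix. The `Correction` record, the `classify` function, the "unclear" middle band, and the sample timings are all invented for the example and are not taken from the contest data.

```python
from dataclasses import dataclass


@dataclass
class Correction:
    puzzle: str
    seconds_to_fix: float  # time between the wrong entry and the accepted one


def classify(c: Correction, typo_cutoff: float = 30.0, solving_cutoff: float = 120.0) -> str:
    """Guess the kind of error from how quickly it was corrected."""
    if c.seconds_to_fix <= typo_cutoff:
        return "likely typo"
    if c.seconds_to_fix >= solving_cutoff:
        return "likely solving error"
    # Between the two cutoffs the timing alone does not say much.
    return "unclear"


if __name__ == "__main__":
    # Made-up timings, only to show the three outcomes.
    samples = [
        Correction("Almost Simple Loop (easy)", 12.0),
        Correction("Almost Simple Loop (hard)", 75.0),
        Correction("Battleship Sudoku (hard)", 240.0),
    ]
    for c in samples:
        print(c.puzzle, "->", classify(c))
```

As noted earlier in the thread, a split like this could be risky for some types of errors, so it is better read as a way to summarize the stats than as a basis for different penalties.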

I will be writing about these 20 puzzles over the next ten weeks (roughly one post a week, taking the place of my Friday Puzzle) to share insights into their construction and also give solving strategies. You can look for that discussion on my blog.

I would like to especially thank Deb Mohanty for his assistance in getting the new scoring system in place and for general administrative help on the test. I'd like to also thank Wei-Hwa Huang for specific test-solving help and recommendations on puzzle formatting. Congrats again to Palmer on winning the Decathlon.
@ 2011-10-17 8:15 AM (#5810 - in reply to #5749) (#5810) Top

uvo



Posts: 21
Country : Germany

uvo posted @ 2011-10-17 8:15 AM

About the scoring system: I like the easy way to correct typos for a slight penalty, but I strongly dislike being told where I made a "real" mistake. Unfortunately, I don't see an easy way to separate those. Funnily enough, I managed to do both at the same time - I made a miscount in an already incorrect solution :-)

To my knowledge, the existing 80% standard (not sure it deserves that label) was introduced at the German online qualification 2010. In that competition, we had puzzles for 10 and 60 points (and almost anything in between, of course), and we decided a fixed penalty could not be appropriate for both. The 20% penalty was just an easy way to keep integer scores. Anyway, I think it is right to have some kind of penalty, and I don't mind which one.

@detuned: Funny that you mention you were more careful entering your keys - for me, it was definitely the other way round.