Asian Sudoku Championship 2025
Sudoku Champs 2024
LMI Screen Test #198 posts • Page 4 of 4 • 1 2 3 4
@ 2010-12-07 10:10 AM (#2811 - in reply to #2692) (#2811) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 10:10 AM

The next two puzzles IrregularSplit and CorrectXV turned out to be as bad as the 45-seconds-sudoku puzzles, very few players solving them.

The IrregularSplit was not spectacular, it doesn't work like a sudoku, but more of a jigsaw fitting puzzle. It all depends upon how quickly one can do all the combinations.
While IrregularSplit had a decent success rate, CorrectXV turned out to be least favorite puzzle fetching -0.16 points per attempt.

I'll explain below how we expected players to solve it, but since most players didn't get it, I accept that there was not enough time for this puzzle.
Just like the example - the four 4 grids were similar looking. Either an X or a V was the only difference between the grids.
We thought players would focus on that, rather than solving the grid.

In grid B, there was an additional V on Column 6, Which made column 6 unsolvable.
In grid C, there was less V on Column 3, which made column 3 unsolvable.
In grid A, there was an additional V in Row1 (which means there was no place for 5 in Row1 and hence grid A is also unsolvable)


The example used a similar technique, and somehow we thought that players will be able to solve this in 60 seconds. It was not to be :-)
@ 2010-12-07 10:18 AM (#2812 - in reply to #2692) (#2812) Top

macherlakumar




Posts: 123
10020
Country : India

macherlakumar posted @ 2010-12-07 10:18 AM

It was really good from the regular tests. I think we should go for this kind of test regularly may be once in a month. I am suggesting this because the time for every puzzle is fixed, and many of the times the best comes out when we are under pressure, this is a perfect test for bringing the best from the players and improving their skills.

Thanks a lot for all who made this initiative very successful.
@ 2010-12-07 10:19 AM (#2813 - in reply to #2692) (#2813) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 10:19 AM

Next one is CarelessArrows another visual puzzle. But unlike many other visual puzzles, it can be quickly checked whether your answer is correct or not.
I liked this puzzle very much.
It is only because of this puzzle that we had a DropDownList rather than buttons for answer keys.

Mastermind is like standard Mastermind. Given that options were given, it again was more like a check-check-check puzzle.

ImagesCount - When I started thinking about ST#1 around 3 months back, ImagesCount was the first puzzle I created. Originally it had 9 images. We reduced it to 6. Given that there were still 120 circles, it seemed too difficult.
Also that, the reflection possibility was missing from the example image (though the instructions had it), it made matters worse.
There could have been a better puzzle!

MiddleStick - Again one of the earliest puzzles. See image below for original version. One of the wisest decision to drop two sticks and make it 2-colored.
@ 2010-12-07 10:32 AM (#2814 - in reply to #2692) (#2814) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 10:32 AM

That brings us to the last 4 puzzles -

MissingDigitKakuro This type went to several iterations, we made multiple puzzles of this type and we thought we chose the easiest one.
It needed to be solved only in the left top region ( i would think that most players start looking that region first :-)
May be this could be have been easier. A smaller range 1-6 would definitely have helped.



CodedSkyscraper - Our original thought was that CodedSkyscraper and CorrectXV are the only 2 tough puzzles which would challenge the top solvers.
In Coded, it takes a while to realize that {A,D}={3,4} and 3+4 = 7
Looking at either column3 or row3, D can't be 4. so A=4
Given the very low success rate for this puzzle, it was not best fit in 60 seconds.


RareSkyscraper - Irrespective of the success rate, I think that this was an easy puzzle. Especially if one had solved the example carefully and found the pattern of 9-8-7-6....

BrilliantColour
The best puzzle of ST1 in my opinion, and we intentionally had kept it at the end.
@ 2010-12-07 10:49 AM (#2815 - in reply to #2692) (#2815) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 10:49 AM

Finally, the technical part - It was a good technical challenge to implement the Screen Test since it was very different from what we've done so far at LMI.
But we all love challenges in some way, don't we?

We are glad that ST#1 worked as flawlessly as we designed. Thanks to everyone for sharing improvement ideas, we'll keep them in mind if/when we do ST#2.
@ 2010-12-07 10:57 AM (#2816 - in reply to #2812) (#2816) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 10:57 AM

macherlakumar - 2010-12-07 10:18 AM

It was really good from the regular tests. I think we should go for this kind of test regularly may be once in a month. I am suggesting this because the time for every puzzle is fixed, and many of the times the best comes out when we are under pressure, this is a perfect test for bringing the best from the players and improving their skills.

I've no idea if making Screen Tests a monthly contest makes sense or whether it is feasible to write good puzzles so frequently. We've to wait and see.

One of the most satisfying things from ST#1 is there were lot of Indian solvers. Also, it really feels good to see many old solvers back (Omkar/Ritesh, are you reading this?)
30 minutes time and no printer requirement seem to be the key things that worked here.

Keshava Murthy continued from his FLIP performance, and got the best Indian rank (he admitted that he made some lucky guesses :-)
Rajesh did reasonably well, though I think he was very conservative.
Rishi stood 3rd, and it is good that he is spending more time on puzzles now.

Rakesh had a disappointing test - he should have got much more.
Amit - I'm not exactly sure what the problem was - but he certainly could have got more points.

Congratulations to everyone and thank you for participating.
@ 2010-12-07 11:54 AM (#2817 - in reply to #2816) (#2817) Top

macherlakumar




Posts: 123
10020
Country : India

macherlakumar posted @ 2010-12-07 11:54 AM

Thank you Deb for sharing your views on my suggestion.
I agree that it is not easy to design these kind of puzzles which can fit the time frame (60 or 45 sec).
I think we should increase and decrease time depending on the difficulty level of the puzzle.
We can work on this tests if some of the solvers can become authors and contribute some puzzles, may be not 25 puzzles all the time but decent enough number of puzzles with varying difficulty.
Let see how this goes I also will try to desgin puzzles which suits this test (although I have never designed puzzles )

Wish me luck.

Thank You.
@ 2010-12-07 1:51 PM (#2819 - in reply to #2817) (#2819) Top

rakesh_rai




Posts: 774
500100100202020
Country : India

rakesh_rai posted @ 2010-12-07 1:51 PM

It was a nice test, and a good change. And, most importantly, it introduced the ST concept to solvers like us who have never been to a WPC. The interface worked well overall and the puzzle selection was good. The main challenge was to create puzzles which can be solved without use of paper, and the test was largely successful at that.

Deb: Thanks for your views on the puzzles. They are very comprehensive and almost cover everything that I might have wanted to comment on.

One big mistake which I made in this test was to try to check/verify my answer. Inevitably, the time for the puzzle would run out. I'd try to note the answer for entering during the review period. But, during the review period, I instead concentrated on solving one "almost-finished" puzzle, ultimately getting minus points for that, and even not able to enter the planned ones. A better approach would be to enter answers as soon as you arrive at them. No need to double check. And, to not leave too much to do in the review period.

Many people have commented on the amount of time being inadequate. But, is there really a solution for that? If you make the time 90 seconds for each puzzle, inevitably the complexity would also increase a little bit. I think the timing was alright for 80-90% of the puzzles, as eight solvers were able to cross the par score - 1350.

The only cases which need a re-look are the ones which had the least solvers -
(1) Cell Sudoku (the main problem was the size of the sudoku, which put off most solvers...the in-out concept was actually very well applied)
(2) Correct XV (I was able to eliminate A and B in 60 seconds, then could not look at it in the review period)
(3) Coded Skyscraper (size of the grid was the main obstacle, again)
(4) Missing Digit kakuro (I think, due to the time pressure, most solvers were trying to eliminate digits rather than solve the kakuro. At least this was my approach, so I was able to eliminate 1,2,3,4,5,8,9 easily. But could not arrive at the final answer - between 6 and 7)

Maybe, one suggestion for ST2 could be to increase the review period a little bit (~150 seconds)...I was unable to do what I intended to do in the review period...it went off pretty fast. And, perhaps adequate time for sudoku/kakuro puzzles.

An interesting observation: There were 6-7 solvers who got more than 110 points for a single puzzle, i.e. solved a 60-point puzzle in less than 10 seconds (and two of them twice). But none of them finished in the top 50.
@ 2010-12-07 2:33 PM (#2820 - in reply to #2692) (#2820) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 2:33 PM

Rakesh - thanks for your views.

After the end of day1 of the test, nobody had solved Cell Sudoku. Thats when we realized that it is trickier that we had expected.
The idea was borrowed from Mock9 Round 1 Page 8
Rohan also got the answer quickly, and we thought it should be solvable. But we were wrong by a huge margin.
I'm not exactly sure if a 6X6 grid with similar trick would have been much better.

As I wrote, I don't have a problem with CorrectXV and Coded Skyscraper being least solvable. We were expecting only the top solvers to do it.

Regarding Review Period :
I have mixed feeling about the review period. Does it really help?
In my original plan, Review Period didn't exist.
But Amit suggested that we must have a review period of at least 120 seconds. Rohan was not very certain too.
I was thinking of an "Add-on" time for each puzzle.
Every puzzle will have an "Add 15 seconds" button. If you are close to solving it but still need few more seconds, you could add 15 seconds to the puzzle (at the cost of losing x% of the points).
To me that is better than Review Period - here is why I think so -
You are working on a puzzle, it is in your brain. It is easy if you have 15 more seconds to solve the puzzle.
If you switch to another puzzle, and again look at it in Review Period, you again have to bring the puzzle back in your brain.

We discarded Addon time for ST#1 because it looked complex to us.

@ 2010-12-07 2:49 PM (#2821 - in reply to #2819) (#2821) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 2:49 PM

rakesh_rai - 2010-12-07 1:51 PM
An interesting observation: There were 6-7 solvers who got more than 110 points for a single puzzle, i.e. solved a 60-point puzzle in less than 10 seconds (and two of them twice). But none of them finished in the top 50.

Some players made heavy guesses... Solving MinimumMines in 2 seconds, KnightSafe in 2 seconds.
I'm not sure how to prevent that. Suggestions welcome.

I also have mixed feelings about the -ve scoring that we had.
Ideally -ve scoring should be applicable to random guesses.

Let me take an example -
BeadCount - It is a visual puzzle. J has 11 beads, A has 10.
J is the correct answer. Some players made minor counting mistakes and answered as A. [ A is most incorrect submission ]
But there are players who would either randomly guess or made major counting mistakes, and answered as C / D / E or whatever else.

158 | J
17 | A
3 | C
2 | G
2 | D
1 | F
1 | E

Take MultiESB - The correct answer is 4. If someone answers 5, it could be a counting mistake. But how on earth can the answer be 7 (there are only 10 circles - so the answer could be max 5 without even drawing any loop)
54 | 4
43 | 5
14 | 6
7 | 3
6 | 7

The point I'm trying to make is -
the negative scoring didn't consider this difference - a minor counting/solving mistake OR a random guess.
May be there should not have been any negative scoring for the most incorrect submission.

The negative scoring was introduced to dissuade players from randomly guessing. But given the nature of the -ve scoring, lot of players played conservatively.
@ 2010-12-07 3:26 PM (#2825 - in reply to #2821) (#2825) Top

macherlakumar




Posts: 123
10020
Country : India

macherlakumar posted @ 2010-12-07 3:26 PM

I think the drop down should be active after 4-5 sec this will force few of the solvers who try to guess to solve the puzzle since they dont't know what are the options

For cell sudoku:
The options will be clear because of grid size, to prevent from guessing, the answer must be manipulated to some other form may be some mathematical operations, in this case both the mathematical expression to be used and the drop down will be visible at the same time.

For Lucky Column:
The options will be clear because of alphabets used for the naming of columns, to prevent from guessing, numbers can used to name the columns an then the answer must be manipulated to some other form may be some mathematical operations, in this case both the mathematical expression to be used and the drop down will be visible at the same time.

For ABC Connect:
Same as lucky column replace the alphabets with numbers.

For Black and White:
Manipulate the answer using a mathematical expression which will be shown at same time of dropdown.

For IrregularSplit:
Interchange numbers and alphabets, again same process manipulating the answer.

For Middle stick:
Replace alphabets with numbers, again same process manipulating the answer.

For Missing digit Kakuro, Coded Skyscraper, Rare Skyscraper, Brilliant Color:
Same process of manipulating the answer, replace the colors with numbers in brilliant color puzzle.

Are there any puzzles in ST1 that can be solved in less than 4 sec?

The number of options in the dropdown should increase also not all the answers will fit the mathematical expression (to avoid reverse calculations).

I hope I explained it clearly, any modifications or suggestions to this are welcome.

Thank You.
@ 2010-12-07 4:26 PM (#2829 - in reply to #2825) (#2829) Top

rakesh_rai




Posts: 774
500100100202020
Country : India

rakesh_rai posted @ 2010-12-07 4:26 PM

macherlakumar - 2010-12-07 3:26 PM

I think the drop down should be active after 4-5 sec this will force few of the solvers who try to guess to solve the puzzle since they dont't know what are the options
Are you suggesting something analogous to "answer key being displayed after some time in sudoku tests"? How does this change the situation? The heavy guessers will punch their answers in 5-6 seconds.

...to prevent from guessing, the answer must be manipulated to some other form may be some mathematical operations, in this case both the mathematical expression to be used and the drop down will be visible at the same time
...
I did not understand this totally, perhaps. But it seems you are suggesting that we use a drop down (as in ST1) PLUS an answer key. Is that correct? And that the answer key should not be known beforehand. The idea is good and will certainly bring down guesswork but whether it goes with the spirit of a Screen Test or not needs to be determined...

Are there any puzzles in ST1 that can be solved in less than 4 sec?
Maybe not 4, but definitely within 10 seconds.

The number of options in the dropdown should increase also not all the answers will fit the mathematical expression (to avoid reverse calculations).
Can you explain your point with some example here? I am unable to understand it fully.
@ 2010-12-07 4:30 PM (#2830 - in reply to #2829) (#2830) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 4:30 PM

rakesh_rai - 2010-12-07 4:26 PM
The number of options in the dropdown should increase also not all the answers will fit the mathematical expression (to avoid reverse calculations).
Can you explain your point with some example here? I am unable to understand it fully.
I am not able to understand it at all
@ 2010-12-07 4:33 PM (#2831 - in reply to #2821) (#2831) Top

rakesh_rai




Posts: 774
500100100202020
Country : India

rakesh_rai posted @ 2010-12-07 4:33 PM

debmohanty - 2010-12-07 2:49 PM

Some players made heavy guesses... Solving MinimumMines in 2 seconds, KnightSafe in 2 seconds.I'm not sure how to prevent that. Suggestions welcome.
Do you really want to prevent that?

Guessing is inevitable if you have multiple options as answers. But then, if you do not have options, the test will look more like a normal test. And, heavy guessers got more wrong than right. So its not as if someone is going to finish in top 20 by heavy guessing.

Still, if you want to reduce the probability of success of guesses, simply increase the number of options.
@ 2010-12-07 5:08 PM (#2832 - in reply to #2692) (#2832) Top

macherlakumar




Posts: 123
10020
Country : India

macherlakumar posted @ 2010-12-07 5:08 PM

Hi guys I am extremely sorry for not making my points clear enough .

As rakesh said to increase the number of options to prevent from guessing in not always possible, like in the case of missing digit kakuro, rare skyscraper, irregular split etc. the number of options are fixed or rather can be decreased but not increased.

The idea is
1) There will be a mathematical expression in which there will be one variable, i.e is the answer obtained after solving the puzzle. Say 6(A) + A^2 is the expression where A is the answer.
Player after solving the puzzle needs to calculate the result of the expression and then choose it from dropdown box, so now we can increase the number of options.

2)Regarding the reverse calculation I guess I made a mistake it should have been like this, all the possible options that will result from the mathematical expression and some other numbers (to increase the options) should be there in the dropdown.

3)The dropdown and the mathematical expression will be active only after 4-5 sec (which I think should be fine) so that no one gets to know the answers or the expression before hand.

Even after all this if a player is able to guess the answer then it can't be helped, but I think this will decrease the number of playes from guessing.
@ 2010-12-07 5:23 PM (#2833 - in reply to #2832) (#2833) Top

debmohanty




1000500100100100202020
Country : India

debmohanty posted @ 2010-12-07 5:23 PM

Thanks for explaining, and I now understand what you are saying.
But it would simply increase the solving time unnecessarily, and adds an error-prone element to the puzzle.

To answer Rakesh's question "Do you really want to prevent that? " - I don't think we really need to do anything to prevent it. Definitely not by adding more complexities for genuine solvers.
@ 2010-12-07 9:24 PM (#2840 - in reply to #2692) (#2840) Top

keshava.hs



Posts: 10

Country : India

keshava.hs posted @ 2010-12-07 9:24 PM

Hi,

I feel, we should not do anything to prevent Guess work, True Champions will always emerge in a long run.
Guess work will not produce results always, Just to prevent Guess work, we should not complicate Puzzle Interface.
Easier/simple Interface will attract more puzzlers, I think which is also one of the main aim of LMI.

This is my personal feeling.

Thanks again for your precious time in organizing wonderful event.

Best Reagrds,
Keshav

Edited by keshava.hs 2010-12-07 9:45 PM
LMI Screen Test #198 posts • Page 4 of 4 • 1 2 3 4
Jump to forum :
Search this forum
Printer friendly version