Luhn: Update testcase to avoid wrong solution #1480

tqa236 · 2019-03-16T10:59:13Z

The change might seem trivial at first. It isn't. Here's a Java solution that passes all the tests:

import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

class LuhnValidator {
  public boolean isValid(String candidate) {
    String number = candidate.replaceAll("\\s+", "");
    if ("0".equals(number)) return false;
    List<Integer> digits = number.chars().map(d -> d - 48).boxed().collect(Collectors.toList());
    return IntStream.range(0, digits.size())
                .map(
                    n ->
                        (n % 2 == digits.size() % 2)
                            ? (2 * digits.get(n) - 9 * (digits.get(n) >= 5 ? 1 : 0))
                            : digits.get(n))
                .sum()
            % 10
        == 0;
  }
}

In this solution, after I remove all the whitespace, I convert each character to its ASCII code and subtract 48 (the ASCII code of 0) to get the digits. Then I run the check on the resulting list, and it passes all test cases. I didn't remove invalid characters at all.

The problem is that, in the current test cases, the ASCII codes of the invalid characters happen to make the checksum fail, so the solution never needs to remove them.

With the new test case, the sum is 90, which is divisible by 10, yet the expected result is still false because of the invalid character.
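To make the arithmetic concrete, here is a minimal Python sketch of the checksum that the ASCII-offset approach computes. The function name `ascii_offset_sum` is made up for illustration; the doubling rule mirrors the `n % 2 == digits.size() % 2` test in the Java snippet above.

```python
def ascii_offset_sum(candidate):
    """Checksum as the flawed approach computes it: ord(c) - 48 for every character."""
    number = candidate.replace(" ", "")
    values = [ord(c) - 48 for c in number]
    total = 0
    for i, v in enumerate(values):
        # Positions whose index parity matches the length's parity are doubled.
        if i % 2 == len(values) % 2:
            total += 2 * v - 9 * (1 if v >= 5 else 0)
        else:
            total += v
    return total


print(ascii_offset_sum("055a 444 285"))  # 89, not divisible by 10, so rejected by luck
print(ascii_offset_sum("055b 444 285"))  # 90, divisible by 10, so wrongly accepted
```

Replacing a with b raises the sum by exactly one, which is what moves it onto a multiple of 10.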

Sync

coriolinus

Not having convenient access to a Java dev environment, I've rewritten your example in Python, as directly as possible:

>>> def isValid(candidate):
...     number = candidate.replace(' ', '')
...     if number == '0':
...             return False
...     digits = [ord(c) - 48 for c in number]
...     return sum(2*digits[i] - (9*(1 if digits[i] >= 5 else 0)) if i % 2 == 0 else digits[i] for i in range(len(digits))) % 10 == 0
...

I've confirmed that where the old test case correctly returned False by accident, the new test case will incorrectly return True, failing the test case.

>>> isValid("055a 444 285")
False
>>> isValid("055b 444 285")
True

It's not a perfectly general solution, but it's an elegant way to stop this particular wrong algorithm from succeeding. I like it.

petertseng · 2019-03-16T18:13:44Z

I also don't have access to a Java environment, so I'll have to rely on others' help to answer this question: what does it do to ":9" from #1246? The ":" has ASCII value 58, which becomes 10. Doubled, that becomes 20; subtract 9 to get 11, then add 9 to get 20, so it would be accepted as well. Is there any difference in the behaviour of the supplied code such that it already rejects ":9"?

coriolinus · 2019-03-16T19:40:59Z

Using the previous Python def of isValid:

>>> isValid(':9')
True

Given that the expected value of this case is False, that case should already exclude this solution, assuming as always that the Python implementation produces results identical to the Java one.

tqa236 · 2019-03-16T19:48:50Z

It seems that this solution is already ruled out by that test case. My mistake.
My original solution, which does pass all the tests, is a little different: it uses integer division. I forgot to recheck it.

(2 * digits.get(n) - 9 * (digits.get(n) / 5))
 // ? (2 * digits.get(n) - 9 * (digits.get(n) >= 5 ? 1 : 0))

Given that a digit is between 0 and 9, so that 2*digit ranges from 0 to 18, the two formulas give the same result for valid digits.
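The difference only shows up outside 0 to 9. A small Python sketch of the two doubling formulas, with hypothetical helper names, shows why the integer-division variant happens to reject ":9" while the ternary variant accepts it:

```python
def doubled_ternary(d):
    # Mirrors: 2 * d - 9 * (d >= 5 ? 1 : 0)
    return 2 * d - 9 * (1 if d >= 5 else 0)


def doubled_intdiv(d):
    # Mirrors: 2 * d - 9 * (d / 5). Python's // matches Java's int division
    # only for non-negative values, which is all this sketch uses.
    return 2 * d - 9 * (d // 5)


# No disagreement for real digits.
print([d for d in range(10) if doubled_ternary(d) != doubled_intdiv(d)])  # []

colon = ord(":") - 48  # 10
print(doubled_ternary(colon) + 9)  # 20, divisible by 10, so ":9" is accepted
print(doubled_intdiv(colon) + 9)   # 11, not divisible by 10, so ":9" is rejected
```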
I agree that we will need a stronger test suite to completely rule out this type of solution (for example, by randomly generating 1000 tests). I am only proposing a quick fix for now.
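As a rough illustration of the random-testing idea, the sketch below compares a straightforward reference validator against a Python port of the integer-division solution. All names are hypothetical, the seed is arbitrary, and the generator only injects ASCII letters so that every offset value stays non-negative (where Python's `//` and Java's `/` agree).

```python
import random
import string


def reference_is_valid(candidate):
    """Plain Luhn check that rejects any non-digit character."""
    number = candidate.replace(" ", "")
    if len(number) <= 1 or any(c not in string.digits for c in number):
        return False
    total = 0
    for i, c in enumerate(reversed(number)):
        d = int(c)
        if i % 2 == 1:  # every second digit from the right is doubled
            d = 2 * d - 9 if d >= 5 else 2 * d
        total += d
    return total % 10 == 0


def flawed_is_valid(candidate):
    """Port of the ASCII-offset solution, integer-division variant."""
    number = candidate.replace(" ", "")
    if number == "0":
        return False
    values = [ord(c) - 48 for c in number]
    total = sum(
        2 * v - 9 * (v // 5) if i % 2 == len(values) % 2 else v
        for i, v in enumerate(values)
    )
    return total % 10 == 0


def random_invalid_candidate(rng):
    """Random digit string with exactly one character replaced by a letter."""
    length = rng.randint(2, 12)
    chars = [rng.choice(string.digits) for _ in range(length)]
    chars[rng.randrange(length)] = rng.choice(string.ascii_letters)
    return "".join(chars)


rng = random.Random(1480)  # fixed seed so the run is repeatable
cases = [random_invalid_candidate(rng) for _ in range(1000)]
false_accepts = [c for c in cases if flawed_is_valid(c) != reference_is_valid(c)]
print(len(false_accepts), "of", len(cases), "random cases expose the flawed solution")
```

Every generated case is invalid by construction, so any disagreement is a string the flawed solution wrongly accepts.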

petertseng · 2019-03-16T20:17:04Z

That means the interesting thing about these two cases :9 and 055b 444 285 is whether the non-digit character occupies a doubled position (yes in the former, no in the latter). That implies it would be valuable to state this fact in both of their respective test descriptions. And perhaps bring them next to each other. Thus, I recommend this.
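The doubled versus non-doubled distinction can be checked mechanically. This is a small Python sketch with a hypothetical helper name, assuming the usual Luhn rule that every second character counting from the right is doubled:

```python
def doubled_flags(candidate):
    """Pair each non-space character with whether Luhn doubles its position."""
    number = candidate.replace(" ", "")
    # Counting from the right (offset 0 is the check digit), odd offsets are doubled.
    return [(c, (len(number) - 1 - i) % 2 == 1) for i, c in enumerate(number)]


for candidate in (":9", "055b 444 285"):
    non_digits = [(c, doubled) for c, doubled in doubled_flags(candidate) if not c.isdigit()]
    print(candidate, non_digits)
# :9 [(':', True)]
# 055b 444 285 [('b', False)]
```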

I won't stand in the way if the decision is instead to ignore this recommendation and merge this PR as it currently stands.

tqa236 · 2019-03-16T21:41:48Z

I agree that we should update the description to better describe the test. Do you have any suggestions? By the way, is it possible to add a comment in a test about the intent of that test? The intent is not easy to recognize just by looking at the file. If a wrong solution with a similar approach ever passes, the comment will tell us that the test suite needs to be made more rigorous.

petertseng · 2019-03-19T18:39:16Z

I agree that we should update the description to better describe the test, do you have any suggestions?

I have a hard time finding a short description for the cases. Even though it is not a requirement that it be short, #1198 seems to imply there are benefits.

I'll try:

Doubled non-digit isn't treated as its ascii value
Non-doubled non-digit isn't treated as its ascii value

Open question: do we need to mention "ascii value offset from the value of 0", or is "ascii value" sufficient?

is it possible to add a comment in a test about the intent of that test

it is possible. see an example in

problem-specifications/exercises/largest-series-product/canonical-data.json

Lines 107 to 120 in d28d6a5

    
      "comments": [
        "There may be some confusion about whether this should be 1 or error.",
        "The reasoning for it being 1 is this:",
        "There is one 0-character string contained in the empty string.",
        "That's the empty string itself.",
        "The empty product is 1 (the identity for multiplication).",
        "Therefore LSP('', 0) is 1.",
        "It's NOT the case that LSP('', 0) takes max of an empty list.",
        "So there is no error.",
        "Compare against LSP('123', 4):",
        "There are zero 4-character strings in '123'.",
        "So LSP('123', 4) really DOES take the max of an empty list.",
        "So LSP('123', 4) errors and LSP('', 0) does NOT."
      ],

see the formal spec in

problem-specifications/canonical-schema.json

Line 44 in d28d6a5

, "comments" : { "$ref": "#/definitions/comments" }

and

problem-specifications/canonical-schema.json

Lines 62 to 67 in d28d6a5

    
    "comments":
      { "description": "An array of strings to fake multi-line comments"
      , "type"       : "array"
      , "items"      : { "type": "string" }
      , "minItems"   : 1
      },
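As a quick illustration of what that schema fragment requires, here is a hand-rolled Python check (not a real JSON Schema validator) with a made-up test case. The function name and the sample comment text are hypothetical.

```python
def comments_are_well_formed(comments):
    """Check the three constraints quoted above: array, string items, minItems 1."""
    return (
        isinstance(comments, list)
        and len(comments) >= 1
        and all(isinstance(line, str) for line in comments)
    )


# Hypothetical test case: each array item stands in for one comment line.
case = {
    "comments": [
        "Hypothetical comment explaining the intent of the test.",
        "A second line of the same comment.",
    ],
    "description": "hypothetical test case",
}

print(comments_are_well_formed(case["comments"]))  # True
print(comments_are_well_formed([]))                # False, because minItems is 1
print(comments_are_well_formed("one string"))      # False, because it is not an array
```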

tqa236 · 2019-03-19T21:48:27Z

I updated the descriptions based on your suggestion. I also grouped them together to reflect the fact that they address the same problem.

petertseng · 2019-03-20T17:54:53Z

exercises/luhn/canonical-data.json

+        "Convert non-digits to their ascii value and then offset them by 48 sometimes accidently return the correct result.",
+        "This test is designed to avoid that solution."
+      ],
+      "description": "using ascii value for doubled non-digit isn't allow",


I think for grammar, allow should be allowed here, and in the other test case as well

petertseng · 2019-03-20T17:55:19Z

exercises/luhn/canonical-data.json

@@ -123,7 +115,23 @@
      "expected": true
    },
    {
-      "description": "strings with non-digits is invalid",
+      "comments": [
+        "Convert non-digits to their ascii value and then offset them by 48 sometimes accidently return the correct result.",


spelling accidently -> accidentally

petertseng · 2019-03-20T17:57:37Z

exercises/luhn/canonical-data.json

@@ -123,7 +115,23 @@
      "expected": true
    },
    {
-      "description": "strings with non-digits is invalid",
+      "comments": [
+        "Convert non-digits to their ascii value and then offset them by 48 sometimes accidently return the correct result.",


"accidentally return the correct result" might be easy to misinterpret, or require careful thinking before a reader can determine whether the sentence is true. Can I suggest "accidentally declare an invalid string to be valid" so that requires less brainpower? my feeble mind was having a hard time figuring things out

tqa236 · 2019-03-22T10:05:19Z

Thank you for the review. I made a new commit based on your suggestion.

petertseng

I think I have one last comment, that these two have been swapped.

I plan to approve after the two descriptions are swapped, or someone tells me I got mixed up. I do not plan to wait long after that to merge it. So, if other reviewers have some things to say, you should say them soon.

petertseng · 2019-03-26T05:38:33Z

exercises/luhn/canonical-data.json

+        "Convert non-digits to their ascii values and then offset them by 48 sometimes accidentally declare an invalid string to be valid.",
+        "This test is designed to avoid that solution."
+      ],
+      "description": "using ascii value for doubled non-digit isn't allowed",


can you double-check me on this one? I think in this case, the b is in a non-doubled position, and in the latter case the : is in a doubled position. So these two test case descriptions should be switched, right?

Or did I get them mixed up

tqa236 · 2019-03-26T12:28:22Z

You're correct. I fixed that in the new commit.

petertseng

this PR makes the purposes of these test cases clearer, a good improvement.

* luhn: Updated the exercise to the version 1.6.1

  Relevant PRs:
  - exercism/problem-specifications#1420
  - exercism/problem-specifications#1480
  - exercism/problem-specifications#1500
  - exercism/problem-specifications#1523

* luhn: Fixed the 'exercise' util generated comments in the test suite

Changes included:

1. exercism/problem-specifications#1480: 1.4.0 to 1.5.0: Change "055a 444 285" test into "055b 444 285" for subtle reasons. New test name: using ascii value for doubled non-digit isn't allowed
2. exercism/problem-specifications#1500: 1.5.0 to 1.6.0: Add test case: valid number with an odd number of spaces
3. exercism/problem-specifications#1635: 1.6.0 to 1.7.0: Add test case: invalid long number with an even remainder

tqa236 added 2 commits March 16, 2019 11:25

Merge pull request #1 from exercism/master

02e3245

Sync

Luhn: Update testcase to remove wrong solution

c3be565

coriolinus approved these changes Mar 16, 2019


Reorder, add description and comment to new tests

8aa7ee0

petertseng reviewed Mar 20, 2019


Fix typos and clarify descriptions

59fe477

petertseng reviewed Mar 26, 2019


Update canonical-data.json

7c7b56d

petertseng approved these changes Mar 27, 2019


petertseng merged commit 6aa07c9 into exercism:master Mar 27, 2019

This was referenced Mar 27, 2019

Luhn: Update test suite to latest version exercism/delphi#384

Closed

Luhn: update to v1.5.0 exercism/delphi#385

Merged

ZapAnton mentioned this pull request Nov 12, 2019

luhn: Updated the exercise to the version 1.6.1 exercism/rust#906

Merged

tejasbubane added a commit to tejasbubane/haskell that referenced this pull request Mar 14, 2020

Update luhn exercise to 1.7.0

f17fdd6

Changes included: 1. exercism/problem-specifications#1480 2. exercism/problem-specifications#1500 3. exercism/problem-specifications#1635

tejasbubane mentioned this pull request Mar 14, 2020

Luhn: Update tests to 1.7.0 (1 changed tests, 2 new tests) exercism/haskell#900

Merged
