Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

starlark: fix bugs in str.{title,capitalize} #3

Merged
merged 3 commits into from
Nov 2, 2018

Conversation

alandonovan
Copy link
Contributor

Behavior now matches the spec (and Python3).

  • Tests.

Fixes google/skylark#140

Behavior now matches the spec (and Python3).
+ Tests.

Fixes google/skylark#140

Change-Id: I02f1f9775f4b64095d5e2aeb33f39db94046b50e
@alandonovan alandonovan requested a review from jayconrod November 2, 2018 01:39
@alandonovan
Copy link
Contributor Author

@laurentlb

doc/spec.md Outdated

```python
"hello, world!".capitalize() # "Hello, World!"
"hElLo, wOrLd!".capitalize() # "Hello, World!"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the two comments look wrong (lowercase "w")

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well spotted! Thanks.

Copy link
Contributor

@laurentlb laurentlb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Behavior now matches the spec (and Python3).
+ Tests.

Fixes google/skylark#140

Change-Id: I02f1f9775f4b64095d5e2aeb33f39db94046b50e
@@ -3505,11 +3505,14 @@ See also: `string·elems`.
<a id='string·capitalize'></a>
### string·capitalize

`S.capitalize()` returns a copy of string S with all Unicode letters
that begin words changed to their title case.
`S.capitalize()` returns a copy of string S with its first code point
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Python says "first character" instead of "first code point". Is the difference intended?

In Python:

>>> s = '\uFB03'
>>> s
'ffi'
>>> s.capitalize()
'FFI'
>>> len(s.capitalize())
3

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry, this comment was too nitpicky. I don't think the Python behavior actually makes sense: given the wording, I'd expect it to return three characters: F f i.

I think having the spec say this applies to code points, not characters makes sense. It may be good to point out somewhere that this is different than Python though.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The Python function is even more underspecified than ours: it seems to attempt to uppercase (not capitalize) the decomposition of a lower case letter that has no corresponding uppercase letter. I don't want to go there.

Thanks for educating me about this particular naughty string. :)

…o string-capitalize

Change-Id: I6fe54df6cb48d46b5fa0fc194f2ef2276dd464e1
@alandonovan
Copy link
Contributor Author

Scratch the surface of any program that uses Unicode and you find bugs---I found some more. Please take another look.

@alandonovan alandonovan force-pushed the string-capitalize branch 3 times, most recently from 2c5c207 to 150f739 Compare November 2, 2018 17:42
@alandonovan alandonovan merged commit 4c43ff3 into master Nov 2, 2018
@alandonovan alandonovan deleted the string-capitalize branch November 2, 2018 18:50
andponlin-canva added a commit to andponlin-canva/starlark-go that referenced this pull request Jan 27, 2025
adonovan pushed a commit that referenced this pull request Jan 27, 2025
* implement set.update method

The `update` method on the `set` is defined
in the Starlark specification [1] but is not
present in the starlark-go implementation.
This change introduces this functionality.

[1] https://github.com/bazelbuild/starlark/blob/master/spec.md#setupdate

* implement set.update method

Fixes from @adonovan in PR

* implement set.update method

Fixes #2 from @adonovan in PR

* implement set.update method

Fixes #3 from @adonovan in PR
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants