Description
http://microformats.org/wiki/microformats2-parsing#parsing_a_u-_property
if there is a gotten value, return the normalized absolute URL of it, following the containing document's language's rules for resolving relative URLs
I looked at this in the context of microformats/mf2py#79, which is a crash due to the attempt to normalize the invalid URL http://www.southside.de]
. Obviously, crashing the parser is not good behavior. Feedback in IRC can be summarized as "if it is not a valid URL, just pass the raw value through". Given that further steps in the parsing allow for arbitrary values to be returned and the consumer thus has to be prepared to handle any of them anyways this seems acceptable, but I'd still like to see it clarified in the parsing documentation. (An alternative would be dropping the value entirely, but I'm not sure if this is not more surprising and as far as I know isn't done in any other case of mf2 parsing)
(Some background reading regarding URL parsing and normalization:
RFC3986 - Uniform Resource Identifier (URI): Generic Syntax and the WHATWG URL Standard both clearly describe the URL as invalid. The WHATWG spec explicitly describes parsing to "return failure" for invalid URLs)