Fixed typo in test vectors, clarified proquint and identity, added links to specs. #85

sg495 · 2021-10-20T11:01:00Z

Closes #76, closes #80, closes #83, closes #84.

I have edited the multibase proquint RFC to clarify how the full prefix works, and I have added an explicit reference to the RFC in the table (because the proquint data encoded by multibase is different from the data encoded by the original proquint spec). I have also added an explicit reference to the RFC for base8 in the table, because base8 according to the multibase RFC is significantly different from the base8 provided by other standard implementations (e.g. Python).

Closes multiformats#83

Closes multiformats#80

…tion Closes multiformats#76

Stebalien · 2021-10-28T01:26:00Z

rfcs/identity.md

+The multibase identity prefix is the character non-printable ASCII/UTF-8 character with codepoint 0x00. Note that this is different from the multibase prefix 0 listed for base2, which is the ASCII/UTF-8 character "0" with codepoint 0x30.
+
+
+## Encoding


This won't work with invalid utf8, unfortunately.

Really, 0x00 (or NUL) isn't a true multibase, that's why it's so hard to describe. It exists to distinghish between text and binary in a context where 0x0 means "everything following is binary".

Really, the right way to do this would be to use a raw/utf8/ascii/etc. multicodec to specify the encoding of the "following" data. E.g.:

0x55 ... - binary.

0xXX ... - utf8 where XX is some marker byte. Unfortunately, the BOM (0xFEFF) isn't a valid varint, so we'd probably need to pick another.

All this is saying... I'd rather just drop it entirely. I don't think we're actually using it anywhere (for real, at least).

Thoughts @vmx?

I'd be happy to remove the identity Multibase in case there are no objections. I think in the realm of Multibase it doesn't really makes sense as it is really about string encoding things.

Though things are sadly a bit complicated. Technically the identity Multibase is used in every DAG-CBOR encoded CID. The reason is that CID is specified with the Multibased prefix being part of the actual CID: https://github.com/multiformats/cid/tree/97ff4a329f04b70c1ab7255c62af48192146b025#how-does-it-work

Though talking with other folks, hardly anyone I know thinks of CIDs that way. I (and the people I've talked to) think of CIDs without the Multibase prefix and there there is of course Multibase prefixed CIDs.

To conclude, I'd remove the the identity Multibase and update the CID spec to reflect how CIDs are thought of today.

Added the Python module `multiformats` to the list of implementations.

bumblefudge · 2023-08-12T08:45:30Z

@vmx @Stebalien Was the rest of this PR other than the rfcs/identity.md valid, tho? It seemed to fix other typos that might still need fixing. Happy to cherry pick the rest of the PR and re-open it if so!

ben221199 · 2023-08-12T12:19:22Z

@bumblefudge I created a new pull request that fixes some of these typos. The identity is excluded from there.

sg495 added 5 commits October 20, 2021 12:00

Fix incorrect test bytestring in case_insensitivity.csv

246742f

Closes multiformats#84

Update README.md

dd8f0f5

Closes multiformats#83

Update Base2.md

bf71203

Closes multiformats#80

Added links to specs, created an explicit identity spec for clarifica…

b2cec76

…tion Closes multiformats#76

sg495 changed the title ~~Fix incorrect test bytestring in case_insensitivity.csv~~ Fixed typo in test vectors, clarified proquint and identity, added links to specs. Oct 20, 2021

Stebalien reviewed Oct 28, 2021

View reviewed changes

Update README.md

3a414e6

Added the Python module `multiformats` to the list of implementations.

sg495 marked this pull request as draft July 21, 2022 13:28

sg495 closed this Jul 21, 2022

ben221199 mentioned this pull request Aug 12, 2023

Fix typos #113

Merged

vmx mentioned this pull request Aug 16, 2023

Getting Multibase ready for IETF review (align with IANA terminology and registry governance style) #109

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed typo in test vectors, clarified proquint and identity, added links to specs. #85

Fixed typo in test vectors, clarified proquint and identity, added links to specs. #85

sg495 commented Oct 20, 2021 •

edited

Loading

Stebalien Oct 28, 2021

vmx Oct 28, 2021

bumblefudge commented Aug 12, 2023

ben221199 commented Aug 12, 2023

		The multibase identity prefix is the character non-printable ASCII/UTF-8 character with codepoint 0x00. Note that this is different from the multibase prefix 0 listed for base2, which is the ASCII/UTF-8 character "0" with codepoint 0x30.


		## Encoding

Fixed typo in test vectors, clarified proquint and identity, added links to specs. #85

Fixed typo in test vectors, clarified proquint and identity, added links to specs. #85

Conversation

sg495 commented Oct 20, 2021 • edited Loading

Stebalien Oct 28, 2021

Choose a reason for hiding this comment

vmx Oct 28, 2021

Choose a reason for hiding this comment

bumblefudge commented Aug 12, 2023

ben221199 commented Aug 12, 2023

sg495 commented Oct 20, 2021 •

edited

Loading