Possibly reduce the amount of hash functions used in the code #2

fjarri · 2020-07-28T19:58:33Z

In the Python implementation we have, in different places, SHA256, Blake2b and Keccak. In this implementation some occurrences of Blake2b are replaced with SHA3 temporarily because of digest length requirements (e.g. in hash_to_scalar()). Could we use a single hash function everywhere?

The text was updated successfully, but these errors were encountered:

tuxxy · 2020-08-12T08:49:21Z

Absolutely we can do this. Problematically, there's some places we may need to keep as keccak or sha256 to ensure compatibility with Ethereum, but that needs re-scoping.

I'm going to add the scoping label to this and leave the actionable item in this issue to identify which hash function calls are required for compatibility.

Be aware that we should likely also update pyUmbral at the same time, and subsequently release on nucypher to ensure compatibility. I'd put this at a relatively high priority should we want these changes before a mainnet release.

vepkenez · 2020-12-04T18:17:51Z

After I spent so much time finding a useable and compatible blake2b library for JS 😢😢😢😢

fjarri · 2021-03-02T17:39:32Z

From the discussion: use SHA-256 everywhere. Seems like unsafe_hash_to_point() can use it as well.

cygnusv · 2021-03-03T12:43:14Z

To be clear, we will use SHA-256 "everywhere", but in some cases we can't use it directly; for example, when hashing to a scalar, where we will use a more complex construction which internally uses SHA256 (but no other hashing function).

After reading the security discussion in the hash-to-curve standard, it's clear that SHA256 alone could introduce a small bias when used modulo p directly, so it needs to be "expanded".

For the particular case of hashing to a scalar, we need to use the hash_to_field algorithm described in the standard. Internally, and for the particular combination of SHA256 and secp256k1, this uses a construction called expand_message_xmd which outputs 48-byte digests. See here some test vectors for expand_message_xmd instantiated with SHA256.

So, in essence, every time that we were using hash_to_curvebn in pyumbral, we would use now hash_to_field from the standard. Note that hash_to_field natively supports domain separation tags (#38), so we don't need to add them in an ad-hoc manner as we were doing in pyumbral.

fjarri · 2021-03-03T20:12:38Z

Hm, I'm confused here. hash_to_field is not an exact analogue of PyUmbral's hash_to_curvebn. The former hashes to the domain of curve coordinates, the latter to the domain of scalars. So they do the same thing, true, but security requirements may be different.

What was hash_to_curvebn in PyUmbral is now ScalarDigest, and uses Scalar::from_digest() from the backend library (k256), which is implemented as modulo reduction of the digest without expansion. Is that a potential problem? If it is, I would prefer to fix it in k256 and not here (may not be even possible to fix it here if k256 does not expose enough internals).

If I understand the implementation of hash_to_curvebn() in PyUmbral, it does the same as what is done here:

the DST is added just as another input to the digest
the digest is taken modulo the order without any expansion

The only difference I noticed is that it is guaranteed to return a non-zero value. Is that important?

fjarri added enhancement New feature or request cryptography Needs attention of someone who knows what they're doing labels Aug 5, 2020

tuxxy added the scoping Scoping required for further action label Aug 12, 2020

This was referenced Mar 2, 2021

Binary compatibility with PyUmbral #27

Closed

Hash functions cleanup #36

Merged

fjarri removed the cryptography Needs attention of someone who knows what they're doing label Mar 3, 2021

fjarri mentioned this issue Mar 3, 2021

Get rid of exposed domain separation tags and use dedicated functions instead #38

Merged

fjarri closed this as completed in #36 Mar 6, 2021

cygnusv mentioned this issue Nov 18, 2022

Hash function selection for tpke nucypher/ferveo#16

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possibly reduce the amount of hash functions used in the code #2

Possibly reduce the amount of hash functions used in the code #2

fjarri commented Jul 28, 2020

tuxxy commented Aug 12, 2020

vepkenez commented Dec 4, 2020

fjarri commented Mar 2, 2021

cygnusv commented Mar 3, 2021

fjarri commented Mar 3, 2021

Possibly reduce the amount of hash functions used in the code #2

Possibly reduce the amount of hash functions used in the code #2

Comments

fjarri commented Jul 28, 2020

tuxxy commented Aug 12, 2020

vepkenez commented Dec 4, 2020

fjarri commented Mar 2, 2021

cygnusv commented Mar 3, 2021

fjarri commented Mar 3, 2021