Implement new default constr ID #262

nielstron · 2023-07-14T22:43:31Z

This is a possible implementation of a fix for #239

Properties

almost unique
- pretty much random value between 1-2**32
- based on class name, field names and field types
deterministic (sha256 defined behaviour)
overwritable (just an attribute)

nielstron · 2023-07-14T22:52:39Z

As suggested by @jkoppel we should maybe use constructor IDs up to 2^32 since the numbers up to this value are treated all the same by the plutus VM wrt costing. I suspect that the additional 3 bytes don't make much of a costing difference either when looking at script and inline datum size.

codecov-commenter · 2023-07-14T22:58:41Z

Codecov Report

Merging #262 (8b7d2d6) into main (a88d725) will increase coverage by 0.13%.
The diff coverage is 100.00%.

❗ Your organization is not using the GitHub App Integration. As a result you may experience degraded service beginning May 15th. Please install the Github App Integration for your organization. Read more.

@@            Coverage Diff             @@
##             main     #262      +/-   ##
==========================================
+ Coverage   85.01%   85.14%   +0.13%     
==========================================
  Files          26       26              
  Lines        2983     2996      +13     
  Branches      715      719       +4     
==========================================
+ Hits         2536     2551      +15     
+ Misses        336      335       -1     
+ Partials      111      110       -1

Files Changed	Coverage Δ
pycardano/plutus.py	`89.06% <100.00%> (+1.46%)`	⬆️

cffls

Elegant implementation!

cffls · 2023-07-17T09:40:55Z

pycardano/plutus.py

+        on class attributes, types and class name.
+        """
+        det_string = (
+            cls.__name__ + "*" + "*".join([f"{f.name}~{f.type}" for f in fields(cls)])


Does it make sense to sort the fields before hashing?

Personally, I'd like to see both

@dataclass class A(PlutusData): a: int b: bytes

and

@dataclass class A(PlutusData): b: bytes a: int

hash into the same digest.

Also, is there a way to cache this result? The calculation might be expensive.

Does it make sense to sort the fields before hashing?

No this doesnt make sense. The order of fields is relevant for plutusData! (think of how C objects have their fields arranged in memory, same applies to Plutus data how their fields are arranged in the internal list). In particular, the two classes you listed do not constitute the same plutus data

Okay makes sense, thanks for the explanation!

I played around with some trivial caching methods (i.e. overriding CONSTR_ID after initial call) but get some really weird errors in pytest (CONSTR_IDs are the same for different classes, but only if run in the suite, not when run individually)

I played around with some trivial caching methods (i.e. overriding CONSTR_ID after initial call) but get some really weird errors in pytest (CONSTR_IDs are the same for different classes, but only if run in the suite, not when run individually)

Will take a look at this issue.

This seems to work:

@classproperty def CONSTR_ID(cls): """ Constructor ID of this plutus data. It is primarily used by Plutus core to reconstruct a data structure from serialized CBOR bytes. The default implementation is an almost unique, deterministic constructor ID in the range 1 - 2^32 based on class attributes, types and class name. """ if not hasattr(cls, "_CONSTR_ID"): det_string = ( cls.__name__ + "*" + "*".join([f"{f.name}~{f.type}" for f in fields(cls)]) ) det_hash = sha256(det_string.encode("utf8")).hexdigest() setattr(cls, "_CONSTR_ID", int(det_hash, 16) % 2**32) return cls._CONSTR_ID

If you could grant me write access to https://github.com/OpShin/pycardano.git, I can push this change. Also, I think there is an option to allow others to push changes to your PR when creating it.

Cool! I have given you write access

cffls · 2023-07-23T06:30:05Z

pycardano/plutus.py

+        """
+        if not hasattr(cls, "_CONSTR_ID"):
+            det_string = (
+                cls.__name__ + "*" + "*".join([f"{f.name}~{f.type}" for f in fields(cls)])


Some fields are metadata, e.g. ClassVar, which is not serialized to cbor. I think we should try to exclude those fields.

Maybe you can recycle the logic from the to_cbor or to_primitive function? these would exactly describe which parts are not serialized if I understand correctly?

It's a bit tricky to reuse to_cbor or to_primitive here because they are not class methods. We can check if the type is in primitive list (recursively).

As of now, all fields are checked and serialized. We can revisit this when field exclusion is enabled in the future.

…STR_ID

nielstron · 2023-07-29T07:40:25Z

pycardano/plutus.py

-    CONSTR_ID: ClassVar[int] = 0
-    """Constructor ID of this plutus data.
-       It is primarily used by Plutus core to reconstruct a data structure from serialized CBOR bytes."""
+    AUTO_ID: ClassVar[bool] = False


I would honestly much prefer if AUTO ID would default to true.

When writing a smart contract and defining classes, setting the constr id manually requires more expertise than automatically having the unique constructor. If you think about it, there is no reason you should even know about constructor IDs when writing smart contracts, you should be able to assume that classes are distinct and distinguishable. So setting the constructor manually should be the non-default, reserved for experts that know what they are doing.

When you are using pycardano to model some existing contract written in Plutus, you definitely should be an expert and manually specify the constructor id, as you are working across different language implementations.

This is a breaking change, but I think it would much benefit the usability of pycardano.

Making AUTO ID default to True would then make the entire flag superfluous - when you are setting it to false manually you can just as well manually specify the constructor id.

Makes sense to enable auto id by default as you explained. Added auto id because I wanted to make it a non-breaking change. I will revert this commit.

This reverts commit 787e160.

nielstron · 2023-08-29T14:01:59Z

@cffls any update on this?

cffls · 2023-08-30T00:21:56Z

Added a comment regarding field exclusion. The PR looks good to me. If you don't have more to add, I will merge it.

nielstron · 2023-08-30T06:03:11Z

It looks good to me as well :)

cffls reviewed Jul 17, 2023

View reviewed changes

nielstron linked an issue Jul 22, 2023 that may be closed by this pull request

Automatic unique CONSTR_IDs #239

Closed

cffls force-pushed the feat/default_unique_constr branch from 71b54a9 to 59a64ee Compare July 23, 2023 06:30

cffls reviewed Jul 23, 2023

View reviewed changes

nielstron and others added 11 commits July 24, 2023 00:32

Implement new default constr ID

7f3b083

Formatting

70a984c

Fix plutus data hash

ef449e2

Introduce Unit default empty constructor

55b90e8

Formatting much

6a6425d

Add test for constructor id uniqueness

586141f

Add test for determinism of constructor id

5bb4a7a

Remove unused imports

308343c

Fix integration test

955cd7a

Cache CONSTR_ID

9bcea9b

Avoid using _CONSTR_ID from parent class by adding class name to _CON…

c4e06d6

…STR_ID

cffls force-pushed the feat/default_unique_constr branch from 6142794 to c4e06d6 Compare July 23, 2023 16:33

Add a flag to enable/disable auto CONSTR_ID

787e160

nielstron commented Jul 29, 2023

View reviewed changes

Revert "Add a flag to enable/disable auto CONSTR_ID"

8b7d2d6

This reverts commit 787e160.

cffls approved these changes Aug 31, 2023

View reviewed changes

cffls merged commit 0d5b3d7 into Python-Cardano:main Aug 31, 2023

Uh oh!

Implement new default constr ID #262

Implement new default constr ID #262

Uh oh!

Conversation

nielstron commented Jul 14, 2023

Uh oh!

nielstron commented Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Jul 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

cffls left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cffls Jul 31, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nielstron commented Aug 29, 2023

Uh oh!

cffls commented Aug 30, 2023

Uh oh!

nielstron commented Aug 30, 2023

Uh oh!

Uh oh!

nielstron commented Jul 14, 2023 •

edited

Loading

codecov-commenter commented Jul 14, 2023 •

edited

Loading

cffls Jul 31, 2023 •

edited

Loading