Standardize chain.Action.Marshal function #1198

containerman17 · 2024-07-25T03:23:59Z

As we aim to reduce the amount of code in VMs by following a "Sane Defaults" approach, I propose removing the need for explicit Marshal/Unmarshal functions in actions. Instead, we should use automatic marshalling by default, with an option for easy overrides.

~~Currently, manually marshalling 100,000 actions takes 40ms. Under this PR, it would take 94ms. An additional 54ms per 100k transactions seems acceptable.~~

Implement JavaScript counterpart of marshal/unmarshal
Allow overriding Marshal/Unmarshal
Integrate it into hypersdk
~~Explore a further speed up with go:generate~~
~~Check if Fuzz tests are working and required~~

containerman17 · 2024-08-01T01:47:24Z

Benchmark Results

TL;DR

It is double or even triple slower, but no one cares because marshal-unmarshal takes an insignificant portion of time at 100k TPS.

Transfer Action

The current manual approach takes 13ms per 100k cycles when single-threaded and 5ms when using 8 threads. The proposed reflection-based solution takes 44ms for single-threaded and 15ms for 8-threaded cycles. 15ms for 100k full cycles! Even on an 8-core machine (and potentially 16-32 cores), it is fast enough.

A Struct with 100 Inner Structs

For an exaggerated structure with an array of 100 inner structs, the 8-threaded result for the reflection-based approach takes 1067ms for 100k cycles, while the manual approach takes 535ms.

A Megabyte []byte Array

For an array with a million byte elements over 100k iterations, the reflection approach takes 13084ms on 8 cores, and the manual approach takes 14082ms. I commented out this test as there is no significant difference when comparing a single-field struct.

Raw results

// goos: linux
// goarch: amd64
// pkg: github.com/ava-labs/hypersdk/chain
// cpu: AMD EPYC 7763 64-Core Processor
// BenchmarkMarshalUnmarshal/Transfer-Reflection-1-8                     25          44908275 ns/op        35200248 B/op     500002 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Reflection-2-8                     39          28402783 ns/op        35200235 B/op     500003 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Reflection-4-8                     52          19412363 ns/op        35200397 B/op     500006 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Reflection-8-8                     67          15940310 ns/op        35201158 B/op     500012 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Manual-1-8                         79          13639643 ns/op        14400048 B/op     200002 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Manual-2-8                        138           8520965 ns/op        14400173 B/op     200003 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Manual-4-8                        190           6174020 ns/op        14400186 B/op     200005 allocs/op
// BenchmarkMarshalUnmarshal/Transfer-Manual-8-8                        230           5177846 ns/op        14400366 B/op     200009 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Reflection-1-8                       8         131652394 ns/op        62400150 B/op    1200002 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Reflection-2-8                      15          79716490 ns/op        62400238 B/op    1200003 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Reflection-4-8                      22          53110410 ns/op        62400367 B/op    1200005 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Reflection-8-8                      27          42143834 ns/op        62400900 B/op    1200010 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Manual-1-8                          19          57420578 ns/op        40800108 B/op     900002 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Manual-2-8                          33          33921873 ns/op        40800186 B/op     900003 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Manual-4-8                          51          22917034 ns/op        40800419 B/op     900005 allocs/op
// BenchmarkMarshalUnmarshal/Complex-Manual-8-8                          60          18573280 ns/op        40800467 B/op     900009 allocs/op
// PASS
// ok      github.com/ava-labs/hypersdk/chain      20.763s

Transfer struct setup

type Transfer struct {
	To    codec.Address `json:"to"`
	Value uint64        `json:"value"`
	Memo  []byte        `json:"memo"`
}

transfer := Transfer{
	To:    codec.Address{1, 2, 3, 4, 5, 6, 7, 8, 9},
	Value: 12876198273671286,
	Memo:  []byte("Hello World"),
}

Complex struct setup

type InnerStruct struct {
	Field1 int32
	Field2 string
	Field3 bool
	Field5 []byte
}

type TestStruct struct {
	Uint64Field uint64
	StringField string
	BytesField  []byte
	IntField    int
	BoolField   bool
	Uint16Field uint16
	Int8Field   int8
	InnerField  []InnerStruct
}

test := TestStruct{
	Uint64Field: 42,
	StringField: "Hello, World!",
	BytesField:  []byte{1, 2, 3, 4, 5},
	IntField:    -100,
	BoolField:   true,
	Uint16Field: 65535,
	Int8Field:   -128,
	InnerField: func() []InnerStruct {
		inner := make([]InnerStruct, 100)
		for i := 0; i < 100; i++ {
			inner[i] = InnerStruct{
				Field1: int32(i),
				Field2: fmt.Sprintf("Inner string %d", i),
				Field3: i%2 == 0,
				Field5: []byte{byte(i), byte(i + 1), byte(i + 2), byte(i + 3), byte(i + 4)},
			}
		}
		return inner
	}(),
}

All of this is time per 100k cycles, not for a single unmarshal!

containerman17 · 2024-08-01T03:20:45Z

func (p *Packer) PackInt(v int) {
	p.p.PackInt(uint32(v))
}

func (p *Packer) UnpackInt(required bool) int {
	v := p.p.UnpackInt()
	if required && v == 0 {
		p.addErr(fmt.Errorf("%w: Int field is not populated", ErrFieldNotPopulated))
	}
	return int(v)
}

I clearly restricted it to int32. As the previous implementation assumed that we are working on a 32-bit machine. Here is the new implementation:

// PackInt now accepts uint32 to ensure full support for 64-bit machines
func (p *Packer) PackInt(v uint32) {
	p.p.PackInt(v)
}

func (p *Packer) UnpackInt(required bool) uint32 {
	v := p.p.UnpackInt()
	if required && v == 0 {
		p.addErr(fmt.Errorf("%w: Int field is not populated", ErrFieldNotPopulated))
	}
	return v
}

Here is a test showing the bug:

func TestPackInt(t *testing.T) {
	require := require.New(t)
	wp := NewWriter(5, 5)
	wp.PackIntOld(math.MaxInt64)
	require.NoError(wp.Err(), "Error packing int.")

	rp := NewReader(wp.Bytes(), 5)
	unpackedVal := rp.UnpackIntOld(true)
	require.NoError(rp.Err(), "Error unpacking int.")

	require.EqualValues(math.MaxInt64, unpackedVal, "Unpacked value does not match packed value.")
}
// --- FAIL: TestPackInt (0.00s)
//     packer_test.go:134: 
//                 Error Trace:    /workspaces/hypersdk/codec/packer_test.go:134
//                 Error:          Not equal: 
//                                 expected: 9223372036854775807
//                                 actual  : 4294967295
//                 Test:           TestPackInt
//                 Messages:       Unpacked value does not match packed value.

containerman17 · 2024-08-29T07:09:48Z

I redid the PR today and minimized the number of changes.

chain/dependencies.go

codec/packer.go

aaronbuchwald · 2024-08-22T01:35:17Z

codec/packer.go

+// Deprecated: Use PackBytes for better performance.
 func (p *Packer) PackString(s string) {
 	p.p.PackStr(s)
 }

+// Deprecated: Use UnpackBytes for better performance.


Why mark these as deprecated?

Unpacking strings is very slow, much slower than working with the same data in bytes. It should never be used in on-chain operations. However, it is used in the WebSocket server, where performance is not a critical concern.

chain/dependencies.go

codec/type_parser.go

examples/morpheusvm/registry/registry.go

chain/actions.go

chain/actions_test.go

chain/transaction_test.go

Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com>

chain/actions_test.go

aaronbuchwald

LGTM with two nits that we should include as code quality improvements in the tests

* naive first marshalling attemmpt * int 8-int64 supported * negative numbers support * support maps * funky benchmark * structs reflection caching attempt * some fuzz tests * fix speed comment * relocate implementation out of test * benchmark * rename test to TestMakeSureMarshalUnmarshalIsNotTooSlow * fix fuzz test * spec tests for js implant * update spec tests * remove fuzz test * simplify to 2 types * restore original logic * rewrite marshal with avalanchego's wrappers.Packer * pack bytes with uint32 and everything else with uint16 * check for long arrays and strings, marshal maps with uint16 * update benchmark results * move to codec * come back to codec.packer * speed up TestMakeSureMarshalUnmarshalIsNotTooSlow a bit * support pointer to a struct * auto marshaller integration * lint * remove a slow test breaking CI * fix linter errors * faster reflection cache * minimize test to exclude testing errors * unsafe type caching * simplify benchmark * update benchmarks * add benchmark results * add benchmem * add benchmem results * deprecate string operations * move empty address error * empty file * lint * remove .prof * correct 'marshall' to 'marshal' according to Go conventions * simplify codec.Packer * get chainid from tmpnet instead of the platform (#1458) * lint * change to linearcodec * lint * go mod tidy * add serialize tag to Transfer * abi generation * spec tests * ABI in RPC * HasTypeID as a separate iface * add TypeParser.GetRegisteredTypes method * move ABI to core API * remove unused errors * auto size calculation * rename LinearCodecInstance * lint * update from #1198 * from clean slate * return abi logic * nit: remove function name in panics * transaction test nit * catch up with main * lint * rename HasTypeID to Typed * separate package for abi * restore spec tests * treat codec.Address as a byte array while serializing * comments and nits * use set.Set instead of map[reflect.Type]bool * lint * remove memo field * require serialize=true * remove a breaker * calculate ABI in place * use ABI as struct in implementation * stable ABI hash * ABI wants its own ABI * rename abi to vmabi * remove ABI for ABI * redo map as an array * clean up test specs a bit * basic codegen and refactor tests WIP * trying different naming * spec simplification WIP * go generate * lower case json * proper codegen test * further simplify spec tests * transfer test * file based spec test * simplify tests * ABI of ABI * Outer/Inner struct tests for TS debug * lint * check ABI * lost in merge * remove comment lines in abi test * remove a debug statement * move mock abi file * rename to abigen * use cobra * Update codec/address.go Co-authored-by: aaronbuchwald <aaron.buchwald56@gmail.com> Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com> * require that the " characters in address string * rename ABI-related stuff * fix tests after renaming * test full marshal cycle * TestDescribeVM * Update abi/codegen.go Co-authored-by: aaronbuchwald <aaron.buchwald56@gmail.com> Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com> * remove StringAsBytes * add unicode package * lint * go mod tidy * flatten types def in abi * don't use mixed receivers * inline vm.Hash into a test * rename abi.VM to abi.ABI * remove embed * funish renaming * get rid of vm.getabi * nit avoid redundant import alias * DescribeVM -> NewABI * lint * mock gen * share typesAlreadyProcessed across describing multiple actions * put a comment on each test * Update abi/auto_marshal_abi_spec_test.go Co-authored-by: aaronbuchwald <aaron.buchwald56@gmail.com> Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com> * nit: objectBytes * remove mustPrintOrderedJSON * comment on empty names * comment on IsUpper * revert typealias * TODO here to switch to the new address format * We should follow the style of funcName does X * use rune in cobra * comment on Dereference * comment on serialize tag * use %s and t instead of t.String() * readme * rename mockabi_test * lint --------- Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com> Co-authored-by: aaronbuchwald <aaron.buchwald56@gmail.com>

containerman17 added 15 commits July 24, 2024 02:35

naive first marshalling attemmpt

92e08ec

int 8-int64 supported

6a3d006

negative numbers support

5618d6c

support maps

9fbd36c

funky benchmark

9b0c5c2

structs reflection caching attempt

0f4d1c6

some fuzz tests

1c5b8a2

fix speed comment

fe39c66

relocate implementation out of test

66c86f8

benchmark

02c3643

rename test to TestMakeSureMarshalUnmarshalIsNotTooSlow

8fb31da

fix fuzz test

acb7e45

spec tests for js implant

529d1c4

update spec tests

1a38839

Merge branch 'main' into containerman/standardize-marshal-function

bfa9aea

containerman17 self-assigned this Jul 25, 2024

containerman17 added 8 commits July 31, 2024 16:26

Merge branch 'main' into containerman/standardize-marshal-function

55e9bcd

remove fuzz test

d31dc72

simplify to 2 types

5c08120

restore original logic

d7ec6e4

rewrite marshal with avalanchego's wrappers.Packer

103db88

pack bytes with uint32 and everything else with uint16

2df75a4

check for long arrays and strings, marshal maps with uint16

c2c0e64

Merge branch 'main' into containerman/standardize-marshal-function

3279d3d

containerman17 added 3 commits August 1, 2024 01:58

update benchmark results

0d62a73

move to codec

1229c35

come back to codec.packer

917335b

speed up TestMakeSureMarshalUnmarshalIsNotTooSlow a bit

c7b49bd

containerman17 mentioned this pull request Aug 29, 2024

Should Automatic Marshaling Remove or Retain Size, Marshal, and Unmarshal in the Action Interface? #1473

Closed

containerman17 added 3 commits August 29, 2024 05:23

add GetRegisteredTypes fun to registry

00c6d62

bring back optional interface for marshal, add optional iface for size

b924105

automatically generate unmarshal function

cfd6bfc

containerman17 added a commit that referenced this pull request Aug 29, 2024

update from #1198

cb157e0

aaronbuchwald requested changes Aug 29, 2024

View reviewed changes

RodrigoVillar reviewed Aug 29, 2024

View reviewed changes

chain/transaction_test.go Outdated Show resolved Hide resolved

chain/transaction_test.go Outdated Show resolved Hide resolved

containerman17 added 11 commits August 29, 2024 23:22

tx test nit

88c317a

single Marshaler iface

619e9a0

nit: comment

80631b6

keep chain.Marshaler implementation uncommented

61c56bb

rename HasTypeID to Typed

8c7e53b

nit: constants in action test

292f815

use actions.UnmarshalTransfer

559fe74

Merge branch 'main' into containerman/standardize-marshal-function

9badd2f

Signed-off-by: containerman17 <8990432+containerman17@users.noreply.github.com>

lint

94ae55d

take interface in getActionSize and marshalActionInto funcs

c6de833

clean up actions_test

855145f

aaronbuchwald reviewed Aug 30, 2024

View reviewed changes

chain/actions_test.go Outdated Show resolved Hide resolved

aaronbuchwald reviewed Aug 30, 2024

View reviewed changes

chain/actions_test.go Outdated Show resolved Hide resolved

containerman17 marked this pull request as ready for review August 30, 2024 02:41

aaronbuchwald previously approved these changes Aug 30, 2024

View reviewed changes

use consts for mockObjectSize and mockBytes in actions_test

71d8825

containerman17 dismissed aaronbuchwald’s stale review via 71d8825 August 30, 2024 02:48

containerman17 added 2 commits August 30, 2024 02:51

mockObjectSize as var

f08be88

make mockObjectSize local var

14a21d0

aaronbuchwald approved these changes Aug 30, 2024

View reviewed changes

containerman17 merged commit 7ce0081 into main Aug 30, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardize chain.Action.Marshal function #1198

Standardize chain.Action.Marshal function #1198

containerman17 commented Jul 25, 2024 •

edited

Loading

containerman17 commented Aug 1, 2024 •

edited

Loading

containerman17 commented Aug 1, 2024 •

edited

Loading

containerman17 commented Aug 29, 2024

aaronbuchwald Aug 22, 2024

containerman17 Aug 30, 2024

aaronbuchwald left a comment

Standardize chain.Action.Marshal function #1198

Standardize chain.Action.Marshal function #1198

Conversation

containerman17 commented Jul 25, 2024 • edited Loading

containerman17 commented Aug 1, 2024 • edited Loading

Benchmark Results

TL;DR

Transfer Action

A Struct with 100 Inner Structs

A Megabyte []byte Array

Raw results

Transfer struct setup

Complex struct setup

containerman17 commented Aug 1, 2024 • edited Loading

containerman17 commented Aug 29, 2024

aaronbuchwald Aug 22, 2024

Choose a reason for hiding this comment

containerman17 Aug 30, 2024

Choose a reason for hiding this comment

aaronbuchwald left a comment

Choose a reason for hiding this comment

containerman17 commented Jul 25, 2024 •

edited

Loading

containerman17 commented Aug 1, 2024 •

edited

Loading

containerman17 commented Aug 1, 2024 •

edited

Loading