cmd/compile: read-only escape analysis and avoiding string -> []byte copies #2205

bradfitz · 2011-08-29T22:35:43Z

Many functions take a []byte but only read from it.

If the escape analysis code could flag parameters of a function as read-only, then code
which passes in a []byte(string) conversion could be cheaper.

bytes.IndexByte is one example.

For example, http/sniff.go does:

// -------------------
  // Index of the first non-whitespace byte in data.                                                                                               
  firstNonWS := 0
  for ; firstNonWS < len(data) && isWS(data[firstNonWS]); firstNonWS++ {
  }

func isWS(b byte) bool {
        return bytes.IndexByte([]byte("\t\n\x0C\n "), b) != -1
}

// -------------------

But it's faster to avoid re-creating the []byte:

func BenchmarkIndexByteLiteral(b *testing.B) {
        for i := 0; i < b.N; i++ {
        IndexByte([]byte("\t\n\x0C\n "), 'x')
        }
}

func BenchmarkIndexByteVariable(b *testing.B) {
        var whitespace = []byte("\t\n\x0C\n ")
        for i := 0; i < b.N; i++ {
                IndexByte(whitespace, 'x')
        }
}

bytes_test.BenchmarkIndexByteLiteral    20000000           125 ns/op
bytes_test.BenchmarkIndexByteVariable   100000000           25.4 ns/op

Related is issue #2204.

gopherbot · 2011-08-30T21:38:29Z

Comment 1 by jp@webmaster.ms:

It seem to be much easier to add the possibility of zerocopish taking string slices from
byte slices/arrays.
As far I understand, string slices are actually readonly and cap()'less byte slices:
http://research.swtch.com/2009/11/go-data-structures.html

bradfitz · 2011-08-30T21:43:08Z

Comment 2:

zerocopish?

rsc · 2011-08-30T21:43:18Z

Comment 3:

Can you be more specific?
Examples of programs that you think should be
handled specially would be a good way to do that.

rsc · 2011-09-15T00:23:55Z

Comment 4:

Owner changed to @rsc.

Status changed to Accepted.

lvdlvd · 2011-11-07T16:36:42Z

Comment 5:

Labels changed: added compilerbug, performance.

rsc · 2011-12-09T19:45:42Z

Comment 6:

Labels changed: added priority-later.

rsc · 2012-09-12T21:42:49Z

Comment 8:

Labels changed: added go1.1maybe.

bradfitz · 2012-10-25T14:54:57Z

Comment 9:

I run into this often, but I should start listing examples.
In goprotobuf, text.go calls writeString with both a string and a []byte converted to a
string:
// writeAny writes an arbitrary field.
func writeAny(w *textWriter, v reflect.Value, props *Properties) {
        v = reflect.Indirect(v)
        // We don't attempt to serialise every possible value type; only those
        // that can occur in protocol buffers, plus a few extra that were easy.
        switch v.Kind() {
        case reflect.Slice:
                // Should only be a []byte; repeated fields are handled in writeStruct.
                writeString(w, string(v.Interface().([]byte)))
        case reflect.String:
                writeString(w, v.String())
Note that the function writeString reallly just wants a read-only slice of bytes:
func writeString(w *textWriter, s string) {
        w.WriteByte('"')
        // Loop over the bytes, not the runes.
        for i := 0; i < len(s); i++ {
                // Divergence from C++: we don't escape apostrophes.
                // There's no need to escape them, and the C++ parser
                // copes with a naked apostrophe.
                switch c := s[i]; c {
                case '\n':
                        w.Write([]byte{'\\', 'n'})
                case '\r':
                        w.Write([]byte{'\\', 'r'})
                case '\t':
                        w.Write([]byte{'\\', 't'})
                case '"':
                        w.Write([]byte{'\\', '"'})
                case '\\':
                        w.Write([]byte{'\\', '\\'})
                default:
                        if isprint(c) {
                                w.WriteByte(c)
                        } else {
                                fmt.Fprintf(w, "\\%03o", c)
                        }
                }
        }
        w.WriteByte('"')
}
It doesn't matter that it's frozen (like a string), nor writable (like a []byte).  But
Go lacks that type, so if instead it'd be nice to write writeAny with a []byte parameter
and invert the switch above to be like:
        switch v.Kind() {
        case reflect.Slice:
                // Should only be a []byte; repeated fields are handled in writeStruct.
                writeString(w, v.Interface().([]byte))
        case reflect.String:
                writeString(w, []byte(v.String())) // no copy!
Where the []byte(v.String()) just makes a slice header pointing in to the string's
memory, since the compiler can verify that writeAny never mutates its slice.

bradfitz · 2012-11-17T17:44:37Z

Comment 10:

See patch and CL description at http://golang.org/cl/6850067 for the opposite
but very related case: strconv.ParseUint, ParseBool, etc take a string but calling code
has a []byte.

robpike · 2013-03-07T22:10:03Z

Comment 11:

Labels changed: removed go1.1maybe.

rsc · 2013-03-12T20:18:47Z

Comment 12:

[The time for maybe has passed.]

bradfitz · 2013-03-31T19:42:51Z

Comment 13:

People who like this bug also like issue #3512 (cmd/gc: optimized map[string] lookup from
[]byte key)

dvyukov · 2013-03-31T19:57:58Z

Comment 14:

FWIW
var m map[string]int
var b []byte
_ = m[string(b)]
case must be radically simpler to implement than general read-only analysis.
It's peephole optimization, when the compiler sees such code it can generate hacky
string object using the []byte pointer.
Another example would be len(string(b)), but this seems useless.

bradfitz · 2013-03-31T19:59:38Z

Comment 15:

Yes. That's why they're separate bugs.
I want issue #3512 first, because it's easy.

bradfitz · 2013-04-26T03:42:14Z

Comment 16:

Labels changed: added garbage.

rsc · 2013-07-30T14:26:28Z

Comment 18:

Labels changed: added priority-someday, removed priority-later.

rsc · 2013-12-04T01:43:48Z

Comment 19:

Labels changed: added repo-main.

rsc · 2014-03-03T20:46:13Z

Comment 20:

Adding Release=None to all Priority=Someday bugs.

Labels changed: added release-none.

jfcg · 2022-03-25T11:26:29Z

In any case, this issue is for the conversion the other way, string -> []byte.

Simpler example (tested with Go 1.17.8 / 1.18):
pkg.go

package pkg

// ToLower converts ascii string s to lower-case
func ToLower(s string) string {
	if s == "" {
		return ""
	}
	buf := make([]byte, len(s))

	for i := 0; i < len(s); i++ {
		c := s[i]
		if 'A' <= c && c <= 'Z' {
			c |= 32
		}
		buf[i] = c
	}
	return string(buf)
}

pkg_test.go

package pkg

import "testing"

func BenchmarkToLower(b *testing.B) {
	str := "SomE StrInG"
	want := "some string"
	var res string

	for i := 0; i < b.N; i++ {
		res = ToLower(str)
	}
	if res != want {
		b.Fatal("ToLower error")
	}
}

go test -v -bench ToLower -benchmem -count=3 prints

BenchmarkToLower-4      18710726                66.86 ns/op           32 B/op          2 allocs/op
BenchmarkToLower-4      17420910                65.57 ns/op           32 B/op          2 allocs/op
BenchmarkToLower-4      17085405                63.16 ns/op           32 B/op          2 allocs/op

There is a redundant allocation & copy in return string(buf). It looks too trivial to me that I am surprised Go compiler cannot handle this.

gopherbot · 2023-08-16T22:38:01Z

Change https://go.dev/cl/520259 mentions this issue: cmd/compile/internal/escape: optimize indirect closure calls

gopherbot · 2023-08-17T06:02:34Z

Change https://go.dev/cl/520395 mentions this issue: cmd/compile: enable zero-copy string->[]byte conversions

This CL extends escape analysis in two ways. First, we already optimize directly called closures. For example, given: var x int // already stack allocated today p := func() *int { return &x }() we don't need to move x to the heap, because we can statically track where &x flows. This CL extends the same idea to work for indirectly called closures too, as long as we know everywhere that they're called. For example: var x int // stack allocated after this CL f := func() *int { return &x } p := f() This will allow a subsequent CL to move the generation of go/defer wrappers earlier. Second, this CL adds tracking to detect when pointer values flow to the pointee operand of an indirect assignment statement (i.e., flows to p in "*p = x") or to builtins that modify memory (append, copy, clear). This isn't utilized in the current CL, but a subsequent CL will make use of it to better optimize string->[]byte conversions. Updates #2205. Change-Id: I610f9c531e135129c947684833e288ce64406f35 Reviewed-on: https://go-review.googlesource.com/c/go/+/520259 Run-TryBot: Matthew Dempsky <mdempsky@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> Auto-Submit: Matthew Dempsky <mdempsky@google.com> Reviewed-by: Cuong Manh Le <cuong.manhle.vn@gmail.com> Reviewed-by: Dmitri Shuralyov <dmitshur@google.com>

dmitshur · 2023-08-17T20:16:36Z

CL 520395 that resolved this issue was reverted in CL 520596, so reopening.

gopherbot · 2023-08-18T00:05:58Z

Change https://go.dev/cl/520599 mentions this issue: cmd/compile: restore zero-copy string->[]byte optimization

gopherbot · 2023-08-18T00:05:59Z

Change https://go.dev/cl/520600 mentions this issue: cmd/compile: enable -d=zerocopy by default

jfcg · 2023-08-18T03:33:38Z

Hi,
These changes address redundant copying for string <--> []byte conversions both ways, right?

mdempsky · 2023-08-18T05:01:11Z

@jfcg No, at this time only string->[]byte.

cespare · 2023-08-18T05:16:17Z

I believe that #43752 discusses at least some of the []byte->string cases.

(@mdempsky it's exciting that this 4-digit issue is so close to being solved!)

This CL implements the remainder of the zero-copy string->[]byte conversion optimization initially attempted in go.dev/cl/520395, but fixes the tracking of mutations due to ODEREF/ODOTPTR assignments, and adds more comprehensive tests that I should have included originally. However, this CL also keeps it behind the -d=zerocopy flag. The next CL will enable it by default (for easier rollback). Updates #2205. Change-Id: Ic330260099ead27fc00e2680a59c6ff23cb63c2b Reviewed-on: https://go-review.googlesource.com/c/go/+/520599 Auto-Submit: Matthew Dempsky <mdempsky@google.com> TryBot-Result: Gopher Robot <gobot@golang.org> Reviewed-by: Than McIntosh <thanm@google.com> Run-TryBot: Matthew Dempsky <mdempsky@google.com> Reviewed-by: Cuong Manh Le <cuong.manhle.vn@gmail.com>

zigo101 · 2023-11-28T07:36:42Z

BTW, should the old optimization for for range []byte(aString) be unified into the new optimization? Maybe it has been?

gopherbot · 2024-05-08T17:28:29Z

Change https://go.dev/cl/584118 mentions this issue: encoding/hex: don't overallocate memory in DecodeString

Now as []byte(string) doesn't always cause heap allocation (CL 520599, #2205) we can make DecodeString simpler and more performant, by not allocating x2 the required memory. goos: linux goarch: amd64 pkg: encoding/hex cpu: AMD Ryzen 5 4600G with Radeon Graphics │ beforehex │ afterhex │ │ sec/op │ sec/op vs base │ DecodeString/256-12 197.9n ± 1% 172.2n ± 1% -13.01% (p=0.000 n=10) DecodeString/1024-12 684.9n ± 1% 598.5n ± 1% -12.61% (p=0.000 n=10) DecodeString/4096-12 2.764µ ± 0% 2.343µ ± 1% -15.23% (p=0.000 n=10) DecodeString/16384-12 10.774µ ± 1% 9.348µ ± 1% -13.23% (p=0.000 n=10) geomean 1.417µ 1.226µ -13.53% │ beforehex │ afterhex │ │ B/s │ B/s vs base │ DecodeString/256-12 1.205Gi ± 1% 1.385Gi ± 1% +14.94% (p=0.000 n=10) DecodeString/1024-12 1.393Gi ± 1% 1.593Gi ± 1% +14.42% (p=0.000 n=10) DecodeString/4096-12 1.380Gi ± 0% 1.628Gi ± 1% +17.97% (p=0.000 n=10) DecodeString/16384-12 1.416Gi ± 1% 1.632Gi ± 1% +15.25% (p=0.000 n=10) geomean 1.346Gi 1.556Gi +15.64% │ beforehex │ afterhex │ │ B/op │ B/op vs base │ DecodeString/256-12 256.0 ± 0% 128.0 ± 0% -50.00% (p=0.000 n=10) DecodeString/1024-12 1024.0 ± 0% 512.0 ± 0% -50.00% (p=0.000 n=10) DecodeString/4096-12 4.000Ki ± 0% 2.000Ki ± 0% -50.00% (p=0.000 n=10) DecodeString/16384-12 16.000Ki ± 0% 8.000Ki ± 0% -50.00% (p=0.000 n=10) geomean 2.000Ki 1.000Ki -50.00% │ beforehex │ afterhex │ │ allocs/op │ allocs/op vs base │ DecodeString/256-12 1.000 ± 0% 1.000 ± 0% ~ (p=1.000 n=10) ¹ DecodeString/1024-12 1.000 ± 0% 1.000 ± 0% ~ (p=1.000 n=10) ¹ DecodeString/4096-12 1.000 ± 0% 1.000 ± 0% ~ (p=1.000 n=10) ¹ DecodeString/16384-12 1.000 ± 0% 1.000 ± 0% ~ (p=1.000 n=10) ¹ geomean 1.000 1.000 +0.00% Change-Id: I5676e48f222d90786ea18e808cb4ecde9de82597 GitHub-Last-Rev: aeedf3f GitHub-Pull-Request: #67259 Reviewed-on: https://go-review.googlesource.com/c/go/+/584118 Auto-Submit: Ian Lance Taylor <iant@google.com> Reviewed-by: Cherry Mui <cherryyz@google.com> LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com> Reviewed-by: Ian Lance Taylor <iant@google.com>

zigo101 · 2024-06-23T19:37:35Z

So now we can take string element addresses without unsafe.StringData:

package main

func main() {
	s := "go"
	
	// All true.
	println(StringData(s) == StringData(s))
	println(*StringData(s) == 'g')
	println(*StringData(s[1:]) == 'o')
}

func StringData(s string) *byte {
	if len(s) == 0 {
		return nil
	}
	return &[]byte(s)[0]
}

;D

It looks like we don't need to rely on the common package. The `[]byte` to `string` conversions shouldn't be an issue for us. The Go compiler is smart enough to make some optimizations already and there are some work on this area: golang/go#2205

bradfitz added accepted Performance GarbageCollector labels Mar 3, 2014

bradfitz assigned rsc Mar 3, 2014

rsc added this to the Unplanned milestone Apr 10, 2015

rsc removed priority-someday labels Apr 10, 2015

randall77 changed the title ~~cmd/compile: read-only escape analysis and avoiding string <-> []byte copies~~ cmd/compile: read-only escape analysis and avoiding string -> []byte copies Mar 15, 2022

dchest mentioned this issue Apr 28, 2022

[WIP] Add generic versions of Hash and Hash128 dchest/siphash#23

Closed

dmitshur modified the milestones: Unplanned, Go1.22 Aug 17, 2023

gopherbot closed this as completed in c8adb30 Aug 17, 2023

dmitshur reopened this Aug 17, 2023

gopherbot closed this as completed in d63c88d Aug 18, 2023

mateusz834 mentioned this issue Sep 1, 2023

cmd/compile: optimize []byte(string1 + string2) #62407

Closed

aaronbee mentioned this issue Sep 20, 2023

Slice bounds panic in radius.NewUserPassword layeh/radius#115

Closed

thepudds mentioned this issue Oct 19, 2023

unsafe: add StringData, String, SliceData #53003

Closed

mateusz834 mentioned this issue May 8, 2024

encoding/hex: don't overallocate memory in DecodeString #67259

Closed

rmg mentioned this issue Aug 19, 2024

encoding/hex: DecodeString wastes memory #29802

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cmd/compile: read-only escape analysis and avoiding string -> []byte copies #2205

cmd/compile: read-only escape analysis and avoiding string -> []byte copies #2205

bradfitz commented Aug 29, 2011

gopherbot commented Aug 30, 2011

bradfitz commented Aug 30, 2011

rsc commented Aug 30, 2011

rsc commented Sep 15, 2011

lvdlvd commented Nov 7, 2011

rsc commented Dec 9, 2011

rsc commented Sep 12, 2012

bradfitz commented Oct 25, 2012

bradfitz commented Nov 17, 2012

robpike commented Mar 7, 2013

rsc commented Mar 12, 2013

bradfitz commented Mar 31, 2013

dvyukov commented Mar 31, 2013

bradfitz commented Mar 31, 2013

bradfitz commented Apr 26, 2013

rsc commented Jul 30, 2013

rsc commented Dec 4, 2013

rsc commented Mar 3, 2014

jfcg commented Mar 25, 2022

gopherbot commented Aug 16, 2023

gopherbot commented Aug 17, 2023

dmitshur commented Aug 17, 2023

gopherbot commented Aug 18, 2023

gopherbot commented Aug 18, 2023

jfcg commented Aug 18, 2023

mdempsky commented Aug 18, 2023

cespare commented Aug 18, 2023

zigo101 commented Nov 28, 2023

gopherbot commented May 8, 2024

zigo101 commented Jun 23, 2024 •

edited

Loading

cmd/compile: read-only escape analysis and avoiding string -> []byte copies #2205

cmd/compile: read-only escape analysis and avoiding string -> []byte copies #2205

Comments

bradfitz commented Aug 29, 2011

gopherbot commented Aug 30, 2011

bradfitz commented Aug 30, 2011

rsc commented Aug 30, 2011

rsc commented Sep 15, 2011

lvdlvd commented Nov 7, 2011

rsc commented Dec 9, 2011

rsc commented Sep 12, 2012

bradfitz commented Oct 25, 2012

bradfitz commented Nov 17, 2012

robpike commented Mar 7, 2013

rsc commented Mar 12, 2013

bradfitz commented Mar 31, 2013

dvyukov commented Mar 31, 2013

bradfitz commented Mar 31, 2013

bradfitz commented Apr 26, 2013

rsc commented Jul 30, 2013

rsc commented Dec 4, 2013

rsc commented Mar 3, 2014

jfcg commented Mar 25, 2022

gopherbot commented Aug 16, 2023

gopherbot commented Aug 17, 2023

dmitshur commented Aug 17, 2023

gopherbot commented Aug 18, 2023

gopherbot commented Aug 18, 2023

jfcg commented Aug 18, 2023

mdempsky commented Aug 18, 2023

cespare commented Aug 18, 2023

zigo101 commented Nov 28, 2023

gopherbot commented May 8, 2024

zigo101 commented Jun 23, 2024 • edited Loading

zigo101 commented Jun 23, 2024 •

edited

Loading