Add scoped thread-local encoding and decoding contexts to cstore. #30140


Merged
merged 1 commit into rust-lang:master on Dec 9, 2015

Conversation

michaelwoerister
Member

With this commit, metadata encoding and decoding can make use of thread-local encoding and decoding contexts. These allow implementers of serialize::Encodable and Decodable to access information and
data structures that would otherwise not be available to them. For example, we can automatically translate def-id and span information during decoding because the decoding context knows which crate the data is decoded from. It also allows making ty::Ty decodable, because the context has access to the ty::ctxt that is needed for creating ty::Ty instances.

Some notes:

  • tls::with_encoding_context() and tls::with_decoding_context() (as opposed to their unsafe versions) try to keep the TLS data from getting out of sync by checking that the encoder/decoder passed in is actually the same as the one stored in the context. This should prevent accidentally reading from the wrong decoder. (A rough sketch of the intended usage pattern follows these notes.)
  • There are no real tests in this PR. I had a unit test for some of the core aspects of the TLS implementation, but it was rather brittle: a lot of code for mocking ty::ctxt, crate_metadata, etc., while not actually testing very much. The code will soon be exercised by the first incremental compilation auto-tests, which rely on MIR being properly serialized. However, if people think that some tests should be added before this can land, I'll try to provide some that make sense.
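
As a rough sketch of the intended usage pattern (not the PR's exact code: the closure shape of tls::with_decoding_context, the decode_external_def_id helper, and the translate_def_id method are all assumptions made for this example), a Decodable impl could look roughly like this:

// Sketch only. Read the raw DefId as the foreign crate wrote it, then ask the
// thread-local decoding context, which knows which crate the data comes from,
// to map it into the local crate's numbering.
impl Decodable for DefId {
    fn decode<D: Decoder>(d: &mut D) -> Result<DefId, D::Error> {
        let external_def_id = try!(decode_external_def_id(d)); // hypothetical raw decode
        Ok(tls::with_decoding_context(d, |dcx| {
            dcx.translate_def_id(external_def_id)
        }))
    }
}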

r? @nikomatsakis

@arielb1
Contributor

arielb1 commented Dec 1, 2015

AST autoserialization causes a ton of metadata bloat and a decent amount of compile-time bloat. I am not sure I would want to stick with MIR autoserialization if it runs into the same problem. I am not aware of the current astencode manual serialization causing problems (I looked into the topic, and the most annoying part of serializing MIR seems to be dealing with ConstVal and its annoying NodeId contents, but that should probably be refactored anyway).

@oli-obk
Contributor

oli-obk commented Dec 2, 2015

@arielb1: I'm improving ConstVal to get rid of the NodeId stuff. See some explanation of the problems with that here: https://internals.rust-lang.org/t/removing-const-eval-duplication-of-labor-between-librustc-and-librustc-trans/1786

@michaelwoerister
Member Author

Sorry for not responding earlier; I was ill last week.

Some thoughts:

  1. It's not auto-serialization per se that causes the bloat, but its unfortunate interaction with the verbose RBML format. We don't have to stick with RBML in the future.
  2. The Ty and Substs values in the MIR use the tyencode/tydecode infrastructure, so they won't contribute to data bloat even now. The rest of the MIR data structures will, though.
  3. While discussing this with @nikomatsakis before starting the implementation, we came to the conclusion that it would be great for maintainability and refactorability if we could get rid of as much serialization boilerplate code as possible. I think the way to go is to make the compiler's autoserialization support more powerful, so that it can support space-efficient encodings and deal with context-dependent (de-)serialization (although I'm not convinced that the TLS-based implementation of the latter is the best long-term solution). A sketch of what the context-dependent part could look like follows below.
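
To make point 3 concrete, the general shape is to install the context for the duration of a top-level encode (or decode) call, so that Encodable/Decodable impls buried inside autogenerated code can reach it. A minimal sketch, assuming a hypothetical tls::enter_encoding_context entry point (the PR's actual entry point may be named differently), where ecx is the encoding context, rbml_w the RBML encoder, and mir the value being written:

// Install the encoding context around the top-level encode call; inside the
// closure, Encodable impls for Ty, DefId, Span, etc. can retrieve it through
// tls::with_encoding_context().
tls::enter_encoding_context(ecx, rbml_w, |rbml_w| {
    mir.encode(rbml_w)
});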

@arielb1
Contributor

arielb1 commented Dec 7, 2015

I don't recall having maintainability problems with the old astencode manual-decoding code; it mostly just works. Our MIR format is not really supposed to change rapidly.

astencode for Substs is fairly bloated; luckily, it is not used that often. I would, however, prefer to encode (AdtDef, Substs) as a Ty.

@michaelwoerister force-pushed the tls-encoding branch 2 times, most recently from 0b1e599 to e254867 on December 8, 2015 at 18:46
@nikomatsakis
Contributor

I'm pretty happy with this PR. Writing serialization code is painful: not so much a source of bugs (though I've had a few from tyencode), but really annoying, and to little purpose. I don't think autogenerated serialization needs to be inefficient. Any inefficiencies we see can easily be overcome by implementing serialization by hand for a particular type; you can then use the autogenerated code as a building block, by writing things like:

// serialize:
serialize((&self.foo, &self.bar)); // serialize first 2 fields, but not self.span

// deserialize:
let (foo, bar) = deserialize();
MyType { foo: foo, bar: bar, span: DUMMY_SP }
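
A slightly fuller sketch of the same idea, assuming a type like struct MyType { foo: u32, bar: String, span: Span }, the in-tree serialize::Decodable trait of the time, try! rather than ? (which did not exist yet), and syntax's DUMMY_SP placeholder span:

impl Decodable for MyType {
    fn decode<D: Decoder>(d: &mut D) -> Result<MyType, D::Error> {
        // Reuse the autogenerated tuple decoding as a building block, then
        // fill in the field that was deliberately not serialized.
        let (foo, bar): (u32, String) = try!(Decodable::decode(d));
        Ok(MyType { foo: foo, bar: bar, span: DUMMY_SP })
    }
}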

@nikomatsakis
Contributor

@bors r+

@bors
Collaborator

bors commented Dec 8, 2015

📌 Commit e254867 has been approved by nikomatsakis

@bors
Collaborator

bors commented Dec 9, 2015

⌛ Testing commit e254867 with merge 56670b1...

@bors
Collaborator

bors commented Dec 9, 2015

💔 Test failed - auto-win-msvc-64-opt

@michaelwoerister
Member Author

@bors r=nikomatsakis

@bors
Collaborator

bors commented Dec 9, 2015

📌 Commit f65823e has been approved by nikomatsakis

bors added a commit that referenced this pull request Dec 9, 2015
@bors
Collaborator

bors commented Dec 9, 2015

⌛ Testing commit f65823e with merge eebf674...

bors merged commit f65823e into rust-lang:master on Dec 9, 2015