Find and fix possible performance bottlenecks #16

Yesterday I did some profiling using the setup described [here](https://llogiq.github.io/2015/07/15/profiling.html).
The resulting callgrind file is attached. This can be opened with qcachegrind on Mac or kcachegrind on Linux.

[callgrind.out.35583.zip](https://github.com/mre/hyperjson/files/2161224/callgrind.out.35583.zip)

If you don't have either of those programs handy, I've added a screenshot of the two main bottlenecks that I can see. I'm not an expert, but it looks like we spend a lot of time allocating, converting, and dropping the BTreeMap, which is converted to a dictionary and returned to Python at the end.

I guess we could save a lot of time by making this part more efficient, e.g. by copying less and working on references instead. I might be mistaken, though. Help and pull requests are very welcome. 😊

<img width="1151" alt="hyperjson-bench" src="https://user-images.githubusercontent.com/175809/42246989-13c82328-7f1f-11e8-9cd1-b4a3c5564735.png">
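
To illustrate what "copying less and working on references" could look like, here is a minimal sketch using only the standard library. The `Value` enum and the functions `keys_owned`, `keys_borrowed`, and `count_leaves` are hypothetical stand-ins invented for this example; they are not the actual hyperjson types or code, and the real conversion to a Python dictionary is not shown.

```rust
use std::collections::BTreeMap;

// Hypothetical stand-in for a parsed JSON value (an assumption for this
// sketch, not the type hyperjson actually uses).
#[derive(Clone, Debug, PartialEq)]
pub enum Value {
    Null,
    Number(f64),
    Str(String),
    Object(BTreeMap<String, Value>),
}

// Copy-heavy variant: takes ownership of the map, so a caller that still
// needs the map afterwards has to clone the whole tree before calling.
pub fn keys_owned(map: BTreeMap<String, Value>) -> Vec<String> {
    map.into_iter().map(|(key, _)| key).collect()
}

// Borrowing variant: returns `&str` slices that point into the existing
// map, so no keys or values are cloned.
pub fn keys_borrowed(map: &BTreeMap<String, Value>) -> Vec<&str> {
    map.keys().map(String::as_str).collect()
}

// Recursive walk over a borrowed tree: counts the non-object values
// without allocating or cloning anything.
pub fn count_leaves(value: &Value) -> usize {
    match value {
        Value::Object(map) => map.values().map(count_leaves).sum(),
        _ => 1,
    }
}
```

With the owned variant, a caller that wants to keep the map has to write `keys_owned(map.clone())`, which allocates and later drops a full copy. The borrowing variants avoid that, at the cost of tying the result's lifetime to the map. Whether the same pattern applies to the hot path in the screenshot would need to be confirmed by profiling again.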


