Investigate feasibility of UTF8StreamJsonParser without canonicalization

This is a more specific follow-up from one of the components of https://github.com/FasterXML/jackson-benchmarks/pull/6

I'd like to determine whether the comment below still holds true. Initial benchmarking showed a fairly substantial improvement for small inputs, and `UTF8StreamJsonParser` has optimizations beyond key handling that I'd expect to tip the scales in its favor.
https://github.com/FasterXML/jackson-core/blob/36cd882098c448c944f44edb10659d5f25d59ece/src/main/java/com/fasterxml/jackson/core/json/ByteSourceJsonBootstrapper.java#L259-L270

	if (enc == JsonEncoding.UTF8) {
	    /* and without canonicalization, byte-based approach is not performant; just use std UTF-8 reader
	     * (which is ok for larger input; not so hot for smaller; but this is not a common case)
	     */
	    if (JsonFactory.Feature.CANONICALIZE_FIELD_NAMES.enabledIn(factoryFeatures)) {
	        ByteQuadsCanonicalizer can = rootByteSymbols.makeChild(factoryFeatures);
	        return new UTF8StreamJsonParser(_context, parserFeatures, _in, codec, can,
	                _inputBuffer, _inputPtr, _inputEnd, bytesProcessed, _bufferRecyclable);
	    }
	}
	return new ReaderBasedJsonParser(_context, parserFeatures, constructReader(), codec,
	        rootCharSymbols.makeChild(factoryFeatures));
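
To make the current behavior concrete, here is a minimal stand-alone sketch of the selection logic in the snippet above. It is a simplified model, not Jackson code: `ParserSelectionSketch`, `ParserKind`, and `select` are hypothetical names used only for illustration, and the two booleans stand in for the detected encoding and the `CANONICALIZE_FIELD_NAMES` feature check.

```java
// Simplified, stand-alone model of the branch quoted above; not Jackson code.
final class ParserSelectionSketch {
    // Hypothetical labels for the two parser implementations named in the snippet.
    enum ParserKind { UTF8_STREAM, READER_BASED }

    // Mirrors the quoted condition: the byte-based parser is chosen only when
    // the input is UTF-8 AND field-name canonicalization is enabled.
    // Every other combination falls through to the Reader-based parser.
    static ParserKind select(boolean isUtf8, boolean canonicalizeFieldNames) {
        if (isUtf8 && canonicalizeFieldNames) {
            return ParserKind.UTF8_STREAM;
        }
        return ParserKind.READER_BASED;
    }

    public static void main(String[] args) {
        boolean[] flags = {true, false};
        // Print the full truth table for the two inputs.
        for (boolean utf8 : flags) {
            for (boolean canon : flags) {
                System.out.printf("utf8=%b canonicalize=%b -> %s%n",
                        utf8, canon, select(utf8, canon));
            }
        }
    }
}
```

The case this issue is about is `utf8=true, canonicalize=false`, which currently lands on the Reader-based parser. In practice that configuration can be reached with a factory built via `JsonFactory.builder().disable(JsonFactory.Feature.CANONICALIZE_FIELD_NAMES).build()` and a byte-based source.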

Investigate feasibility of UTF8StreamJsonParser without canonicalization #994
