Scatter-gather support on Query APIs #36

jeqo · 2019-08-28T20:12:28Z

Rationale

The storage layer is based on Kafka Streams local stores, which are aligned with partitioning. Currently we have specified that our implementation supports running only a standalone instance for storage, because if we scale out the Zipkin instances, storage gets partitioned between servers.

To cope with this scenario, I'd like to propose scatter-gather support that allows the storage layer to query other instances when building a response.

Example Scenario

Given a partitioned back-end with 3 Zipkin servers (a, b, c) running as a cluster, when a client sends a query, zipkin-a receives the request and forwards the same query to zipkin-b and zipkin-c with an additional query param (e.g. peer=true) so that b and c don't propagate the query further. zipkin-a then collects their responses and builds the final response.
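
The flow in this scenario can be sketched roughly as below. This is only an illustration under assumptions: the class name, the function-typed fields, and the use of trace IDs as plain strings are all hypothetical, and de-duplicating merged results is a guess at what "build response" might involve, not something decided in this issue.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.function.BiFunction;
import java.util.function.Function;

// Hypothetical sketch; names are illustrative and are not Zipkin APIs.
final class ScatterGatherSketch {
  // query -> results found in this instance's local store
  private final Function<String, List<String>> localStore;
  // base URLs of the other instances in the cluster
  private final List<String> peerUrls;
  // (peerUrl, query) -> results returned by that peer; on the wire this
  // call would carry the extra peer=true param
  private final BiFunction<String, String, List<String>> peerCall;

  ScatterGatherSketch(
      Function<String, List<String>> localStore,
      List<String> peerUrls,
      BiFunction<String, String, List<String>> peerCall) {
    this.localStore = localStore;
    this.peerUrls = peerUrls;
    this.peerCall = peerCall;
  }

  /**
   * When peer is true the request came from another instance, so only the
   * local store is consulted. Otherwise the query is scattered to every peer
   * and the results are gathered, keeping first-seen order and dropping
   * duplicates (an assumption of this sketch).
   */
  List<String> query(String query, boolean peer) {
    Set<String> merged = new LinkedHashSet<>(localStore.apply(query));
    if (!peer) {
      for (String peerUrl : peerUrls) {
        merged.addAll(peerCall.apply(peerUrl, query));
      }
    }
    return new ArrayList<>(merged);
  }
}
```

With zipkin-a holding one trace and b and c holding others, a client call on zipkin-a would return the union, while the same call flagged as a peer call would return only zipkin-a's local results.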

Feature Request

This feature will require:

Register current instance URL via metadata API [1]
Have a client to call other instances.
Have a way to distinguish between peer calls and client calls to avoid repeating calls.

Kafka Streams already supports a metadata API to register peer URLs [1]

[1] https://kafka.apache.org/documentation/streams/developer-guide/interactive-queries.html#adding-an-rpc-layer-to-your-application
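
As a rough sketch of the registration step: Kafka Streams lets each instance advertise a host:port endpoint through the `application.server` config, and other instances can then read those endpoints back from the streams metadata (in the Kafka 2.x line, via `KafkaStreams#allMetadataForStore`). The example below only uses the JDK so it stays self-contained; the class and method names are hypothetical, and the list of endpoints passed to `peerUrls` stands in for whatever the metadata lookup returns.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper; not part of Zipkin or Kafka Streams.
final class PeerRegistrySketch {
  /**
   * Streams properties that advertise this instance's endpoint to the group.
   * A real configuration also needs bootstrap.servers and serdes.
   */
  static Properties streamsConfig(String applicationId, String host, int port) {
    Properties props = new Properties();
    props.put("application.id", applicationId);
    props.put("application.server", host + ":" + port);
    return props;
  }

  /**
   * Turns advertised host:port endpoints into peer base URLs, skipping the
   * current instance so it does not call itself. Plain http is assumed.
   */
  static List<String> peerUrls(List<String> endpoints, String self) {
    List<String> urls = new ArrayList<>();
    for (String endpoint : endpoints) {
      if (!endpoint.equals(self)) {
        urls.add("http://" + endpoint);
      }
    }
    return urls;
  }
}
```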

codefromthecrypt · 2019-08-29T01:34:37Z

I think a private API would also let you tell the difference between peer and client calls, and it can still use the same syntax. OTOH, if you need to tunnel through the same endpoint all the way to the storage layer, you could add a fake annotation query item to signal what you need. ZK is part of Kafka, so yeah, it should be possible to find a means of registration. You may end up with the usual clustering concerns, like what happens if the partition node goes down, who takes over, maintaining health and partition metadata about the healthy ones, etc. One question is whether there is an existing partition-aware layer over Kafka. Probably someone made a project like this, and we could list lessons learned if nothing else.

jeqo · 2019-08-29T12:13:09Z

@adriancole thanks for the feedback!

> ZK is part of Kafka so yeah should be possible to find a means of
> registration. you may end up with the usual clustering concerns like what
> if the partition node goes down, who takes over etc. maintaining health and
> partition metadata about the healthy ones etc.

StreamsMetadata already supports this, so no need to use ZK IMO. Also, ZK will potentially be removed from Kafka if this goes through: https://cwiki.apache.org/confluence/display/KAFKA/KIP-500%3A+Replace+ZooKeeper+with+a+Self-Managed+Metadata+Quorum (not any time soon, I suspect)

> one question is if there is an existing partition aware layer over Kafka ..
> probably someone made a project like this and we could list lessons learned
> if not anything else.

IIRC Lightbend built a library on top of the Kafka Streams Metadata API, https://www.lightbend.com/blog/kafka-http-interactive-layer, which does scatter-gather as well. But we should be fine using the Metadata API directly: https://kafka.apache.org/documentation/streams/developer-guide/interactive-queries.html#discovering-and-accessing-application-instances-and-their-local-state-stores

Do we have any zipkin-api client library to start playing on this, or would a plain HTTP client be enough?

codefromthecrypt · 2019-08-29T12:25:31Z

> Do we have any zipkin-api client library to start playing on this, or would a plain HTTP client be enough?

In v1 we used to have this. I can't find the issue, but I thought we had one about a proxied storage: openzipkin/zipkin@8f99f3c#diff-171d8f5bcef53d0f6f81ac312c454a75

jeqo · 2019-08-29T12:50:00Z

Thanks! Will give it a try with OkHttp.
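
For reference, a minimal sketch of what a plain HTTP peer call might look like. It uses the JDK's `java.net.http.HttpClient` (Java 11+) as a stand-in for OkHttp so the example has no external dependencies. The class name is hypothetical, `peer=true` is the example flag proposed in this issue rather than an existing Zipkin parameter, and the response body is returned as a raw string without any JSON handling.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Hypothetical peer client; not an existing Zipkin class.
final class PeerClientSketch {
  private final HttpClient client = HttpClient.newHttpClient();

  /** Appends peer=true to the forwarded request so the peer does not fan out again. */
  static URI peerUri(String peerBaseUrl, String pathAndQuery) {
    String separator = pathAndQuery.contains("?") ? "&" : "?";
    return URI.create(peerBaseUrl + pathAndQuery + separator + "peer=true");
  }

  /** Forwards a GET to one peer and returns the raw response body. */
  String get(String peerBaseUrl, String pathAndQuery)
      throws IOException, InterruptedException {
    HttpRequest request =
        HttpRequest.newBuilder(peerUri(peerBaseUrl, pathAndQuery)).GET().build();
    HttpResponse<String> response =
        client.send(request, HttpResponse.BodyHandlers.ofString());
    if (response.statusCode() != 200) {
      throw new IOException("peer " + peerBaseUrl + " returned " + response.statusCode());
    }
    return response.body();
  }
}
```

The receiving side would check for the flag and, when it is present, answer from its local store only.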

jeqo added the enhancement label Aug 28, 2019

jeqo mentioned this issue Aug 28, 2019

Feature request: Scatter-gather support on Query APIs openzipkin/zipkin#2784

Closed

This was referenced Sep 3, 2019

Fresh results #34

Merged

Scatter-gather support for Query API #38

Merged

jeqo closed this as completed in #38 Sep 17, 2019
