
[Feature] About an idea to add a UDF management module for StreamPark #1782

Open
green241 opened this issue Oct 9, 2022 · 3 comments

Comments

@green241
Contributor

green241 commented Oct 9, 2022

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

I have been using StreamPark for a while now and have had a pretty good experience in terms of ease of use and stability. StreamPark itself supports UDFs, but there doesn't seem to be a unified UDF management menu, so I recommend adding one. Its main purposes would be:

  1. Implement unified management of the UDFs users create (providing the main APIs for manipulating UDF objects).

  2. Currently, StreamPark has users upload UDF jars when creating jobs, but in actual use they may run into problems such as not knowing which UDFs were created before, what each identifier is, and so on. A UDF management module would solve these problems.
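To make the first point concrete, here is a minimal sketch of what such a management module might expose. All class, field, and method names below are hypothetical illustrations, not StreamPark's actual API:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

// Hypothetical sketch of a UDF management module; names are illustrative only.
public class UdfRegistry {

    // Metadata recorded for each registered UDF.
    public static class UdfMeta {
        public final long id;            // primary key, referenced as udfId elsewhere
        public final String identifier;  // function name used in SQL, e.g. "my_upper"
        public final String className;   // fully qualified UDF class inside the jar
        public final String jarPath;     // where the jar is stored on HDFS

        public UdfMeta(long id, String identifier, String className, String jarPath) {
            this.id = id;
            this.identifier = identifier;
            this.className = className;
            this.jarPath = jarPath;
        }
    }

    private final List<UdfMeta> udfs = new ArrayList<>();
    private long nextId = 1;

    // Register a new UDF and return its generated id.
    public long register(String identifier, String className, String jarPath) {
        long id = nextId++;
        udfs.add(new UdfMeta(id, identifier, className, jarPath));
        return id;
    }

    // Look up a UDF by id, so a job can resolve its jar location.
    public Optional<UdfMeta> findById(long id) {
        return udfs.stream().filter(u -> u.id == id).findFirst();
    }

    // List all UDFs, e.g. to populate a drop-down box in the job editor.
    public List<UdfMeta> listAll() {
        return new ArrayList<>(udfs);
    }
}
```

With a registry like this, the job creation page could call listAll() to fill a drop-down, so a user sees each UDF's identifier and jar location instead of re-uploading jars blindly.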

Usage Scenario

Note:

  1. This feature is initially implemented only for SQL jobs in Yarn Application mode;
  2. The JARs are stored on HDFS.

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@green241
Copy link
Contributor Author

green241 commented Oct 10, 2022

The current plan is mainly based on the Yarn Application mode, so the following outlines the main implementation idea.

  1. When creating a job, select the required UDFs (e.g., a drop-down box showing the UDFs available to the current user, associated by udfId);

  2. When starting a job, query the storage paths of the selected UDFs by udfId (there can be more than one), join those storage paths into a single string, and finally pass it to yarn.provided.lib.dirs when submitting the job to achieve dynamic loading.
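Step 2 above can be sketched as follows. Flink's yarn.provided.lib.dirs option takes a semicolon-separated list of directories; lookupUdfDir here is a hypothetical stand-in for the real lookup against the UDF metadata store, and the path layout is an assumption:

```java
import java.util.Arrays;
import java.util.List;

// Sketch: resolve storage paths for the selected udfIds and join them into the
// semicolon-separated value expected by Flink's yarn.provided.lib.dirs.
public class ProvidedLibDirsBuilder {

    // Hypothetical lookup: udfId -> HDFS directory holding that UDF's jar.
    static String lookupUdfDir(long udfId) {
        return "hdfs:///streampark/udfs/" + udfId;
    }

    // Join the directories of all selected UDFs into one config value.
    public static String build(List<Long> udfIds) {
        StringBuilder sb = new StringBuilder();
        for (long id : udfIds) {
            if (sb.length() > 0) {
                sb.append(';'); // Flink separates multiple provided lib dirs with ';'
            }
            sb.append(lookupUdfDir(id));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String dirs = build(Arrays.asList(3L, 7L));
        // The resulting string would then be set on the submission config
        // (e.g. -Dyarn.provided.lib.dirs=<dirs>) so YARN ships the jars with the job.
        System.out.println(dirs);
    }
}
```

Because the directories are passed at submission time rather than bundled into the job jar, adding or removing a UDF only changes this config value, which is what makes the loading "dynamic".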

@datayangl

Actually, your plan is quite similar to Zeppelin's way of managing UDFs. I would like to contribute: first, an overall design for UDF management; second, the stages to implement it.

@green241
Contributor Author

Hi datayangl,

  • So nice, you are warmly welcome.
  • After the next version, we can discuss this in the weekly meeting, including the proposal, solutions, etc.
