Implement FieldMappingIngestionMessageMapper for pull-based ingestion#20729
varunbharadwaj
left a comment
LGTM. Let's add the integration tests for the field message mapper in the next PR as discussed.
As a suggestion, @imRishN and @varunbharadwaj -- don't merge from main into a PR branch. Instead, rebase the PR branch on top of main. The squashed commit has a very noisy commit message as it picked up a bunch of other commits that were merged into the PR branch: 4ecd720

@msfroh yeah, I realized this after merging it. Discussed offline with Peter to see if it is possible to amend the commit to fix it - but looks like it is not possible. Will double check the commits next time.
…opensearch-project#20729) * Implement FieldMappingIngestionMessageMapper for pull-based ingestion Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Add changelog Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Address bot comment Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Address comments Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Address comments Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Remove affiliation column for emeritus maintainers (opensearch-project#20725) Emeritus maintainers are not active in the project, therefore I don't see a lot of value in tracking their affiliation. Signed-off-by: Andrew Ross <andrross@amazon.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Add bitmap64 query support (opensearch-project#20606) --------- Signed-off-by: Divya <DIVYA2@ibm.com> Signed-off-by: Divya <divyaruhil999@gmail.com> Co-authored-by: Divya <DIVYA2@ibm.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * fix stream transport TLS cert hot-reload by using live SSLContext from SecureTransportSettingsProvider (opensearch-project#20734) Signed-off-by: Rishabh Maurya <rishabhmaurya05@gmail.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Bump OpenTelemetry to 1.59.0 and OpenTelemetry Semconv to 1.40.0 (opensearch-project#20737) Signed-off-by: Andriy Redko <drreta@gmail.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * [Pull-based Ingestion] Remove experimental tag for pull-based ingestion (opensearch-project#20704) * remove experimental tag for pull-based ingestion Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> * update BroadcastRequest to be marked as public API Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> --------- Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Bump Apache Lucene from 10.3.2 to 10.4.0 (opensearch-project#20735) Signed-off-by: Ankit 
Jain <jainankitk@apache.org> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Minor Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Fix spotless check Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Address bot comment Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Make id mandatory when id field provided Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Fix spotless check Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Introducing indexing & deletion strategy planner interfaces (opensearch-project#20585) Signed-off-by: Shashank Gowri <shnkgo@amazon.com> Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Add changelog Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Refactor Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Fix spotless check Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Empty commit Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Remove duplicate changelog Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> * Empty commit Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> --------- Signed-off-by: Rishab Nahata <rishabnahata07@gmail.com> Signed-off-by: Andrew Ross <andrross@amazon.com> Signed-off-by: Divya <DIVYA2@ibm.com> Signed-off-by: Divya <divyaruhil999@gmail.com> Signed-off-by: Rishabh Maurya <rishabhmaurya05@gmail.com> Signed-off-by: Andriy Redko <drreta@gmail.com> Signed-off-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> Signed-off-by: Ankit Jain <jainankitk@apache.org> Signed-off-by: Shashank Gowri <shnkgo@amazon.com> Co-authored-by: Andrew Ross <andrross@amazon.com> Co-authored-by: Divya <117009486+divyaruhil@users.noreply.github.com> Co-authored-by: Divya <DIVYA2@ibm.com> Co-authored-by: Rishabh Maurya <rishabhmaurya05@gmail.com> Co-authored-by: Andriy Redko <drreta@gmail.com> Co-authored-by: Varun Bharadwaj <varunbharadwaj1995@gmail.com> Co-authored-by: Ankit Jain <jainankitk@apache.org> Co-authored-by: Shashank Gowri 
<shashankgowri@gmail.com> Signed-off-by: Aparajita Pandey <aparajita31pandey@gmail.com>
Description
Implements the field_mapping mapper type that extracts _id, _version, and _op_type from configurable top-level fields in raw stream messages. Extracted fields are removed from _source. Supports boolean op_type detection by default, with an optional delete_value setting for custom value matching (e.g., "Y"/"N", "expired", integer flags).
Also updates the mapper factory to accept mapper settings and wires mapper settings from IngestionSource through DefaultStreamPoller.
Related Issues
Resolves #20728
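To make the extraction behavior in the Description concrete, here is a minimal, self-contained sketch. It is not the code from this PR: the class name FieldMappingSketch, the Mapped holder, the method signatures, and the example field names (user_id, ts, is_deleted) are all hypothetical. It assumes that a boolean true in the op-type field means "delete", that delete_value is compared against the string form of the field value (so integer flags also match), and that a missing version field is simply left unset. The mandatory-id check follows the "Make id mandatory when id field provided" commit.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only; not the OpenSearch FieldMappingIngestionMessageMapper.
final class FieldMappingSketch {

    /** Holder for the values pulled out of one raw message. */
    static final class Mapped {
        final String id;                  // null when no id field is configured
        final Long version;               // null when no version field is configured or present
        final String opType;              // "index" or "delete"
        final Map<String, Object> source; // remaining fields, i.e. the would-be _source

        Mapped(String id, Long version, String opType, Map<String, Object> source) {
            this.id = id;
            this.version = version;
            this.opType = opType;
            this.source = source;
        }
    }

    /**
     * Extracts id, version, and op type from top-level fields of {@code raw}.
     * Any of the field-name arguments may be null, meaning "not configured".
     */
    static Mapped map(Map<String, Object> raw, String idField, String versionField,
                      String opTypeField, String deleteValue) {
        // Work on a copy so the caller's map is left untouched.
        Map<String, Object> source = new LinkedHashMap<>(raw);

        String id = null;
        if (idField != null) {
            Object value = source.remove(idField); // extracted fields leave the source
            if (value == null) {
                throw new IllegalArgumentException("missing id field: " + idField);
            }
            id = String.valueOf(value);
        }

        Long version = null;
        if (versionField != null) {
            Object value = source.remove(versionField);
            if (value instanceof Number) {
                version = ((Number) value).longValue();
            } else if (value != null) {
                version = Long.parseLong(value.toString());
            }
        }

        String opType = "index";
        if (opTypeField != null && isDelete(source.remove(opTypeField), deleteValue)) {
            opType = "delete";
        }
        return new Mapped(id, version, opType, source);
    }

    /** Boolean detection by default; string comparison when a delete value is set. */
    static boolean isDelete(Object value, String deleteValue) {
        if (value == null) {
            return false;
        }
        if (deleteValue != null) {
            return deleteValue.equals(String.valueOf(value));
        }
        return Boolean.parseBoolean(value.toString());
    }
}
```

With a message {user_id: "u1", ts: 42, is_deleted: "Y", name: "alice"} and delete value "Y", this sketch yields id "u1", version 42, op type "delete", and a remaining source containing only name.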
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.