-
Notifications
You must be signed in to change notification settings - Fork 35
Bug 1654558 Implement RerouteDocuments for xfocsp-error-report #1323
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Codecov Report
@@ Coverage Diff @@
## master #1323 +/- ##
============================================
- Coverage 84.71% 83.00% -1.71%
+ Complexity 666 610 -56
============================================
Files 94 73 -21
Lines 4042 3277 -765
Branches 385 352 -33
============================================
- Hits 3424 2720 -704
+ Misses 488 439 -49
+ Partials 130 118 -12
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
|
My expectation (for the future mechanism, not for this PR) was that the json schema from the original namespace would be used for decode, and the decoder would apply rerouting based on metadata only on output; separately MSG would only generate the bq schema for the rerouted dataset. At least to me this seems like the least sanity-questioning approach. It occurs to me now that that might break some metadata extraction in the bq provisioning pipeline (that assumes certain conventions about pathing between bq and json schemas), but in any case this PR appears to do the right thing to me, if not in the way I was expecting. |
I agree it would be better to reroute only at the BQ sink step. That, however, would require more complex logic changes since BQ loading can happen either via ingestion-sink for the streaming pipeline or via ingestion-sink in the batch reprocessing case. I also had imagined that once we have schema metadata in place, we would set |
Relies on mozilla-services/mozilla-pipeline-schemas#581