Use values provided by users for WindowDuration and NumShards #41
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). 📝 Please visit https://cla.developers.google.com/ to sign. Once you've signed (or fixed any issues), please reply here.
I signed it!
CLAs look good, thanks!
Hi Barata - Thank you for the PR! A lot of the methods used in the Beam IO transforms do not support ValueProvider as an input, and the current release of Dataflow templates requires any runtime parameters to be wrapped inside ValueProviders. I think a better approach would be to use a new feature called Dynamic Templates that we are looking to roll out soon.
For example: AvroIO.Write.withNumShards
  @Description("The maximum number of output shards produced when writing.")
  @Default.Integer(1)
- Integer getNumShards();
+ ValueProvider<Integer> getNumShards();
Unfortunately, this will not work, as AvroIO.Write.withNumShards(int) does not accept a ValueProvider as input. The same goes for some of the other parameters below.
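To illustrate the point above, here is a minimal, self-contained sketch of why a setter that takes a plain int cannot consume a value that only exists at run time. It does not use Beam itself: DeferredValue, RuntimeValue, EagerWriter, and DeferredWriter are hypothetical stand-ins that only loosely mirror the idea behind ValueProvider and withNumShards(int), so treat it as an analogy rather than the actual Beam behavior.

```java
public class ValueProviderSketch {

    /** Hypothetical stand-in for a deferred parameter (loosely modeled on ValueProvider). */
    interface DeferredValue<T> {
        T get();
        boolean isAccessible();
    }

    /** A parameter whose value is unknown until the job actually starts. */
    static final class RuntimeValue<T> implements DeferredValue<T> {
        private T value;
        private boolean set;

        /** Simulates the runner supplying the user's parameter at launch time. */
        void supply(T v) {
            value = v;
            set = true;
        }

        public boolean isAccessible() {
            return set;
        }

        public T get() {
            if (!set) {
                throw new IllegalStateException("Value is only available at run time");
            }
            return value;
        }
    }

    /** Hypothetical writer that needs the shard count while the pipeline is being built. */
    static final class EagerWriter {
        final int numShards;

        EagerWriter(int numShards) {
            this.numShards = numShards;
        }
    }

    /** Hypothetical writer that keeps the deferred handle and reads it only when executing. */
    static final class DeferredWriter {
        private final DeferredValue<Integer> numShards;

        DeferredWriter(DeferredValue<Integer> numShards) {
            this.numShards = numShards;
        }

        int shardsAtExecution() {
            return numShards.get();
        }
    }

    /** Returns true if unwrapping the value at construction time fails. */
    static boolean eagerConstructionFails(DeferredValue<Integer> v) {
        try {
            new EagerWriter(v.get());
            return false;
        } catch (IllegalStateException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        RuntimeValue<Integer> shards = new RuntimeValue<>();

        // At template-construction time the value is not there yet.
        System.out.println(eagerConstructionFails(shards));

        // A transform that accepts the deferred handle can be built now and read later.
        DeferredWriter writer = new DeferredWriter(shards);
        shards.supply(4);
        System.out.println(writer.shardsAtExecution());
    }
}
```

In this sketch, only a transform that stores the deferred handle can pick up the user's value. An int-based setter has to be given a concrete number while the template is built, which is consistent with the fixed defaults described in this PR.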
I see it now. I was trying to fix an existing bug that occurs when you create Dataflow jobs using the GCP Console "Export from Pubsub to GCS" feature. I have an ongoing internal ticket with Google that I will follow up on. Thanks for your comments anyway.
Is there any status update on this issue? I'm encountering a similar issue with |
The previous version ignores user-supplied parameters and always uses the defaults (1 shard and a 5m window duration).