Destination Snowflake: Improve error handling in StagingClient #39135
Conversation
Is SnowflakeStagingClientTest still useful, now that there's a proper integration test?
(Two minor comments, neither blocking.)
```diff
@@ -84,7 +111,8 @@ class SnowflakeStagingClient(private val database: JdbcDatabase) {
     filePath,
     stageName,
     stagingPath,
-    Runtime.getRuntime().availableProcessors()
+    // max allowed param is 99, we don't need so many threads for a single file upload
+    minOf(Runtime.getRuntime().availableProcessors(), 4)
```
Why not `minOf(..., 99)`? Wouldn't this restrict us to 4 threads in most cases?
In our cloud case it is always 1; our pod config is 1 CPU, I believe, based on my previous dev release run.
@gisripa
The pod could also run on 2 CPUs per the values.yaml definition (worker resources), if I'm not mistaken.
I still don't understand why not use `minOf(..., 99)`?
@talnidam We surely can use `minOf(.., 99)`; however, there is no added benefit given the way we upload only one file. We use a 200M file and only 1 file per PUT, and Snowflake chunks it if it is > 64M, which I'm guessing won't be more than 3 or 4 chunks. The advantage of a higher value shows up if we do a PUT with a `directory/*` pattern containing many files. Also, the default is 4 in Snowflake settings if not provided, so I just used that as the max.
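The cap being discussed can be sketched as a small standalone function. The helper name and signature here are illustrative assumptions, not the connector's actual code; the constants (99 as Snowflake's max PARALLEL value, 4 as its default) come from the discussion above.

```kotlin
// Sketch of the parallelism cap: Snowflake's PUT accepts PARALLEL values up
// to 99, but for a single-file upload more than 4 threads adds no benefit,
// and 4 matches Snowflake's own default.
fun uploadParallelism(availableProcessors: Int, cap: Int = 4): Int =
    minOf(availableProcessors, cap).coerceAtLeast(1)

fun main() {
    println(uploadParallelism(1))  // single-CPU cloud pod -> 1
    println(uploadParallelism(16)) // larger machine, capped -> 4
}
```

On a 1-CPU pod this behaves identically to `minOf(..., 99)`; the cap only matters on larger machines, where extra threads would sit idle for a single file anyway.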
```kotlin
private val datasource =
    SnowflakeDatabaseUtils.createDataSource(config, OssCloudEnvVarConsts.AIRBYTE_OSS)
private val database: JdbcDatabase = SnowflakeDatabaseUtils.getDatabase(datasource)
// Intentionally not using actual columns, since the staging client should be agnostic of these
```
Random thought: do we eventually want to explicitly specify column names in our COPY command?
I guess so.. it requires `COPY INTO .. select filename#colindex` or something, I think, right?
Something like that; not sure about the exact syntax, but presumably there's some way to specify "please load into exactly these columns in the target table".
which I guess is actually slightly different from what you have (i.e. "read these columns out of the file" vs "write to these columns in the table")
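For what it's worth, Snowflake's COPY INTO does support both directions at once via a transformation subquery: positional columns (`$1`, `$2`, ...) select data out of the staged file, while an explicit column list targets the table. The table, stage, and column names below are hypothetical, purely to illustrate the shape:

```sql
-- Hypothetical sketch: read positional columns out of the staged file
-- and load them into named columns of the target table.
COPY INTO my_table (id, name)
FROM (SELECT t.$1, t.$2 FROM @my_stage/data.csv.gz t)
FILE_FORMAT = (TYPE = CSV);
```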
Yeah, it's not useful for the happy path; the only useful part was verifying that the correct exception is caught and rethrown as a ConfigError.
What
Use the returned results to determine success/failure for `PUT` and `COPY INTO` commands.
Review guide
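The core idea can be sketched as follows: rather than assuming a `PUT` succeeded, inspect the status column Snowflake returns per uploaded file. The helper name and the treatment of statuses are illustrative assumptions, not the PR's actual code.

```kotlin
// Hedged sketch: Snowflake's PUT reports a per-file status such as
// UPLOADED or SKIPPED (already staged); treat anything else, or a
// missing status, as a failed upload.
fun isPutSuccessful(status: String?): Boolean =
    status != null && status.uppercase() in setOf("UPLOADED", "SKIPPED")

fun main() {
    println(isPutSuccessful("UPLOADED")) // true
    println(isPutSuccessful(null))       // false
}
```

The same pattern applies to `COPY INTO`, whose result rows carry a load status per file.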
User Impact
Can this PR be safely reverted and rolled back?