.dbtignore #5897

ChenyuLInx · 2022-09-21T00:17:09Z

resolves #5733

Description

Add .dbtignore function

Checklist

I have read the contributing guide and understand what's expected of me
I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have opened an issue to add/update docs, or docs changes are not required/relevant for this PR
I have run changie new to create a changelog entry

github-actions · 2022-09-21T00:17:28Z

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.

jtcohen6

Looks much simpler than I was afraid it would be. Big thanks to the maintainer of pathspec!

In some local testing, this seems to work just fine with partial parsing. Only real risk here is degraded parsing performance—adding more overhead to read_files_elapsed—which I haven't observed to a noticeable degree.

We'll want to document this, I just opened an issue so we don't lose track: dbt-labs/docs.getdbt.com#2043

jtcohen6 · 2022-09-21T09:59:30Z

core/dbt/parser/read_files.py

@@ -1,3 +1,5 @@
+import os
+import pathspec  # type: ignore


New dependency to add in setup.py. Checked and confirmed that license is good to use

ChenyuLInx · 2022-09-21T21:53:41Z

core/dbt/clients/system.py

    reobj = re.compile(regex, re.IGNORECASE)

    for relative_path_to_search in relative_paths_to_search:
+        # potential speedup for ignore_spec


Some potential things we can do, not so sure we need to do them before we run into a performance issue here

ChenyuLInx · 2022-09-21T21:54:53Z

core/dbt/parser/read_files.py

+        [".sql"],
+        ParseFileType.Macro,
+        saved_files,
+        dbt_ignore_spec,


Don't really like the pass through here but we either do this or need to refactor the whole read files part of code I think

gshank

The signatures for the 'read_files' calls are getting kind of long, but I suspect that we'll be doing some major refactoring in that area sometime in the next year, so I think we can leave it until then to deal with it.

iknox-fa · 2022-09-22T14:01:52Z

core/dbt/clients/system.py

+        # if ignore_spec.matches(relative_path_to_search):
+        #     continue
        absolute_path_to_search = os.path.join(root_path, relative_path_to_search)
        walk_results = os.walk(absolute_path_to_search)


We should really avoid using os.walk or os.path in favor of Pathlib. All the os.path stuff is old AF and has weird operating system specific gotchas.

Agree! I think we probably leave it to the larger refactor? Same as the signatures for read_files calls are getting long. I also don't like that but feels like right now might be okay to leave it.

inital commit

2dac606

cla-bot bot added the cla:yes label Sep 21, 2022

jtcohen6 mentioned this pull request Sep 21, 2022

Ignore files via .dbtignore dbt-labs/docs.getdbt.com#2043

Closed

1 task

jtcohen6 reviewed Sep 21, 2022

View reviewed changes

fixing existing tests and add new test for ignore

7b5388d

ChenyuLInx requested review from emmyoop, gshank and lostmygithubaccount September 21, 2022 21:36

ChenyuLInx marked this pull request as ready for review September 21, 2022 21:36

ChenyuLInx requested review from a team as code owners September 21, 2022 21:36

ChenyuLInx added 2 commits September 21, 2022 14:50

make ignore_spec optional

a83fb46

add changelog

50bc2fc

ChenyuLInx commented Sep 21, 2022

View reviewed changes

gshank approved these changes Sep 21, 2022

View reviewed changes

iknox-fa reviewed Sep 22, 2022

View reviewed changes

ChenyuLInx merged commit 207cc03 into main Sep 22, 2022

ChenyuLInx deleted the feature/dbtignore branch September 22, 2022 16:06

rlh1994 mentioned this pull request Jul 20, 2023

[CT-2854] [Feature] .dbtignore doesn't ignore files in dbt_packages #8169

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

.dbtignore #5897

.dbtignore #5897

Uh oh!

ChenyuLInx commented Sep 21, 2022 •

edited

Loading

Uh oh!

github-actions bot commented Sep 21, 2022

Uh oh!

jtcohen6 left a comment •

edited

Loading

Uh oh!

jtcohen6 Sep 21, 2022

Uh oh!

ChenyuLInx Sep 21, 2022

Uh oh!

ChenyuLInx Sep 21, 2022

Uh oh!

ChenyuLInx Sep 21, 2022

Uh oh!

gshank left a comment

Uh oh!

iknox-fa Sep 22, 2022

Uh oh!

ChenyuLInx Sep 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

.dbtignore #5897

.dbtignore #5897

Uh oh!

Conversation

ChenyuLInx commented Sep 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

github-actions bot commented Sep 21, 2022

Uh oh!

jtcohen6 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jtcohen6 Sep 21, 2022

Choose a reason for hiding this comment

Uh oh!

ChenyuLInx Sep 21, 2022

Choose a reason for hiding this comment

Uh oh!

ChenyuLInx Sep 21, 2022

Choose a reason for hiding this comment

Uh oh!

ChenyuLInx Sep 21, 2022

Choose a reason for hiding this comment

Uh oh!

gshank left a comment

Choose a reason for hiding this comment

Uh oh!

iknox-fa Sep 22, 2022

Choose a reason for hiding this comment

Uh oh!

ChenyuLInx Sep 22, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ChenyuLInx commented Sep 21, 2022 •

edited

Loading

jtcohen6 left a comment •

edited

Loading