[CT-2385] [Feature] Use different "state" for state comparison versus deferral #7300

### Is this your first time submitting a feature request?

- [X] I have read the [expectations for open source contributors](https://docs.getdbt.com/docs/contributing/oss-expectations)
- [X] I have searched the existing issues, and I could not find an existing issue for this feature
- [X] I am requesting a straightforward extension of existing dbt functionality, rather than a Big Idea better suited to a discussion

### Describe the feature

Scenario:
- Have a recent run in production
- Pull the artifact from that run
- Change Model A &rarr; rerun _only_ Model A _without_ needing to build all upstream references
- Change Model B (without changing Model A) &rarr; you should be able to _just_ rerun Model B, while still using the production artifact to know the _locations_ of all other upstream models

A challenge today is that `dbt-core` expects to use the same "previous state" manifest for both:
1. **State comparison:** What’s changed? What should be selected to run? (= `--select state:modified+`)
2. **"Defer":** Rewriting references to upstream models that haven't been built in this schema, so that they point to wherever those models exist in prod (= `--defer`)

Those are **related** concepts, and it’s often reasonable to provide the same input to both, but the conflation does make it hard for us to pursue a more-advanced use case.

Ideally, we might be able to do something more like:
1. Given two project states, parse both projects, and get the "diff" between them (`state:modified`)
2. Always use a stable production artifact for deferral (rewriting upstream references). This is how we know where the models actually live (as views/tables) in the prod environments.
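The two steps above can be sketched roughly as follows. This assumes manifests are plain dictionaries shaped like a heavily simplified `manifest.json` "nodes" mapping (keyed by unique ID, each entry carrying a `checksum`, `database`, `schema`, and `alias`). The names `modified_nodes` and `resolve_location` are hypothetical and not part of `dbt-core`, and real `state:modified` comparison looks at more than a single checksum.

```python
from typing import Dict, Set

# Simplified assumption: a "manifest" here is just its nodes mapping.
Manifest = Dict[str, dict]


def modified_nodes(current: Manifest, comparison: Manifest) -> Set[str]:
    """Step 1: diff two project states (a stand-in for `state:modified`)."""
    changed = set()
    for unique_id, node in current.items():
        previous = comparison.get(unique_id)
        # New nodes and nodes whose checksum differs both count as modified.
        if previous is None or previous["checksum"] != node["checksum"]:
            changed.add(unique_id)
    return changed


def resolve_location(unique_id: str, prod: Manifest) -> str:
    """Step 2: look up where an unbuilt upstream model lives in prod."""
    node = prod[unique_id]
    return f'{node["database"]}.{node["schema"]}.{node["alias"]}'


# Hypothetical data: model_b changed on the feature branch, model_a did not.
prod = {
    "model.proj.model_a": {
        "checksum": "a1", "database": "analytics", "schema": "prod", "alias": "model_a",
    },
    "model.proj.model_b": {
        "checksum": "b1", "database": "analytics", "schema": "prod", "alias": "model_b",
    },
}
main_branch = {key: dict(value) for key, value in prod.items()}
feature_branch = {key: dict(value) for key, value in prod.items()}
feature_branch["model.proj.model_b"]["checksum"] = "b2"

# Selection compares two project states; deferral consults the prod artifact.
to_run = modified_nodes(feature_branch, main_branch)
upstream_location = resolve_location("model.proj.model_a", prod)
```

The point of the sketch is that the comparison manifest and the production manifest are separate inputs, even though they happen to be equal in this toy data.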

I think the change could be as simple as `dbt-core` gaining the ability to use a _different_ manifest for each of:
- (1) Stateful selection (`state:`, `result:`, `source_status:`)
- (2) Deferral / cloning (#7256)

If a custom `--defer-diff-state` path is not specified, deferral should keep using the same `--state` path by default. I expect this will continue to make sense for ~80% of cases, and it's a reasonable out-of-the-box behavior.
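As a minimal sketch of that default, assuming both flags arrive as optional paths: the helper name `resolve_defer_state_path` and its parameters are made up for illustration, and nothing here reflects how `dbt-core` actually parses its arguments.

```python
from pathlib import Path
from typing import Optional


def resolve_defer_state_path(
    state: Optional[Path], defer_diff_state: Optional[Path]
) -> Optional[Path]:
    """Pick the artifact directory used for deferral.

    Prefer the dedicated deferral path when it is given; otherwise fall
    back to the path already used for state comparison.
    """
    return defer_diff_state if defer_diff_state is not None else state


# Only --state given: deferral reuses it, matching today's behavior.
same = resolve_defer_state_path(Path("ci-artifacts"), None)

# Both given: selection keeps using --state, deferral uses the other path.
split = resolve_defer_state_path(Path("ci-artifacts"), Path("prod-artifacts"))
```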

These are tricky concepts, and so the naming matters a lot! I'm not thrilled with this breakdown, and so I'm very open to thoughts/feedback:
- `--defer` (boolean)
- `--state` (path)
- `--defer-diff-state` (path)

### Describe alternatives you've considered

Not doing this. In theory, you could keep using the previous run's manifest for slimmer and slimmer state comparison. For any nodes that were deferred, your manifest would contain the "production" version of those nodes.

I don't think this would work in the case where you change a model, and then remove the changes (revert it to `main` / prod state). It feels like a leaky approach, when we'd be better off providing a clearer delineation.

### Who will this benefit?

Users/applications pursuing ever-slimmer CI

### Are you interested in contributing this feature?

_No response_

### Anything else?

Currently, we load up the `--state` manifest once, into the `previous_state` container:

https://github.com/dbt-labs/dbt-core/blob/7045e11aa03c9425a0522f48c2631ddef22a4ccc/core/dbt/task/runnable.py#L79-L83

Then we pass the same `previous_state.manifest` into _both_ node selection and deferral:

https://github.com/dbt-labs/dbt-core/blob/7045e11aa03c9425a0522f48c2631ddef22a4ccc/core/dbt/task/compile.py#L94-L106

The idea here would be to allow users to configure a different previous-state manifest for node selection than for deferral.

	# core/dbt/task/runnable.py: load the --state manifest once
	def set_previous_state(self):
	    if self.args.state is not None:
	        self.previous_state = PreviousState(
	            path=self.args.state, current_path=Path(self.config.target_path)
	        )

	# core/dbt/task/compile.py: deferral reads from that same previous_state
	def _get_deferred_manifest(self) -> Optional[WritableManifest]:
	    if not self.args.defer:
	        return None

	    state = self.previous_state
	    if state is None:
	        raise DbtRuntimeError(
	            "Received a --defer argument, but no value was provided to --state"
	        )

	    if state.manifest is None:
	        raise DbtRuntimeError(f'Could not find manifest in --state path: "{self.args.state}"')
	    return state.manifest
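
One way the split might look, sketched with stand-in classes rather than the real `dbt-core` types. `PreviousStateStub`, `TaskSketch`, and the `previous_defer_state` attribute are all hypothetical names chosen for illustration; the only behavior taken from the code above is the order of checks and the shape of the error messages.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PreviousStateStub:
    """Hypothetical stand-in for PreviousState: a path plus a loaded manifest."""
    path: str
    manifest: Optional[dict] = None


@dataclass
class TaskSketch:
    """Hypothetical task holding one state for selection and one for deferral."""
    defer: bool
    previous_state: Optional[PreviousStateStub] = None
    previous_defer_state: Optional[PreviousStateStub] = None

    def get_deferred_manifest(self) -> Optional[dict]:
        if not self.defer:
            return None
        # Prefer the deferral-specific state; fall back to the selection state.
        state = self.previous_defer_state
        if state is None:
            state = self.previous_state
        if state is None:
            raise RuntimeError(
                "Received a --defer argument, but no state path was provided"
            )
        if state.manifest is None:
            raise RuntimeError(f'Could not find manifest in state path: "{state.path}"')
        return state.manifest


ci = PreviousStateStub("ci-artifacts", {"source": "ci"})
prod = PreviousStateStub("prod-artifacts", {"source": "prod"})

# Selection would still read task.previous_state; deferral reads the prod state.
task = TaskSketch(defer=True, previous_state=ci, previous_defer_state=prod)
deferred = task.get_deferred_manifest()
```

Node selection would keep reading `previous_state.manifest`, so existing invocations that pass only `--state` would behave as they do today.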
