feat: specify node for rebuild instance ops #8151

cjc7373 · 2024-09-14T03:14:10Z

Note: I add nodeSelector to pod in a way that won't affect pod's revision hash, because adding nodeSelector happens after calculating hash.

…rebuild

codecov · 2024-09-14T03:41:17Z

Codecov Report

Attention: Patch coverage is 54.90196% with 46 lines in your changes missing coverage. Please review.

Project coverage is 61.56%. Comparing base (197596a) to head (e38f13a).
Report is 2 commits behind head on main.

Files with missing lines	Patch %	Lines
pkg/controller/instanceset/instance_util.go	59.61%	15 Missing and 6 partials ⚠️
controllers/apps/operations/rebuild_instance.go	58.33%	7 Missing and 3 partials ⚠️
...ollers/apps/operations/rebuild_instance_inplace.go	40.00%	6 Missing and 3 partials ⚠️
pkg/controller/instanceset/reconciler_status.go	25.00%	4 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #8151      +/-   ##
==========================================
+ Coverage   61.11%   61.56%   +0.45%     
==========================================
  Files         359      360       +1     
  Lines       41299    41786     +487     
==========================================
+ Hits        25240    25726     +486     
+ Misses      13812    13792      -20     
- Partials     2247     2268      +21

Flag	Coverage Δ
unittests	`61.56% <54.90%> (+0.45%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

zjx20 · 2024-09-14T09:50:15Z

I suspect there's a potential data race in this implementation: The rebuild workflow adds an annotation to the InstanceSet and then deletes the pod, prompting the InstanceSet to recreate the pod. The InstanceSet controller clears the annotation after determining that the pod has been created. If the InstanceSet controller enters reconciliation and completes successfully before the rebuild workflow deletes the pod (which is possible because the InstanceSet CR's annotation has been modified), the annotation would be cleared prematurely (since the pod hasn't been deleted yet), causing this feature to fail.

I suggest that only clear the annotation of a pod if pod.spec.nodeName equals the one specified in the annotation.

wangyelei · 2024-09-19T02:09:38Z

/cherry-pick release-0.9

github-actions · 2024-09-19T02:10:04Z

🤖 says: Error cherry-picking.

Auto-merging apis/apps/v1alpha1/opsrequest_types.go
Auto-merging config/crd/bases/apps.kubeblocks.io_opsrequests.yaml
Auto-merging controllers/apps/operations/rebuild_instance.go
CONFLICT (content): Merge conflict in controllers/apps/operations/rebuild_instance.go
Auto-merging controllers/apps/operations/rebuild_instance_test.go
CONFLICT (content): Merge conflict in controllers/apps/operations/rebuild_instance_test.go
Auto-merging deploy/helm/crds/apps.kubeblocks.io_opsrequests.yaml
Auto-merging docs/developer_docs/api-reference/cluster.md
Auto-merging pkg/constant/annotations.go
Auto-merging pkg/controller/instanceset/instance_util.go
Auto-merging pkg/controller/instanceset/instance_util_test.go
Auto-merging pkg/controller/instanceset/reconciler_instance_alignment_test.go
Auto-merging pkg/controller/instanceset/reconciler_status.go
Auto-merging pkg/controller/instanceset/reconciler_status_test.go
CONFLICT (content): Merge conflict in pkg/controller/instanceset/reconciler_status_test.go
error: could not apply 87e27f5... feat: specify node for rebuild instance ops (#8151)
hint: After resolving the conflicts, mark them with
hint: "git add/rm ", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".
hint: Disable this message with "git config advice.mergeConflict false"

github-actions · 2024-09-19T02:10:05Z

🤖 says: ‼️ cherry pick action failed.
See: https://github.com/apecloud/kubeblocks/actions/runs/10933182583

cjc7373 added 4 commits September 13, 2024 00:10

schedule once for instance set

d51ee8e

set scheduling annotaion in rebuild instance ops

f5ca4c4

fix

2358913

Merge remote-tracking branch 'origin/main' into feature/specify-node-…

eba2646

…rebuild

cjc7373 requested review from wangyelei and a team as code owners September 14, 2024 03:14

github-actions bot added the size/L Denotes a PR that changes 100-499 lines. label Sep 14, 2024

fix version

92e25d0

cjc7373 added this to the Release 0.9.1 milestone Sep 14, 2024

cjc7373 added 2 commits September 14, 2024 11:29

make linter happy

8ffed77

doc

c1e3265

more unit test

e38f13a

wangyelei approved these changes Sep 18, 2024

View reviewed changes

apecloud-bot added the approved PR Approved Test label Sep 18, 2024

fix a race condition

43bb779

apecloud-bot removed the approved PR Approved Test label Sep 18, 2024

zjx20 approved these changes Sep 18, 2024

View reviewed changes

apecloud-bot added the approved PR Approved Test label Sep 18, 2024

don't affect original behaviour

856a7ae

apecloud-bot removed the approved PR Approved Test label Sep 18, 2024

free6om approved these changes Sep 18, 2024

View reviewed changes

apecloud-bot added the approved PR Approved Test label Sep 18, 2024

cjc7373 merged commit 87e27f5 into main Sep 18, 2024
35 checks passed

cjc7373 deleted the feature/specify-node-rebuild branch September 18, 2024 09:50

cjc7373 added a commit that referenced this pull request Sep 18, 2024

feat: specify node for rebuild instance ops (#8151)

e8c5cb5

wangyelei pushed a commit that referenced this pull request Sep 19, 2024

feat: specify node for rebuild instance ops (#8151)

c7c414d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: specify node for rebuild instance ops #8151

feat: specify node for rebuild instance ops #8151

cjc7373 commented Sep 14, 2024 •

edited

Loading

codecov bot commented Sep 14, 2024 •

edited

Loading

zjx20 commented Sep 14, 2024

wangyelei commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

feat: specify node for rebuild instance ops #8151

feat: specify node for rebuild instance ops #8151

Conversation

cjc7373 commented Sep 14, 2024 • edited Loading

codecov bot commented Sep 14, 2024 • edited Loading

Codecov Report

zjx20 commented Sep 14, 2024

wangyelei commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

github-actions bot commented Sep 19, 2024

cjc7373 commented Sep 14, 2024 •

edited

Loading

codecov bot commented Sep 14, 2024 •

edited

Loading