Skip to content

DLPX-74859 Stack fails to come up for GCP engines because delphix-platform service failed #275

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Apr 1, 2021

Conversation

grwilson
Copy link
Contributor

Pull request checklist

Please check if your PR fulfills the following requirements:

  • Tests for the changes have been added (for bug fixes / features)
  • Docs have been reviewed and added / updated if needed (for bug fixes / features)
  • Build was run locally and any changes were pushed
  • Lint has passed locally and any fixes were made for failures

Pull request type

Please check the type of change your PR introduces:

  • Bugfix
  • Feature
  • Code style update (formatting, renaming)
  • Refactoring (no functional changes, no api changes)
  • Build related changes
  • Documentation content changes
  • Other (please describe):

What is the current behavior?

Issue Number: DLPX-74859

What is the new behavior?

Does this introduce a breaking change?

  • Yes
  • No

Other information

@grwilson grwilson requested review from sebroy and prakashsurya March 26, 2021 15:39
@grwilson
Copy link
Contributor Author

- blockinfile:
path: /etc/default/instance_configs.cfg.template
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to undo any configuration that might have previously been applied to the .template file? E.g. remove the file? or remove the block we may have added in an earlier version?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had a meeting with google to discuss how the new packages changed behavior and they mentioned that they maintained the template file in the new packages too even though it's not documented. I decided that it would be best to remove both the template file and the cfg file and just recreate it as part of our ansible logic.

# Make sure that the account daemon is always disabled. The override file
# above should prevent this and this is designed to catch any corner cases.
#
- command: systemctl disable google-accounts-daemon.service
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to undo any configuration that might have previously been applied? E.g. re-enable the service (assuming it's default state is to be enabled)?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't need to do anything as part of upgrade. The packaging remove the old services but does leave them masked:

root@gwilson-gcp:/var/tmp# systemctl list-unit-files \*google\*
UNIT FILE                        STATE
google-accounts-daemon.service   masked
google-clock-skew-daemon.service masked
google-guest-agent.service       enabled
google-instance-setup.service    masked
google-network-daemon.service    masked
google-oslogin-cache.service     static
google-shutdown-scripts.service  enabled
google-startup-scripts.service   enabled
google-oslogin-cache.timer       enabled

If someone ever unmasks the service then it just gets removed.

Copy link
Contributor

@sebroy sebroy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have the same concern/question as Prakash regarding the config file. If we ever have to change what's in there, or if a human were to accidentally tweak what's in there, then I think this ansible logic won't quite do what we want. It might be safest to remove the file, and write it from scratch every time.

@grwilson
Copy link
Contributor Author

grwilson commented Apr 1, 2021

New build: http://selfservice.jenkins.delphix.com/job/devops-gate/job/master/job/appliance-build-orchestrator-pre-push/4977/

I have also upgraded an engine with these changes.

@grwilson grwilson merged commit 52d370f into delphix:master Apr 1, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

3 participants