
DLPX-65491 Invalid argument when mounting ZFS filesystem #153


Merged
merged 1 commit into from
Oct 31, 2019

Conversation

grwilson
Contributor

No description provided.

@grwilson
Contributor Author

ab-pre-push

# As a result, any new mount will propagate the mount event to its peer
# groups. This can inflate the number of mounts for that mount namespace,
# causing it to hit the mount-max value prematurely.
# To avoid this, we increase the mount-max value to 3 times the default
Contributor

Can you elaborate on how you picked the number 3? Doesn't this just mean that we push the limit a bit further, and that we'd still hit it if we spin up more VDBs?

Contributor Author

Hitting this limit would require about 150K mounted filesystems that were then also part of the systemd-resolved mount namespace. Even 50K filesystems is unlikely in our customer base, but the scalability servers have seen numbers this high, and that's where we were hitting this bug.
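To check how close a given mount namespace is to the limit, one option is to count the entries in its `mountinfo` file and compare that count with `fs.mount-max`. The sketch below is only an illustration and is not part of this PR: the function names and the sample data are hypothetical, and it assumes the standard Linux `/proc/<pid>/mountinfo` layout, where shared mounts carry a `shared:N` tag in the optional fields before the ` - ` separator.

```python
from pathlib import Path

# Hypothetical sample in /proc/<pid>/mountinfo format, for illustration only.
SAMPLE = """\
22 1 8:1 / / rw,relatime shared:1 - ext4 /dev/sda1 rw
36 22 0:31 / /domain0 rw,noatime shared:20 - zfs domain0 rw,xattr
40 22 0:35 / /mnt/private rw,relatime - tmpfs tmpfs rw
"""


def summarize_mountinfo(text: str) -> dict:
    """Count all mounts and the shared ones in mountinfo-formatted text."""
    total = 0
    shared = 0
    for line in text.splitlines():
        if not line.strip():
            continue
        total += 1
        # Fields 1-6 are fixed; anything after them and before " - "
        # is an optional field such as "shared:N" or "master:N".
        fixed_and_optional = line.split(" - ", 1)[0].split()
        optional_fields = fixed_and_optional[6:]
        if any(field.startswith("shared:") for field in optional_fields):
            shared += 1
    return {"total": total, "shared": shared}


def read_mount_max(path: str = "/proc/sys/fs/mount-max"):
    """Return the per-namespace mount limit, or None if it cannot be read."""
    try:
        return int(Path(path).read_text().strip())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    print(summarize_mountinfo(SAMPLE))
    print("fs.mount-max:", read_mount_max())
```

On a live system, passing the contents of `/proc/<pid>/mountinfo` for the systemd-resolved process (instead of `SAMPLE`) would show how many mounts that namespace currently holds.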

Contributor Author

We could also just set all mounts as private, but the impact of that is greater, and given the release timeframe, increasing the default max seemed less risky.

Contributor

I'm fine with that; we should probably just mention that 150K filesystem limit in this comment.

Setting the mounts as private would probably indeed cause issues during upgrade-verify since we are bind mounting domain0 in the container, right?

Also, when running upgrade-verify, would that result in even more mounts in the systemd-resolved namespace?

Contributor Author

I tested this, and the upgrade container does add another factor to our already inflated count. So with that in mind, to support 100,000 filesystems we need to set the max to 3 times that value. If we start adding additional containers, we will have to adjust this.
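The arithmetic behind these numbers can be sketched as follows. This is an illustration, not code from the PR: the helper names are hypothetical, the default of 100,000 is the Linux kernel default for `fs.mount-max`, and the factor of 2 (each filesystem counted once in the root namespace and once via propagation) is inferred from the 150K figure quoted earlier in this thread rather than stated outright.

```python
# Linux kernel default for the fs.mount-max sysctl.
DEFAULT_MOUNT_MAX = 100_000


def required_mount_max(filesystems: int, copies_per_filesystem: int) -> int:
    """Mount limit needed if every filesystem is counted this many times."""
    if filesystems < 0 or copies_per_filesystem < 1:
        raise ValueError("filesystems must be >= 0 and copies >= 1")
    return filesystems * copies_per_filesystem


def supported_filesystems(mount_max: int, copies_per_filesystem: int) -> int:
    """Filesystems that fit under mount_max at the given inflation factor."""
    if mount_max < 0 or copies_per_filesystem < 1:
        raise ValueError("mount_max must be >= 0 and copies >= 1")
    return mount_max // copies_per_filesystem


if __name__ == "__main__":
    new_limit = 3 * DEFAULT_MOUNT_MAX
    # Assumed factor of 2: root namespace plus one propagated copy.
    print(supported_filesystems(new_limit, 2))
    # Factor of 3 once the upgrade container adds another copy.
    print(supported_filesystems(new_limit, 3))
    print(required_mount_max(100_000, 3))
```

Under these assumptions, tripling the default leaves room for 150,000 filesystems at a factor of 2 and 100,000 at a factor of 3, which is why each additional container that receives propagated mounts would require raising the limit again.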

@pzakha
Contributor

pzakha commented Oct 31, 2019

bors delegate+

@bors
Contributor

bors bot commented Oct 31, 2019

✌️ grwilson can now approve this pull request. To approve and merge a pull request, simply reply with bors r+. More detailed instructions are available here.

@grwilson
Contributor Author

bors r+

bors bot added a commit that referenced this pull request Oct 31, 2019
153: DLPX-65491 Invalid argument when mounting ZFS filesystem r=grwilson a=grwilson



Co-authored-by: George Wilson <george.wilson@delphix.com>
@bors
Contributor

bors bot commented Oct 31, 2019

Build succeeded

  • continuous-integration/travis-ci/push

@bors bors bot merged commit fb8751d into delphix:master Oct 31, 2019