-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove project name from schema #684
Conversation
@bdice @csadorf the more I work with these schema-related PRs the more I wonder about the value in using semantic versioning-style versions for schemas. I don't think they're quite appropriate, and I think they introduce more complexity than is worthwhile, especially since there are no equivalents to "patches" AFAICT. I could theoretically imagine a schema version bump that would not require migrating from an old schema by adding support for optional keys, and that could constitute a minor version, but is that important for us to really support? I think the schema code would be significantly simplified by just using a simple integer-indexed version scheme, and that is practically what we are doing right now anyway. Thoughts? |
7ad3e2e
to
6b02258
Compare
@vyasr My recollection is that we will only make “major” (breaking) schema versions, but we use PEP 440 version identifiers (not necessarily semantic versioning) just because it gives us flexibility should we need it. I don’t have strong feelings on this. Integers would be fine with me aside from the pain of refactoring and the minor nicety of using standardized identifiers in the Python ecosystem. |
I mainly ask because the migration code ends up doing a lot of conversion back and forth between "proper" versions and integers (or really, a string containing a single number) because all of the stored variables use version numbers like "1". So I think our options are to either lean into the version numbers by replacing all of those with |
6b02258
to
578d433
Compare
e1e9944 addresses #643 (comment). That case isn't quite covered by the tests added in #682 since that also requires the global config file to be missing, but I think this is good enough for now. I had to change a test because in fact the behavior of various error cases handled by the config CLI wasn't consistent. The existing config code actually had some errors in it that were masked by old behaviors and were uncovered by this change, which is why we observed this problem in the first place. We should aim to improve testing of the config CLI further, but I think that's out of scope for this PR as long as we've resolved the current issues. |
@vyasr Yes, I'm in favor of just going with simple integer increments. While schemas can generally be backwards-compatible, I can't imagine a scenario where communicating compatibility through the version number is adding much value in this specific context. We can always "manually" document specific compatiblity ranges if there is a need for this. |
OK, I'll make a separate PR with that change. |
tests/test_project.py
Outdated
@@ -98,7 +98,7 @@ def test_repr(self): | |||
assert p == self.project | |||
|
|||
def test_str(self): | |||
str(self.project) == self.project.id | |||
str(self.project) == f"Project.get_project({self.project.root_directory()})" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs to use a repr
so that the path is quoted, based on the implementation. I haven't checked if tests are passing.
str(self.project) == f"Project.get_project({self.project.root_directory()})" | |
str(self.project) == f"Project.get_project({repr(self.project.root_directory())})" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well it would help if this test actually contained an assertion... 😂
Looks like this test has remained unchanged since bbf7e7b way back in 2016 (when it was first added). Probably good to enable it now... 🙃
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To know oneself, one should assert oneself.
-- Albert Camus
e1e9944
to
c8cc307
Compare
Codecov Report
@@ Coverage Diff @@
## schema2 #684 +/- ##
===========================================
- Coverage 86.28% 86.24% -0.05%
===========================================
Files 52 52
Lines 5040 5037 -3
Branches 1101 1099 -2
===========================================
- Hits 4349 4344 -5
- Misses 489 491 +2
Partials 202 202
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few more thoughts. I'm fine if you merge this once you've considered/applied my comments, so I'm approving.
mode = "" | ||
_print_err(f"Did not find a{mode}configuration file.") | ||
mode = "local or global" | ||
_print_err(f"Did not find a {mode} configuration file.") | ||
return |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this exit with some nonzero return code? I notice that some of our code paths calling _print_err
are informational, while others indicate actual errors and are followed by sys.exit(1)
. Some code paths raise RuntimeWarning
, which also exits with code 1.
Lines 1715 to 1719 in c8cc307
except RuntimeWarning as warning: | |
_print_err(f"Warning: {warning}") | |
if args.debug: | |
raise | |
sys.exit(1) |
Perhaps we need to assess this in a separate PR, and make sure all the code in __main__.py
is consistent about using nonzero exit codes for errors. For now I'll mark a couple cases that are obvious.
return | |
sys.exit(1) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I read the first paragraph and was going to say "yes, I realized this too but decided that was better suited to a separate PR", and then I read your second paragraph. But... I'll do it anyway: yes, I realized this too but decided that was better suited to a separate PR!
It definitely is something that we should fix, though.
verify_config(cfg) | ||
mode = "local or global" | ||
_print_err(f"Did not find a {mode} configuration file.") | ||
return |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
return | |
sys.exit(1) |
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Initial migration to schema version 2 including config rename (#678) * Implement initial migration to schema version 2. * Require project-local config to be signac.rc (not .signacrc) and make searches stricter to match. * Standardize method for getting project config at a root. * Move config to .signac/config. * Fix import order. * Address PR comments. * Remove some unnecessary code. * Address final PR coments. * Use integer schema version numbers (#688) * Change schema versioning to use integer strings. * Switch from int strings to pure ints. * Update signac/contrib/migration/__init__.py Co-authored-by: Bradley Dice <bdice@bradleydice.com> Co-authored-by: Bradley Dice <bdice@bradleydice.com> * Remove project name from schema (#684) * Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory. * Change Project constructor to use root directory (#706) * Change project constructor to accept a root directory instead of a config file. * Change Project repr. * Address easy PR comments. * Move internal signac files into .signac subdirectory (#708) * Move shell history. * Move sp_cache file. * Address PR comments. * Move discovery to separate functions. (#711) * Move discovery to separate functions. * Address first round of PR comments. * Address PR comments. * Apply suggestions. * Remove configurable workspace directory (#714) * Remove workspace configurability. * Implement workspace_dir migration. * Apply suggestions from code review Co-authored-by: Bradley Dice <bdice@bradleydice.com> * Address remaining PR comments. * Update tests/test_project.py * Remove mention of configurability from project workspace docstring * Address PR comments. Co-authored-by: Bradley Dice <bdice@bradleydice.com> Co-authored-by: Corwin Kerr <cbkerr@umich.edu> * Update description of schema migration. * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Bradley Dice <bdice@bradleydice.com> Co-authored-by: Corwin Kerr <cbkerr@umich.edu> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
* Remove project id API. * Remove project name from config as part of migration. * Fix issues with config CLI and remove project from default cfg. * Address PR comments. * Change the str of a project to the str of its root directory.
Description
This PR updates the v1 to v2 schema migration to remove project names and store preexisting ones in the project document.
Motivation and Context
Resolves #541. Part of the changeset of #643.
Types of Changes
1The change breaks (or has the potential to break) existing functionality.
Checklist:
If necessary: