Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove project name from schema #684

Merged
merged 5 commits into from
Feb 26, 2022
Merged

Remove project name from schema #684

merged 5 commits into from
Feb 26, 2022

Conversation

vyasr
Copy link
Contributor

@vyasr vyasr commented Feb 21, 2022

Description

This PR updates the v1 to v2 schema migration to remove project names and store preexisting ones in the project document.

Motivation and Context

Resolves #541. Part of the changeset of #643.

Types of Changes

  • Documentation update
  • Bug fix
  • New feature
  • Breaking change1

1The change breaks (or has the potential to break) existing functionality.

Checklist:

If necessary:

  • I have updated the API documentation as part of the package doc-strings.
  • I have created a separate pull request to update the framework documentation on signac-docs and linked it here.
  • I have updated the changelog and added all related issue and pull request numbers for future reference (if applicable). See example below.

@vyasr vyasr added this to the v2.0.0 milestone Feb 21, 2022
@vyasr vyasr requested review from a team as code owners February 21, 2022 19:34
@vyasr vyasr self-assigned this Feb 21, 2022
@vyasr vyasr requested review from bdice and pepak13 and removed request for a team February 21, 2022 19:34
@vyasr
Copy link
Contributor Author

vyasr commented Feb 21, 2022

@bdice @csadorf the more I work with these schema-related PRs the more I wonder about the value in using semantic versioning-style versions for schemas. I don't think they're quite appropriate, and I think they introduce more complexity than is worthwhile, especially since there are no equivalents to "patches" AFAICT. I could theoretically imagine a schema version bump that would not require migrating from an old schema by adding support for optional keys, and that could constitute a minor version, but is that important for us to really support? I think the schema code would be significantly simplified by just using a simple integer-indexed version scheme, and that is practically what we are doing right now anyway. Thoughts?

@vyasr vyasr requested a review from a team as a code owner February 21, 2022 19:41
@vyasr vyasr force-pushed the schema/remove_project_name branch from 7ad3e2e to 6b02258 Compare February 21, 2022 19:41
@bdice
Copy link
Member

bdice commented Feb 21, 2022

@vyasr My recollection is that we will only make “major” (breaking) schema versions, but we use PEP 440 version identifiers (not necessarily semantic versioning) just because it gives us flexibility should we need it. I don’t have strong feelings on this. Integers would be fine with me aside from the pain of refactoring and the minor nicety of using standardized identifiers in the Python ecosystem.

@vyasr
Copy link
Contributor Author

vyasr commented Feb 21, 2022

I mainly ask because the migration code ends up doing a lot of conversion back and forth between "proper" versions and integers (or really, a string containing a single number) because all of the stored variables use version numbers like "1". So I think our options are to either lean into the version numbers by replacing all of those with packaging.Version objects or to move completely away from them in favor of simple integers. @csadorf do you have a preference? I wasn't part of the early design discussions around the migration logic so I'm not sure if you two had specific thoughts at the time on the importance of full versions.

@vyasr
Copy link
Contributor Author

vyasr commented Feb 21, 2022

e1e9944 addresses #643 (comment). That case isn't quite covered by the tests added in #682 since that also requires the global config file to be missing, but I think this is good enough for now. I had to change a test because in fact the behavior of various error cases handled by the config CLI wasn't consistent. The existing config code actually had some errors in it that were masked by old behaviors and were uncovered by this change, which is why we observed this problem in the first place. We should aim to improve testing of the config CLI further, but I think that's out of scope for this PR as long as we've resolved the current issues.

@csadorf csadorf removed the request for review from a team February 22, 2022 09:05
@csadorf
Copy link
Contributor

csadorf commented Feb 22, 2022

@vyasr Yes, I'm in favor of just going with simple integer increments. While schemas can generally be backwards-compatible, I can't imagine a scenario where communicating compatibility through the version number is adding much value in this specific context. We can always "manually" document specific compatiblity ranges if there is a need for this.

@vyasr
Copy link
Contributor Author

vyasr commented Feb 23, 2022

OK, I'll make a separate PR with that change.

@vyasr vyasr mentioned this pull request Feb 24, 2022
12 tasks
@@ -98,7 +98,7 @@ def test_repr(self):
assert p == self.project

def test_str(self):
str(self.project) == self.project.id
str(self.project) == f"Project.get_project({self.project.root_directory()})"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This needs to use a repr so that the path is quoted, based on the implementation. I haven't checked if tests are passing.

Suggested change
str(self.project) == f"Project.get_project({self.project.root_directory()})"
str(self.project) == f"Project.get_project({repr(self.project.root_directory())})"

Copy link
Contributor Author

@vyasr vyasr Feb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well it would help if this test actually contained an assertion... 😂

Looks like this test has remained unchanged since bbf7e7b way back in 2016 (when it was first added). Probably good to enable it now... 🙃

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To know oneself, one should assert oneself.
-- Albert Camus

@vyasr vyasr force-pushed the schema/remove_project_name branch from e1e9944 to c8cc307 Compare February 26, 2022 04:58
@vyasr vyasr requested a review from bdice February 26, 2022 04:58
@codecov
Copy link

codecov bot commented Feb 26, 2022

Codecov Report

Merging #684 (c8cc307) into schema2 (cf73d1c) will decrease coverage by 0.04%.
The diff coverage is 74.07%.

❗ Current head c8cc307 differs from pull request most recent head ca697f8. Consider uploading reports for the commit ca697f8 to get more accurate results

Impacted file tree graph

@@             Coverage Diff             @@
##           schema2     #684      +/-   ##
===========================================
- Coverage    86.28%   86.24%   -0.05%     
===========================================
  Files           52       52              
  Lines         5040     5037       -3     
  Branches      1101     1099       -2     
===========================================
- Hits          4349     4344       -5     
- Misses         489      491       +2     
  Partials       202      202              
Impacted Files Coverage Δ
signac/common/validate.py 100.00% <ø> (ø)
signac/__main__.py 78.16% <56.25%> (+0.48%) ⬆️
signac/contrib/migration/v1_to_v2.py 92.00% <100.00%> (-8.00%) ⬇️
signac/contrib/project.py 89.29% <100.00%> (-0.16%) ⬇️
signac/contrib/migration/__init__.py 82.75% <0.00%> (-3.45%) ⬇️
signac/common/config.py 78.12% <0.00%> (-3.13%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update cf73d1c...ca697f8. Read the comment docs.

Copy link
Member

@bdice bdice left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few more thoughts. I'm fine if you merge this once you've considered/applied my comments, so I'm approving.

mode = ""
_print_err(f"Did not find a{mode}configuration file.")
mode = "local or global"
_print_err(f"Did not find a {mode} configuration file.")
return
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this exit with some nonzero return code? I notice that some of our code paths calling _print_err are informational, while others indicate actual errors and are followed by sys.exit(1). Some code paths raise RuntimeWarning, which also exits with code 1.

signac/signac/__main__.py

Lines 1715 to 1719 in c8cc307

except RuntimeWarning as warning:
_print_err(f"Warning: {warning}")
if args.debug:
raise
sys.exit(1)

Perhaps we need to assess this in a separate PR, and make sure all the code in __main__.py is consistent about using nonzero exit codes for errors. For now I'll mark a couple cases that are obvious.

Suggested change
return
sys.exit(1)

Copy link
Contributor Author

@vyasr vyasr Feb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I read the first paragraph and was going to say "yes, I realized this too but decided that was better suited to a separate PR", and then I read your second paragraph. But... I'll do it anyway: yes, I realized this too but decided that was better suited to a separate PR!

It definitely is something that we should fix, though.

verify_config(cfg)
mode = "local or global"
_print_err(f"Did not find a {mode} configuration file.")
return
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
return
sys.exit(1)

@vyasr vyasr merged commit 01873a1 into schema2 Feb 26, 2022
@vyasr vyasr deleted the schema/remove_project_name branch February 26, 2022 05:42
@vyasr vyasr mentioned this pull request Mar 4, 2022
vyasr added a commit that referenced this pull request Mar 14, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
vyasr added a commit that referenced this pull request Apr 14, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
vyasr added a commit that referenced this pull request Apr 19, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
vyasr added a commit that referenced this pull request Apr 19, 2022
* Initial migration to schema version 2 including config rename (#678)

* Implement initial migration to schema version 2.

* Require project-local config to be signac.rc (not .signacrc) and make searches stricter to match.

* Standardize method for getting project config at a root.

* Move config to .signac/config.

* Fix import order.

* Address PR comments.

* Remove some unnecessary code.

* Address final PR coments.

* Use integer schema version numbers (#688)

* Change schema versioning to use integer strings.

* Switch from int strings to pure ints.

* Update signac/contrib/migration/__init__.py

Co-authored-by: Bradley Dice <bdice@bradleydice.com>

Co-authored-by: Bradley Dice <bdice@bradleydice.com>

* Remove project name from schema (#684)

* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.

* Change Project constructor to use root directory (#706)

* Change project constructor to accept a root directory instead of a config file.

* Change Project repr.

* Address easy PR comments.

* Move internal signac files into .signac subdirectory (#708)

* Move shell history.

* Move sp_cache file.

* Address PR comments.

* Move discovery to separate functions. (#711)

* Move discovery to separate functions.

* Address first round of PR comments.

* Address PR comments.

* Apply suggestions.

* Remove configurable workspace directory  (#714)

* Remove workspace configurability.

* Implement workspace_dir migration.

* Apply suggestions from code review

Co-authored-by: Bradley Dice <bdice@bradleydice.com>

* Address remaining PR comments.

* Update tests/test_project.py

* Remove mention of configurability from project workspace docstring

* Address PR comments.

Co-authored-by: Bradley Dice <bdice@bradleydice.com>
Co-authored-by: Corwin Kerr <cbkerr@umich.edu>

* Update description of schema migration.

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Co-authored-by: Bradley Dice <bdice@bradleydice.com>
Co-authored-by: Corwin Kerr <cbkerr@umich.edu>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
vyasr added a commit that referenced this pull request Apr 21, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
vyasr added a commit that referenced this pull request May 2, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
bdice pushed a commit that referenced this pull request Jun 14, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
bdice pushed a commit that referenced this pull request Aug 1, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
bdice pushed a commit that referenced this pull request Oct 7, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
bdice pushed a commit that referenced this pull request Oct 27, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
vyasr added a commit that referenced this pull request Oct 30, 2022
* Remove project id API.

* Remove project name from config as part of migration.

* Fix issues with config CLI and remove project from default cfg.

* Address PR comments.

* Change the str of a project to the str of its root directory.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants