provide reason a job is not scheduled #1219

Open
grondo opened this issue Jun 6, 2024 · 0 comments

grondo commented Jun 6, 2024

A question sysadmins and developers get often is "why is job X not running?"

It seems like Fluxion could provide insights that make this question easier to answer, perhaps even in the output of flux jobs.
Some reasons that we currently have to determine manually include:

  • waiting for higher priority jobs to be scheduled
  • constraints provided for resources that are currently unavailable
  • highest priority job, but waiting for resources to become available

A simple solution would be for Fluxion to return a reason or similar field in the scheduler annotations when it can provide one. This could be made available in flux jobs.
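
To make the idea concrete, here is a rough sketch of how the three reasons listed above could be mapped to a single annotation value. This is not Fluxion code: the `PendingJob` type, the `pending_reason` and `annotate` helpers, the node model (one property set per idle node), and the `sched.reason` key are all hypothetical and chosen only for illustration. A real scheduler would derive the reason from its own queue and resource graph state.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical reason strings, taken from the list in this issue.
REASON_PRIORITY = "waiting for higher priority jobs to be scheduled"
REASON_CONSTRAINT = "constraints provided for resources that are currently unavailable"
REASON_RESOURCES = "highest priority job, but waiting for resources to become available"


@dataclass
class PendingJob:
    """Hypothetical, simplified view of a pending job."""
    jobid: int
    priority: int
    nnodes: int
    constraints: frozenset = frozenset()  # required node properties


def pending_reason(job, queue, free_nodes) -> Optional[str]:
    """Return a reason the job is not running, or None if it could start.

    queue is the list of all pending jobs; free_nodes is a list of
    property sets, one per idle node (an assumed, simplified model).
    """
    # Another pending job outranks this one.
    if any(o.priority > job.priority for o in queue if o.jobid != job.jobid):
        return REASON_PRIORITY
    # Idle nodes whose properties satisfy every constraint of the job.
    matching = [props for props in free_nodes if job.constraints <= props]
    if len(matching) >= job.nnodes:
        return None
    # Enough idle nodes exist, but too few satisfy the constraints.
    if job.constraints and len(free_nodes) >= job.nnodes:
        return REASON_CONSTRAINT
    return REASON_RESOURCES


def annotate(job, queue, free_nodes) -> dict:
    """Build a scheduler-annotation-like dict; the 'reason' key is an assumption."""
    reason = pending_reason(job, queue, free_nodes)
    return {"sched": {"reason": reason}} if reason else {}
```

With something like this, a front end such as flux jobs would only need to display the string when the annotation is present, and could show nothing when the scheduler cannot provide a reason.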

Another, perhaps longer-term, solution would be to provide an RPC that exposes a snapshot of the current schedule, if one could be made available.

wihobbs added a commit to wihobbs/flux-core that referenced this issue Sep 10, 2024
Problem: autoconf will accept junk at configure time as a valid
version, then go off and generate an invalid version.h file. This
happens frequently with shallow clones, setups in CI, or other
constrained user environments. It has caused new contributors a
lot of confusion in the past.

Solution: Like flux-framework/flux-sched#1219 suggested, reject
invalid versions at configure time and provide appropriate
suggestions to the user to remedy this.