This is a simple Python scheduler for a multi-user, multi-GPU server. As of now, the scheduler works purely on the trust that all users will use it to run their scripts. It is NOT INTENDED for complex environments that need security.
Right now, the only things it can do are allocate GPUs to users on demand, kill any intruding processes, and kill processes that have exceeded their initial lifetime.
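For illustration, a minimal sketch of what that enforcement pass might look like with `psutil` and `nvidia-ml-py3` (the `allowed_pids` helper is hypothetical; the real scheduler derives allocations from the files described below):

```python
import psutil
from pynvml import (nvmlInit, nvmlShutdown, nvmlDeviceGetCount,
                    nvmlDeviceGetHandleByIndex,
                    nvmlDeviceGetComputeRunningProcesses)

def kill_intruders(allowed_pids):
    """Kill every GPU compute process that was not allocated its GPU.

    `allowed_pids(gpu_index)` is a hypothetical helper returning the set
    of PIDs allowed on that GPU (e.g. read from /var/sps/gpu/<X>).
    """
    nvmlInit()
    try:
        for gpu in range(nvmlDeviceGetCount()):
            handle = nvmlDeviceGetHandleByIndex(gpu)
            for proc in nvmlDeviceGetComputeRunningProcesses(handle):
                if proc.pid not in allowed_pids(gpu):
                    try:
                        psutil.Process(proc.pid).kill()
                    except psutil.NoSuchProcess:
                        pass  # the intruder already exited
    finally:
        nvmlShutdown()
```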
flufl.lock
psutil
nvidia-ml-py3
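These can typically be installed with `pip install flufl.lock psutil nvidia-ml-py3`.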
`salloc` will start an interactive shell with the correct GPU allocated. Closing the shell releases the GPU.
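A minimal sketch of that flow, assuming the GPU index has already been chosen and using the `/var/sps/gpu/X` layout described below (the marker-file contents are an assumption):

```python
import os
import subprocess

SPS_GPU_DIR = "/var/sps/gpu"

def interactive_shell(gpu: int) -> None:
    """Spawn a shell pinned to one GPU and release the GPU when it exits."""
    marker = os.path.join(SPS_GPU_DIR, str(gpu))
    with open(marker, "w") as f:
        f.write(str(os.getpid()))  # record the holder; exact format assumed
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu))
    try:
        # Blocks until the user closes the shell.
        subprocess.run([os.environ.get("SHELL", "/bin/bash")], env=env)
    finally:
        os.remove(marker)  # closing the shell releases the GPU
```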
Run the scheduler.
`sbatch` will queue the job for execution. The first queued job runs first; there are no priority settings or limitations for now. An up-time (maximum lifetime) must be defined for the job.
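A sketch of what submission might look like, following the job-naming scheme described under the structure notes below (the JSON fields are assumptions):

```python
import getpass
import json
import os
import time

ADDQUEUE = "/var/sps/addqueue"

def submit_batch_job(command: str, uptime_sec: int) -> str:
    """Drop a JSON job file into the user's add queue and return its path."""
    user = getpass.getuser()
    name = f"{time.time()}-{user}-sbatch-{os.getpid()}.job"
    path = os.path.join(ADDQUEUE, user, name)
    with open(path, "w") as f:
        json.dump({"command": command, "uptime": uptime_sec}, f)
    return path
```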
Read and report the current queue.
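Reading the queue can be as simple as listing `/var/sps/queue`; since job names begin with `time.time()`, sorting on the parsed timestamp gives chronological order:

```python
import os

QUEUE_DIR = "/var/sps/queue"

def print_queue() -> None:
    """Report queued jobs, oldest first (job names start with time.time())."""
    jobs = [n for n in os.listdir(QUEUE_DIR) if n.endswith(".job")]
    for name in sorted(jobs, key=lambda n: float(n.split("-", 1)[0])):
        print(name)
```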
Report wall time of all users. (later)
Queue will be located at `/var/sps/queue`.
Add queue will be located at `/var/sps/addqueue/<user>`.
Quota for GPU will be located at `/var/sps/addqueue/<user>.quota`. It will only have effect when `/var/sps/addqueue/<user>` exists.
The current job for GPU `X` will be at `/var/sps/gpu/X`.
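A sketch of how the quota lookup might work given that layout (the quota file is assumed to hold a single integer):

```python
import os

ADDQUEUE = "/var/sps/addqueue"

def gpu_quota(user: str):
    """Return the user's GPU quota, or None when no quota is in effect."""
    userdir = os.path.join(ADDQUEUE, user)
    quota_file = userdir + ".quota"
    # The quota only has effect when /var/sps/addqueue/<user> exists.
    if not os.path.isdir(userdir) or not os.path.isfile(quota_file):
        return None
    with open(quota_file) as f:
        return int(f.read().strip())  # file format assumed: one integer
```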
Jobs will be named `<time>-<user>-<type>-<pid>.job`, and the corresponding shell environment for the batch job will be named `<time>-<user>-<type>-<pid>.env`. `time` will come from the Python function `time.time()`. `pid` will be the PID of the job submitter. `type` will be either `salloc` or `sbatch`. All job files and env files are in JSON format.
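As an illustration, a job file for a batch submission might look like the following; `time`, `user`, `type`, and `pid` mirror the filename fields, while the remaining keys are assumptions:

```json
{
  "time": 1651234567.89,
  "user": "alice",
  "type": "sbatch",
  "pid": 12345,
  "command": "python train.py",
  "uptime": 86400
}
```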
- All variables and functions are currently contained in a single file for each instance. Structure this better.
- The lock file can arbitrarily be deleted (see the sketch below).
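For reference, guarding the queue with `flufl.lock` looks roughly like this (the lock path is an assumption); note that nothing stops another user from simply deleting the lock file:

```python
from datetime import timedelta
from flufl.lock import Lock

# Lock path is an assumption; the real scheduler may keep it elsewhere.
QUEUE_LOCK = "/var/sps/queue.lock"

with Lock(QUEUE_LOCK, lifetime=timedelta(seconds=15)):
    pass  # read or modify /var/sps/queue while holding the lock
```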