Skip to content

This is a tutorial on the basic usage of Scinet HPC

Notifications You must be signed in to change notification settings

jldz9/scinet-cheatsheet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

SCINet Cheat Sheet

SCINet is the USDA-ARS’s initiative for scientific computing.

The high-performance computing (HPC) resources of Scinet currently include Ceres and Altas

This tutorial is meant to let users quickly start on HPC. For detailed instruction please refer to Ceres and Atlas

This tutorial also assumes you have access to Scinet and have initialized the login credentials; if not, check Tutorial

Ceres


Access

Method Code/Link Note
Login ssh user.name@ceres.scinet.usda.gov Replace the user.name with your username in Scinet
DTN Login ssh user.name@ceres-dtn.scinet.usda.gov DTN refer to data transfer node
GUI Open Ondemand Login tutorial and Usage

Data Transfer

Method Code/Link Note
Windows scp -r path\to\target user.name@ceres-dtn.scinet.usda.gov:/path/to/dest Tested on Windows 11 powershell, use " \ " on windows path since windows like it better
Linux rsync -avz --no-p --no-g /path/to/target <user.name>@ceres-dtn.scinet.usda.gov:/path/to/dest
MacOS rsync -avz --no-p --no-g /path/to/target <user.name>@ceres-dtn.scinet.usda.gov:/path/to/dest Use --iconv utf-8-mac if run into issue source
GUI Globus Instruction

Compute
Do not run computational intensive job on login node, you need to allocate a computing node

Method Code/Link Note
Allocate resource salloc -n 8 -N 1 --mem=128G -p=short -t 01:00:00 This means I want to reserve 8 core -n in 1 Node -N with 128GB memory --mem under short partitions -p for 1 hour -t, Ceres gives you an interactive shell after allocated the resource
Job run srun salloc -n 8 -N 1 --mem=128G -p=short -t 01:00:00 your_command Similiar with salloc but you can run your command use srun and specific the resource you need
Batch run Batch script generator Generally like to combine multiple srun together , check Tutorial for more detail
Check job squeue
Cancel job scancel jobID JobID can be obtained from squeue

Software
Software on HPC can be load as module

Method Code/Link Note
Check module availibility module avail your_module your_module refer to the actual module name
load module module load your_module If there are different versions for your_module, this will load the default version, use your_module/version_number to load the version you want
self install software module load miniconda; conda install or pip install This will load Conda package manager and allow you to self-install package, check Toturial
Use container module load apptainer; apptainer foo HPC environment cannot allow docker because of user privilege, apptainer is a great alternative

Atlas


Access

Method Code/Link Note
Login ssh user.name@Atlas-login.hpc.msstate.edu Replace the user.name with your username in Scinet
DTN Login ssh user.name@Atlas-dtn.hpc.msstate.edu DTN refer to data transfer node
GUI Open Ondemand login tutorial and Usage

Data Transfer

Method Code/Link Note
Windows scp -r path\to\target user.name@Atlas-dtn.hpc.msstate.edu:/path/to/dest Tested on Windows 11 powershell, use " \ " on windows path since windows like it better
Linux rsync -avz --no-p --no-g /path/to/target <user.name>@Atlas-dtn.hpc.msstate.edu:/path/to/dest
MacOS rsync -avz --no-p --no-g /path/to/target <user.name>@Atlas-dtn.hpc.msstate.edu:/path/to/dest Use --iconv utf-8-mac if run into issue source
GUI Globus Instruction

Compute
Do not run computational intensive job on login node, you need to allocate a computing node

Method Code/Link Note
Allocate resource salloc -n 8 -N 1 --mem=128G -p=short -t 01:00:00 -A your_user_group Atlas does not excute interactive shell after use salloc and requires user group input, the user group typically is your project name, check Miscellaneous section in the cheat sheet for how to get your user group
Interactive srun salloc -n 8 -N 1 --mem=128G -p=short -t 01:00:00 -A your_user_group --pty bash Get a shell on a compute job
job run srun salloc -n 8 -N 1 --mem=128G -p=short -t 01:00:00 -A your_user_group your_command Similiar with salloc but you can run your command use srun and specific the resource you need
Batch run Batch script generator Generally like to combine multiple srun together , check Tutorial for more detail
Check job squeue
Cancel job scancel jobID JobID can be obtained from squeue

Software
Software on HPC can be load as module

Method Code/Link Note
Check module availibility module avail your_module your_module refer to the actual module name
load module module load your_module If there are different versions for your_module, this will load the default version, use your_module/version_number to load the version you want
self install software module load miniconda; conda install or pip install This will load Conda package manager and allow you to self-install package, check Toturial
Use container module load apptainer; apptainer foo HPC environment cannot allow docker because of user privilege, apptainer is a great alternative

Miscellaneous


Method Code/Link Note
Get user group sacctmgr show associations where user=user.name format=account%20,qos%50 Replace the user.name with your username in Scinet
Project folder /project/your_user_group/ Normally 1TB limitation
90days folder /90daydata/your_user_group/ Unlimited, but will be removed every 90 days
Temporary folder $TMPDIR 1.5TB during executing of job
Home /home/user.name Limit to 15GB
Long term backup rsync -avz --no-p --no-g path/to/dir user.name@nal-dtn.scinet.usda.gov:/LTS/project/your_project_name/ Only backup import results, don't put your random data in it
Cloud Storage check Rclone Rclone was pre-installed on Scinet check rclone --help
Remove ssh key ssh-keygen -R your-hostname Sometime they ask you to do that in email

About

This is a tutorial on the basic usage of Scinet HPC

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published