
Fix issue of normal user run tensorflow workloads #2


Merged 1 commit on May 28, 2019


fenglixa
Contributor

What this PR does / why we need it:
Fixes the issue described below.

Issue: When a normal user (not the LSF Primary Admin) runs a TensorFlow job, the job fails with the following errors:

/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 41: /home/fengli/Classify_Image_1557399392392TheKL/.passwd.164.0: Permission denied
/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 42: /home/fengli/Classify_Image_1557399392392TheKL/.group.164.0: Permission denied
/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 52: /home/fengli/Classify_Image_1557399392392TheKL/.passwd.164.0: Permission denied
/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 66: /home/fengli/Classify_Image_1557399392392TheKL/.group.164.0: Permission denied
/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 66: /home/fengli/Classify_Image_1557399392392TheKL/.group.164.0: Permission denied
/home/fengli/mldl_top/scripts/dockerPasswd.sh: line 66: /home/fengli/Classify_Image_1557399392392TheKL/.group.164.0: Permission denied
chmod: cannot access ‘/home/fengli/Classify_Image_1557399392392TheKL/.passwd.164.0’: No such file or directory
chmod: cannot access ‘/home/fengli/Classify_Image_1557399392392TheKL/.group.164.0’: No such file or directory

Expected result: A normal user can run TensorFlow jobs successfully.
Root cause: dockerPasswd.sh is executed by the LSF Primary Admin and writes files to CWD/OUTPUT_FILE_LOCATION. When the workload is run by a user other than the LSF Primary Admin, the LSF Primary Admin has no write permission on that location.

Fix: Grant write permission on OUTPUT_FILE_LOCATION.
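
As a rough illustration of the idea (not the code from this PR's commit), the sketch below opens up a job's output directory so that an account other than the owner can create files in it. The function name `ensure_writable_dir` is hypothetical, and using `chmod go+wx` is an assumption; the actual change may grant the permission differently.

```shell
#!/bin/sh
# Hypothetical helper, not taken from this PR.
# Makes a directory writable by accounts other than its owner, so that a
# script running as a different user can create files such as
# .passwd.* and .group.* inside it.
ensure_writable_dir() {
    dir="$1"
    if [ -z "$dir" ] || [ ! -d "$dir" ]; then
        echo "ensure_writable_dir: not a directory: $dir" >&2
        return 1
    fi
    # Add write and search (execute) permission for group and others.
    # This is deliberately broad for the sketch; an ACL entry for a single
    # account would be a tighter alternative where ACLs are available.
    chmod go+wx "$dir"
}
```

A caller would run something like `ensure_writable_dir "$OUTPUT_FILE_LOCATION"` as the job owner before dockerPasswd.sh writes its files. Note that the other account also needs search permission on every parent directory of that path.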

Please email me or comment here if you have any concerns. Thanks.

@fenglixa fenglixa@cn.ibm.com

@adamsla adamsla merged commit 414912f into IBMSpectrumComputing:master May 28, 2019
@fenglixa fenglixa deleted the fix-normal-user-issue branch May 29, 2019 02:01