Open
Description
Notes from the event.
== Description of the problem ==
- A minor update of glibc was done. This is a process which we have done many times before, and which every sysadmin does without issue quite often when they run a yum update. This is also something which has been done on multiple occasions on whole clusters with NFS root.
- The libc.so file displayed on some CVMFS clients was incorrect. Running sha256sum on libc-2.30.so multiple times in a row would return a different checksum every time.
- We confirmed that the CVMFS cache itself had the right content. It was between the cache and the view that was presented to the client.
- About 10% of the nodes were affected.
== What we did to fix it ==
Two options worked
- Forcing to restart the CVMFS client, on the affected nodes: sudo cvmfs_config killall
- Killing the job that kept libc-2.30.so opened.
== What do we do from now ==
- We have gathered information to open a bug report with CVMFS.
- We will try to reproduce the issue using our development repository.
- We will consider other options for more critical-than-average libraries, such as libc-2.30.so. For example, introducing a symlink layer for those.