[COMPLETED]?Colossus maintenance on Thursday, 3rd October

UPDATE, 09:30, starting unmounting of NFS-shares from /cluster on all machines.

We have now solved the problems we encountered on Monday, and are now ready to replace the NFS-exporter.

The work will start on Thursday 3rd October at 09:00 CET. We expect to be finished by the end of the day, possibly earlier.

During the maintenance, we have to unmount /cluster on all virtual machines (VMs) that mount it. This means that the /cluster/projects/pXX areas will be unavailable on the VMs, and it will not be possible to use the module load system for software on the VMs. Some VMs might also require a reboot.

Jobs on Colossus will continue to run as normal, but it will not be possible to submit new jobs during the stop.

Do not run jobs on VMs that need data from /cluster or software modules. If you do so, we will have to kill them to unmount the /cluster area. Also, if the VM needs to be rebooted, all running jobs on it will be killed.

Please save your data before the maintenance window, and follow our Operational Log for the update.

We will update the progress here: http://www.uio.no/english/services/it/research/sensitive-data/log/

This work will increase the current bandwidth to export /cluster partition to the HPC-projects, which has a been a problem lately.

Published Oct. 1, 2019 1:32 PM - Last modified Oct. 3, 2019 1:01 PM