The maintenance work will begin on Monday 8th August and is expected to take up to a week to complete. During this maintenance window there will be no access to the Balena system and all queued jobs will need to be cleared from the scheduler.
The majority of this work will be performed by ClusterVision. We are anticipating needing a full week to give ClusterVision and ourselves enough time to complete these maintenance tasks. We shall open up access once all disruptive tasks have been completed.
Below is a list of some of the maintenance work which will be taking place:
- Upgrading the SLURM scheduler, security patching and enabling new features
- Testing SLURM's node power management
- Enabling global file locking on the BeeGFS scratch partition
- ClusterVision will also be configuring new system monitoring tools