University of Bath

IT services status

HPC

Steven Chapman
27th January 2017

Balena maintenance - 2nd February 2017

During the inter-semester break, on Thursday 2 February between 09:00 and 13:00, we will place the cluster into maintenance mode while we perform failover tests between the pair of master nodes and the BeeGFS node couplets. These tests will ensure that...
Steven Chapman
25th July 2016

Balena maintenance - 8th to 12th August 2016

The maintenance work will begin on Monday 8th August and is expected to take up to a week to complete. During this maintenance window there will be no access to the Balena system and all queued jobs will need to...
Steven Chapman
30th March 2016

Balena HPC maintenance 11th April 2016

On Monday 11th April, the Balena HPC cluster will be unavailable due to maintenance work. Part of this work will include ClusterVision performing a full health check to ensure Balena is running at optimal performance. During this time, all jobs...
Steven Chapman
15th January 2016

Balena unavailable on 26th January 2016

On Tuesday 26th January a mandatory fire suppression test will be carried out in the same room as the Balena HPC system. We need to treat this as a potential risk of power outage to the data centre...
Steven Chapman
14th October 2015

Aquila /data area is being recovered

An update for everyone on the condition of the /data storage area: I have some good news! We received the new power supply from ClusterVision for the storage server. After fitting the new supply, we discovered that the mainboard has also...
Marianne
30th September 2015

Aquila HPC service retirement - 30 September 2015

The compute nodes on the Aquila HPC system will be powered down on 30th September 2015, starting at midday. As well as powering down the compute nodes, we will also be disabling the scheduler system and turning off...
Steven Chapman
3rd August 2015

Balena service online after BeeGFS upgrade

The Balena HPC service is now ready for use after the BeeGFS parallel file system upgrade. New features available after this upgrade include informational quota and quota enforcement. We have successfully completed cluster-wide pre-production tests to ensure that...
Steven Chapman
10th July 2015

BeeGFS upgrade 27th - 31st July 2015

From 27th July the BeeGFS storage on the Balena HPC cluster will be undergoing an upgrade. We are expecting that Balena will be unavailable for, at most, the entire week while ClusterVision perform the upgrade. During this period there will...
Steven Chapman
3rd July 2015

Aquila system outage 30th June - 2nd July 2015

On 30th June the storage server providing /apps and /data lost power, which left the Aquila system unresponsive. This issue was fixed by reseating the power supply unit on the storage node and the storage node boot...
Steven Chapman
29th June 2015

Aquila storage maintenance work, 7-9am on 7th July 2015

Essential maintenance work will be performed on Aquila's home area storage on 7th July 2015, between 7am and 9am. While this work is being carried out, Aquila will not be available or accessible. All users are required to log out before 8pm...