University of Bath

IT services status

Subscribe

Subscribe to IT services status

HPC

Steven Chapman
27th January 2017

Balena maintenance - 2nd February 2017

During Inter-Semester break, on Thursday 2 February between 09:00 and 13:00, we will be placing the cluster into maintenance mode whilst we perform failover tests between the pair of master nodes and BeeGFS node couplets. These tests will ensure that...
Steven Chapman
30th March 2016

Balena HPC maintenance 11th April 2016

On Monday 11th April, the Balena HPC cluster will be unavailable due to maintenance work. Part of this work will include ClusterVision performing a full health check to ensure Balena is running at optimal performance. During this time, all jobs...
Steven Chapman
15th January 2016

Balena unavailable on 26th January 2016

On Tuesday 26th January there will be a mandatory fire suppression test being carried out in the same room as the Balena HPC system. We need to treat this as a potential risk of power outage to the data centre...
Jessica
18th September 2015

IT Maintenance Tuesday 22 September 7am-9am

Maintenance and upgrades will be taking place, during our at risk* period Tuesday 22 September 2015, 7am-9am. The network will be undergoing maintenance. You may experience minor disruptions if you are using the Internet and any of the services below...
Steven Chapman
3rd August 2015

Balena service online after BeeGFS upgrade

The Balena HPC service is now ready for use after the BeeGFS parallel file system upgrade - new features available after this upgrade include informational quota and quota enforcement. We have successfully completed cluster wide pre-production tests to ensure that...
Steven Chapman
10th July 2015

BeeGFS upgrade 27th - 31st July 2015

From 27th July the BeeGFS storage on the Balena HPC cluster will be undergoing an upgrade. We are expecting that Balena will be unavailable for, at most, the entire week while ClusterVision perform the upgrade. During this period there will...
Steven Chapman
3rd July 2015

Aquila system outage 30th June - 2nd July 2015

On the 30th June the storage server providing /apps and /data lost power and resulted in the Aquila system becoming unresponsive. This issue was fixed by reseating the power supply unit on the storage node and the storage node boot...
Jessica
3rd July 2015

HPC Aquila is available

Aquila is available and stable, 5.30pm 02 July 2015. Aquila is running at reduced capacity with 76 standard nodes and the two gpu nodes. Problem summary Original problem on Tuesday, 30 June 2015, was with a power supply on one of...
Jessica
2nd July 2015

HPC Aquila unavailable

The Aquila culster is still unavailable, while problems are investigated by the HPC team.
Steven Chapman
29th June 2015

Aquila storage maintenance work on 7-9am on 7th July 2015

Essential maintenance work will be performed on Aquila's home area storage on 7th July 2015, between 7-9am. While this work is being carried out Aquila will not be available or accessible. All users are required to log out before 8pm...