Daily Bulletin

HPC Systems Maintenance Update

October 25, 2023

HPC systems maintenance activities continue and are proceeding well.  

GLADE file space transitions and storage maintenance activities have been completed. 

Cheyenne has been returned to service. Jobs that were held in the queues at the beginning of the maintenance period are now running again. (Cheyenne will not be accessible through JupyterHub until later in the outage, when broader JupyterHub service is restored.)

Maintenance is ongoing for Derecho, Casper, and JupyterHub, with all systems still planned for return to service by the end of the week.

Casper operating system updates are proceeding, with CISL engineers currently engaged in testing on the refreshed nodes.

Derecho’s return to service has been slowed slightly. During routine hardware maintenance, CISL staff uncovered several components that will require replacement. The replacement procedure requires power to be removed at the cabinet level, and several such cabinets are impacted. Replacement hardware is on the way, but has not yet arrived at NWSC. Still, we expect to return Derecho to users by the end of the week, with more clarity Thursday 10/26 afternoon. 

Please report any Cheyenne usability issues to https://rchelp.ucar.edu.