Dear Grace Users,
Scheduled maintenance will be performed on the Grace cluster on December 3-4, 2024, starting at 8:00 am on December 3.
Due to the limited updates needed on Grace at this time, the upcoming December maintenance will not be a full 3-day downtime but will instead involve limited disruptions. The Grace cluster and storage will remain online and available throughout the maintenance period, and there will be no disruption to running or pending batch jobs. However, certain services will be unavailable for short periods during the maintenance window. There will be reduced availability of compute nodes at times, so users might experience temporarily increased wait times.
Maintenance will be performed on sets of nodes, in the following order. Each set will be down briefly and then returned to service.
Tuesday December 3:
Login nodes (there are two nodes but only one will be down at a time)
Globus
Transfer node (transfer-grace.ycrc.yale.edu)
Half of commons nodes
Wednesday December 4:
The remaining commons nodes
All PI nodes
Note to groups with PI nodes:
As the maintenance window approaches, the Slurm scheduler will not start any job submitted to a PI partition if the job’s requested wallclock time extends past the start of the downtime for PI nodes (8:00 am on December 4, 2024). If you run squeue, such jobs will show as pending with the reason “ReqNodeNotAvail”. (If your job can actually be completed in less time than you requested, you may be able to avoid this by requesting an appropriate time limit using “-t” or “--time”.) Held jobs will automatically return to active status after the maintenance period, at which time they will run in normal priority order.
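To estimate the longest time limit that still ends before the PI-node downtime, a small calculation like the following may help. This is only a rough sketch, not an official tool: the function name max_time_limit and the 5-minute safety margin are illustrative assumptions, and it assumes the job would start immediately, which the scheduler does not guarantee.

```python
from datetime import datetime, timedelta

# Start of the PI-node downtime, taken from this announcement (cluster local time).
PI_DOWNTIME_START = datetime(2024, 12, 4, 8, 0)


def max_time_limit(now, downtime_start=PI_DOWNTIME_START,
                   margin=timedelta(minutes=5)):
    """Return the longest time limit, as D-HH:MM:SS, that ends before the
    downtime if the job started at `now`; return None if no time remains.

    `margin` is a hypothetical safety buffer, not a Slurm requirement.
    """
    remaining = downtime_start - now - margin
    if remaining <= timedelta(0):
        return None
    total_seconds = int(remaining.total_seconds())
    days, rest = divmod(total_seconds, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    limit = max_time_limit(datetime.now())
    if limit is None:
        print("The PI-node downtime has started or is about to start.")
    else:
        print(f"Longest time limit that ends before the downtime: {limit}")
```

The printed value uses the days-hours:minutes:seconds form that Slurm accepts for “-t” or “--time”. A job only benefits from a shorter limit if it can really finish within that time.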
The Message of the Day (MOTD) will be updated throughout the maintenance period to report the current status. An email notification will be sent when the maintenance is completed.
Please visit the status page at research.computing.yale.edu/system-status for the latest updates. If you have any questions, comments, or concerns, please contact us at hpc@yale.edu.