Status and Maintenance

Computing Systems Status

There are currently no known issues. Please reach out to research.computing@yale.edu with any questions or to report problems. 

Scheduled Maintenance

To perform critical updates and minimize downtime, regular maintenance will be performed on each cluster on a rotating schedule. During maintenance, logins will be disabled, jobs will not run, and cluster storage may be unavailable. Communication will be sent to users four weeks and one week before the maintenance period and in case of any changes.

All YCRC-managed clusters are down for planned maintenance in late Spring (dates for 2026 TBD).

This represents an updated approach to YCRC system maintenance.  Until now, each cluster has had two full-downtime maintenance periods per year, each lasting three days.  With the new approach, each cluster are updated twice a year.  However, only one of these two annual maintenance periods are a full downtime.  The other involves rolling updates to a live cluster.  We are working toward a system in which the annual full-downtime maintenance will take place on all YCRC-managed clusters simultaneously.  Approximately six months after this full downtime, the clusters will be patched with minor updates on a rolling basis with minimal disruption. 

The new approach has several advantages.  By consolidating the major cluster updates, YCRC is able to focus on the preparation for and execution of those major updates once a year, instead of the nearly once a month, freeing more time for supporting researchers.  Performing maintenance on all clusters within a data center simultaneously facilitates maintenance on subsystems that affect multiple clusters.  All clusters are kept on the same major version of the image throughout the year, making for a more consistent and easier-to-support environment.  For any given cluster, the number of days per year of planned total downtime is reduced.  Also, for each cluster, the number of planned total-downtime periods is reduced from two to one.  There is a second period each year of rolling updates, but these will entail limited disruption.

If you have any questions, comments, or concerns, please contact us at hpc@yale.edu.

2025 Maintenance Schedule

Upcoming:

  • Hopper - September 24

Past:

  • Bouchet and its associated storage  - June 2 - June 5.
  • Grace, Milgram and Misha and their associated storage  - June 9 - June 12.
  • McCleary and its associated storage - June 10 - June 12.