System Updates Archive

  • Omega Scheduled Maintenance

    Scheduled maintenance will be performed on Omega beginning Monday, April 2nd, 2018, at 8:00 am.  Maintenance is expected to be completed by the end of this week.   During this time, logins will be disabled and Omega’s storage will not be available.  An email notification will be sent when the maintenance has been completed, and the cluster is available.

    As the maintenance window approaches, the Slurm scheduler will not start any job if the job’s requested wallclock time extends past the start of the maintenance period (8:00am on April 2, 2018). If you run squeue, such jobs will show as pending jobs with the reason “ReqNodeNotAvail”. (If your job can actually be completed in less time than you requested, you may be able to avoid this by making sure that you request the appropriate time limit using “-t” or “–time”.) Held jobs will automatically return to active status after the maintenance period, at which time they will run in normal priority order.

    The primary purpose of the maintenance period is to finish moving all data off of the Lustre storage system onto the Loomis GPFS storage system. After the maintenance period is complete, current Omega files will be stored solely on the Loomis GPFS system. For groups that have migrated their workloads entirely to Grace or Farnam, their Omega data will be available for copying and clean-up until December 2018 at

    /gpfs/loomis/home.omega/<metagroup>/<group>
    /gpfs/loomis/scratch.omega/<metagroup>/<group>

    In addition to the above access from Grace, the remaining groups on Omega will still be able to access their storage on Omega at any preexisting paths until the cluster is fully decommissioned in December 2018. These groups will be contacted shortly with more details about the decommission process.

    If you have questions, comments, or concerns, please contact us at hpc@yale.edu.

  • Grace Scheduled Maintenance

    2/26 - 8:30am - Scheduled Grace Maintenance has begun. Logins to Grace are disabled, and Grace’s storage is unavailable to Farnam users. The cluster and storage are expected to be back online by the end of day, Friday March 2nd.  

  • Scheduled Maintenance on Farnam

    2/5/18 - 8:30am - Scheduled maintenance on Farnam has begun. During the maintenance, logins will be disabled. Farnam is expected to be available by Wednesday evening, Feb 7th.

  • Grace - project space unavailable

    Friday, January 12, 2018 - 4:00pm

    The Grace project (aka scratch) space is currently unavailable and any jobs accessing the project space will have failed. Grace project is also unavailable from Farnam preventing new jobs from beginning. We are working to restore the filesystem to service as quickly as possible. Sorry for the inconvience.

    Update (5pm): project space has been restored to service.

  • Grace Scheduled Maintenance - Updated 5pm, 12/13

    12/13 - 5pm - Grace’s scheduled maintenance continues, and the cluster is expected to be available by Thursday afternoon.

     

  • Milgram Scheduled Maintenance

    Milgram will undergo scheduled maintenance beginning Monday 11/27 at 8am, and ending Wednesday 11/29 at 5pm. During this time, logins to Milgram will be disabled and the storage will be unavailable. 

  • Milgram Unavailable

    09:30 - 11/9 - Milgram is currently unavailable and YCRC staff are working on restoring the cluster to service. 

  • Ruddle Scheduled Maintenance 11/6-11/8

    Scheduled maintenance will be performed on Ruddle beginning Monday, November 6, 2017, at 8:00 am.  Maintenance is expected to be completed by the end of the day, Wednesday, November 8, 2017.   During this time, logins will be disabled and Ruddle’s storage will not be available.

     
  • Slurm Issues on Grace and Farnam

    Monday, October 23, 2017 - 5:00pm to 6:30pm

    <p>We are currently experiencing issues with the schedulers on Grace and Farnam. We are working to resolve the issues as soon as possible. Sorry for the inconvenience.</p>

  • Omega Unavailable - Update 18 Sept 2017, 9:00pm

    YCRC staff are currently performing validation checks on Omega’s storage. Omega will be unavailable until these checks successfully complete.

Pages