Breezy System Status

Post date System Status: Update Notes
2017-04-17 - 23:02 PDT Online

Lattice, Parallel, and Breezy back online

Clusters are substantially back online.

2017-04-17 - 20:51 PDT Offline

Power Outage Affecting Lattice, Parallel, and Breezy

This evening an electrical power interruption caused the loss of all jobs on Lattice, Parallel, and Breezy.  Staff are working hard to restore service.

2016-10-27 - 22:29 PDT Online

Breezy back to normal operations after system upgrade.

Breezy has been upgraded to the latest CentOS 6.8 operating system.
Should you discover any issues, please report them. Thank you.

2016-10-13 - 02:05 PDT Conditions

Wednesday evening, Oct. 12 - Job scheduling restored on Breezy after system upgrade

After a file system problem disrupted service on Breezy on Wednesday morning, scheduling has now been restored, but on a reduced number of nodes. During the outage, the Linux system and batch system software were updated. The new system will use cpusets, a method of confining the effects of one job on another by more strictly limiting jobs to the resources requested.  The remaining compute nodes will be updated once the jobs currently running on them have completed.  As some jobs were lost due to the file system problem, please check output carefully and resubmit jobs as necessary.  Sorry for the inconvenience.
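
Because cpusets limit a job to the cores it requested, a program that sizes its thread pool from the total core count of the node may oversubscribe the cores it is actually allowed to use. The following is a minimal sketch, not a site-provided tool, of how a job could check its usable cores on Linux. The function names usable_cpu_count and suggested_thread_count are hypothetical, and the sketch assumes the cpuset is reflected in the process's scheduler affinity mask, as it normally is on Linux.

```python
import os


def usable_cpu_count():
    """Return the number of CPUs this process is allowed to run on.

    os.sched_getaffinity is Linux-only. It reports the scheduler affinity
    mask, which is assumed here to be narrowed by the job's cpuset.
    """
    return len(os.sched_getaffinity(0))


def suggested_thread_count(requested=None):
    """Clamp a requested thread count to the CPUs the job may use."""
    allowed = usable_cpu_count()
    if requested is None:
        return allowed
    # Never return fewer than one thread or more than the allowed CPUs.
    return max(1, min(requested, allowed))


if __name__ == "__main__":
    # os.cpu_count() counts every core on the node, not just the job's share.
    print("CPUs on node:", os.cpu_count())
    print("CPUs usable by this job:", usable_cpu_count())
```

If the two numbers differ, threaded programs should generally be sized to the second one.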

2016-10-12 - 09:50 PDT Offline

Breezy, Lattice, and Parallel are not available due to a file system problem

A non-disruptive file system upgrade in Calgary did not go according to plan, resulting in intermittent access to the file systems.

Some jobs have been lost. We are working with the vendor.

2016-07-11 - 23:01 PDT Online

System fully operational

Finished on July 12, 2016 - 0:00 MDT

2016-07-08 - 17:22 PDT Conditions

Filesystem Degraded

The file system that serves Lattice, Parallel, and Breezy has experienced a condition that forced administrators to reduce the load on the file system temporarily by pausing jobs on a significant portion of the clusters.  Jobs will be resumed when the condition has cleared.

Sorry for the inconvenience.