You are here

Orcinus

Orcinus System Status

Post date System Status: Update Notes
2018-01-29 - 14:38 PST Online

Monday, January 29, 11:00 AM (PST) Power issue across UBC Campus

The UBC campus experienced  power bumps, all buildings were affected.
 As a result many Orcinus' computing nodes lost power. The jobs running
 there were lost. Please examine your intermediate data and resubmit the
 lost computations.

 We are sorry for any inconveniences, but unfortunately these power issues
 are beyond our control.

2018-01-27 - 17:08 PST Online

File system isuse.

We are having an issue with the file system. Our UBC colleagues will address as soon as possible.

Sorry for the inconvenience.

Saturday January 27 4PM (PST)

The Lustre FS issues have been resolved, but some of the running jobs were lost. Please examine your intermediate data and resubmit the
 lost computations
 Sorry for any inconveniences.

2018-01-27 - 11:21 PST Conditions

File system isuse.

We are having an issue with the file system. Our UBC colleagues will address as soon as possible.

Sorry for the inconvenience.

2018-01-21 - 14:33 PST Online

Sunday, January 21, 2:00 PM (PST) Orcinus power issue

Last night around 4:30 AM (PST) There was a power issue which affected most of the Orcinus' compute nodes. The jobs running on these nodes were lost.

Please examine your intermediate data and resubmit the lost computations

2017-12-03 - 22:57 PST Online

Power and cooling issues Dec. 2, 2017 10:30AM PST

Dec. 2, 2017 10:30AM PST

Around 10:30AM PST UBC Campus experienced short power outage. AS a result Orcinus cluster lost cooling. All running jobs were lost. The power and cooling has been restores and we are working to restore the operational status of the system ASAP.

 

 Sunday, Dec. 3 2017, 10:15PM PST
 The job scheduling has been started. Please examine your input files and
 resubmit the lost computations.

2017-12-02 - 20:00 PST Offline

Power and cooling issues Dec. 2, 2017 10:30AM PST

Dec. 2, 2017 10:30AM PST

Around 10:30AM PST UBC Campus experienced short power outage. AS a result Orcinus cluster lost cooling. All running jobs were lost. The power and cooling has been restores and we are working to restore the operational status of the system ASAP.

2017-12-02 - 20:00 PST Offline

Power and cooling issues Dec. 2, 2017 10:30AM PST

Dec. 2, 2017 10:30AM PST

Around 10:30AM PST UBC Campus experienced short power outage. AS a result Orcinus cluster lost cooling. All running jobs were lost. The power and cooling has been restores and we are working to restore the operational status of the system ASAP.

Pages