Grex System Status

Post date System Status: Update Notes
2017-05-21 - 17:24 PDT Conditions

Grex is open to test access

Our work on the Grex storage update is almost complete. The system is open for access and is running jobs, for now in test mode. More updates on status and documentation are to follow. Please contact support@westgrid.ca if you experience problems using the system or accessing your data.

2017-05-19 - 21:14 PDT Offline

Grex is down for the planned storage outage.

Grex is down: user login access is not available and batch queues are stopped while we migrate to our new Lustre storage.

The outage has been extended because of stability issues with the new Lustre storage.

2017-05-17 - 06:22 PDT Offline

Grex is down for the planned storage outage.

Grex is down: user login access is not available and batch queues are stopped while we migrate to our new Lustre storage. The ETA for putting Grex back online is Friday, May 19.

2017-05-09 - 08:38 PDT Conditions

Grex storage outage planned for Wed. May 17.

Grex will be down starting at 8:30 AM on Wed. May 17 while we bring the new Lustre storage online. All compute nodes will be reprovisioned with the new Lustre filesystem client and rebooted, so any jobs still running will be lost. A reservation is in place to prevent new, longer jobs from starting. Users can adjust the walltime of their jobs so that they end before the outage, for better throughput.
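As a rough illustration of the walltime adjustment, the sketch below computes the longest walltime a job could request and still finish before the outage. The function name `max_walltime`, the 30-minute safety margin, and the use of naive local times are assumptions made for this example; they are not part of the Grex documentation, and the value would still have to be passed to the scheduler in whatever form it expects.

```python
from datetime import datetime, timedelta

# Outage start taken from the notice; treating it as naive local time is an assumption.
OUTAGE_START = datetime(2017, 5, 17, 8, 30)


def max_walltime(now, outage_start=OUTAGE_START, margin=timedelta(minutes=30)):
    """Return the longest walltime (HH:MM:SS) that ends `margin` before the
    outage, or None if there is no time left."""
    remaining = outage_start - now - margin
    if remaining <= timedelta(0):
        return None
    total_seconds = int(remaining.total_seconds())
    hours, rest = divmod(total_seconds, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    # Hypothetical submission time: exactly one day before the outage.
    print(max_walltime(datetime(2017, 5, 16, 8, 30)))
```

A job submitted with a walltime no longer than this value can fit in front of the reservation instead of waiting until after the outage.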

2017-03-22 - 14:34 PDT Online

System fully operational

Finished on March 22, 2017 - 21:34 GMT

2017-03-20 - 12:16 PDT Downtime Scheduled

Downtime scheduled for Grex login nodes

To install 10Gb Ethernet adapters in the Grex login nodes, we will be reinstalling the nodes on Wednesday, March 22, after 3 PM CST. This will not affect running jobs or user data, but access to a particular login node might be interrupted while it is being reinstalled.

2017-02-07 - 10:59 PST Online

Lustre file system is working again.

The Lustre file system on Grex is working again. Some running jobs might have expired while the file system was not available.
