Computing Facilities
The WestGrid computing facilities are distributed among several resource provider sites, with some specialization at each site. WestGrid is connected by good networks so that users can use the system which best fits their needs, regardless of where it is physically located. WestGrid provides several types of computing systems, since different users' programs will run best on different kinds of hardware. The systems are for high performance computing, so they are something beyond what you would find on a desktop. We have clusters, clusters with fast interconnect, and shared memory systems. Use the system which best fits your needs, not necessarily the one closest to you. Anything else is less than optimal and a waste of valuable resources. See the QuickStart Guide for New Users introduction to choosing the most appropriate system. For more detailed information about the differences between the WestGrid systems, see the pages in this section. Serial programs can run on one CPU or core of a compute cluster. Some researchers have a serial program which they need to run many times; they can run multiple copies simultaneously on a cluster. Parallel programs have multiple programs running at the same time which have some need to communicate with each other. Then the important distinction is how much they need to communicate and how quickly they need to do it. In order of increasing demands, those programs can run on a regular cluster, a cluster with a fast interconnect, or a shared memory machine. This depends on how the program is written (MPI, OpenMP, threads, etc). How well a parallel program scales will determine how many nodes of a cluster or machine that program should be run on. Other factors will also affect the decision of which system to run on. For example, the amount of memory available, the software which is installed, the restrictions due to software licensing, etc. WestGrid also has specialized systems. For example, ones with special visualization capabiliites, FPGAs, GPUs, etc. See the QuickStart Guides for more information about each system. List of facilities by location
- Simon Fraser University
- ex-storage facility (gridstore and blackhole), Hydra, Robson, Tantalus, Snowpatch, Bugaboo (also a storage facility)
- University of Alberta
- Nexus group, Cortex group, Checkers
- University of British Columbia
- Glacier, Orcinus
- University of Calgary
- Matrix, Terminus
- University of Saskatchewan
- Storage facility (Silo and Hopper)
- University of Victoria
- Storage facility (Pleiades), Hermes, Nestor
List of facilties by general type
- storage
- USask Storage Facility, UVic Storage Facility, SFU Storage facility
- shared memory
- Nexus group, Cortex group
- cluster
- Glacier, Robson, Snowpatch, Hermes
- cluster with fast interconnect
- Matrix, Tantalus, Terminus, Bugaboo, Checkers, Orcinus, Nestor
- visualization
- Hydra [being decommissioned]
- checkers [special nodes with GPUs]
Future Plans These systems are being purchased, installed, or configured now:UVic - serial, parallel, and storage. -- near production U of Calgary - high performance MPI.-- awaiting delivery in JulyU of Lethbridge - large memory. -- near productionU of Manitoba - fat nodes. -- expect delivery in AugustU of Alberta - add GPUs to some nodes on checkers.-- testing More to come in 2010 and 2011 [update plan]:UVic parts 2 and 3 of storage, part 2 of serial.UBC part 2 of compute, storageSFU parts 2 and 3 of compute and storageUofCalgary part 2 or storage and compute. Also "new architecture"UofA part 2 of compute and storage. Also shared memory.UofS 2010 and 2011: Storage parts 2 and 3 Updated 2010-07-12.
