hpc-ch » HPC System http://www.hpc-ch.org/wp The Swiss HPC Service Provider Community Tue, 07 Feb 2012 21:06:57 +0000 en hourly 1 http://wordpress.org/?v=3.1 Second cabinet of Cray XE6 installed at CSCS http://www.hpc-ch.org/wp/2010/09/08/second-cabinet-of-cray-xe6-installed-at-cscs/ http://www.hpc-ch.org/wp/2010/09/08/second-cabinet-of-cray-xe6-installed-at-cscs/#comments Wed, 08 Sep 2010 06:00:19 +0000 mdl http://www.hpc-ch.org/wp/?p=975 On September 2, 2010 the second cabinet of the Cray XE6 arrived and was installed at CSCS (see previous posting). Both cabinets are identical and will be interconnected to build a single large system.

Continuing the tradition of CSCS, the Cray XE6 has been given the name of a Swiss mountain, in this case Palu. The Piz Palü is a mountain in the Bernina Range in the Canton of Graubünden. There are three summits on its ridge, the highest being 3’901 m high.

We should note that this has been the first installation in the world of a Cray XE6. CSCS and selected users have already run first benchmarks on Palu and they have been very promising. Palu will subsequently be made available to all users starting this week.

The second cabinet of Palu has been financed by the Università della Svizzera Italiana (USI) and will be used in particular by researchers of the Insitute of Computational Science of Prof. Rolf Krause.

The next two photos show the arrival by truck of the cabinet at CSCS and the installation of the second door with a nice representation of the Piz Palü.

]]>
http://www.hpc-ch.org/wp/2010/09/08/second-cabinet-of-cray-xe6-installed-at-cscs/feed/ 0
First Cray XE6 Supercomputer installed at CSCS http://www.hpc-ch.org/wp/2010/08/06/first-cray-xe6-supercomputer-installed-at-cscs/ http://www.hpc-ch.org/wp/2010/08/06/first-cray-xe6-supercomputer-installed-at-cscs/#comments Fri, 06 Aug 2010 07:52:46 +0000 mdl http://www.hpc-ch.org/wp/?p=930 CSCS, the Swiss National Supercomputing Centre, is pleased to announce the successful installation of an early-release version of the new Cray XE6 supercomputing architecture manufactured by Cray Inc.

(Cross Posting from the CSCS news page)

The single cabinet, 20 blade system, contains 160 compute sockets and uses the new 2.1GHz, 12-core AMD Opteron (aka Magny-Cours) CPUs for a total of 1920 compute cores.  The machine, which has been named Piz Palu, has a theoretical peak performance of 16TFlop/s and 2.5 Terabytes of memory. Furthermore the machine contains Cray’s next generation interconnect network, named Gemini, which promises increased performance and fault tolerance over the previous generation SeaStar technology.  Moreover the Gemini interconnect promises better support for Partitioned Global Address Space (PGAS) languages such as Co-array Fortran (CAF) and Unified Parallel C (UPC).  This Cray XE6 system is part of a joint collaboration between Cray and CSCS, and will enable CSCS and its user community to undertake testing and early familiarization with Cray’s next generation hardware and software technologies.

]]>
http://www.hpc-ch.org/wp/2010/08/06/first-cray-xe6-supercomputer-installed-at-cscs/feed/ 1
Monte Rosa: One Year Later http://www.hpc-ch.org/wp/2010/07/23/monte-rosa-one-year-later/ http://www.hpc-ch.org/wp/2010/07/23/monte-rosa-one-year-later/#comments Fri, 23 Jul 2010 10:00:54 +0000 mdl http://www.hpc-ch.org/wp/?p=911 In June of 2009 Monte Rosa was inaugurated at CSCS (see the article on this blog). During the last year Rosa has been working day and night providing cycles to the Swiss researcher community. In this year the number of cores used by the jobs increased rapidly: 57% of the jobs use now more than 512 cores and 33% of the jobs use even more than 2048 cores (status first quarter 2010). 24% of the users are from fluid dynamics and 23% are from physics, followed by nanoscience (12%), earth and environmental sciences (11%) and chemistry (8%).

In the TOP500 ranking Rosa started at rank 23 (June 2009), moved after an upgrade to position 21 (November 2009) and is now at position 27 (June 2010).

We created a movie showing the assembly of Monte Rosa, if you want to have a look click on the image here below or follow this link »

]]>
http://www.hpc-ch.org/wp/2010/07/23/monte-rosa-one-year-later/feed/ 0
CHIPP Phoenix Phase C Now Fully Operational http://www.hpc-ch.org/wp/2010/06/08/chipp-phoenix-phase-c-now-fully-operational/ http://www.hpc-ch.org/wp/2010/06/08/chipp-phoenix-phase-c-now-fully-operational/#comments Tue, 08 Jun 2010 19:44:05 +0000 mdl http://www.hpc-ch.org/wp/2010/06/08/chipp-phoenix-phase-c-now-fully-operational/ CSCS is happy to announce that the Phase C of the Phoenix cluster of CHIPP has being put fully operational this Tuesday. At the same time Phase B has been switched off and is being decommissioned.

The cluster Phoenix is being used to analyze as Tier 2 the data being produced by the LHC experiment at CERN in Geneva. The old compute nodes will be transferred to Swiss universities and used in Tier 3 environments.

We wish the researchers of CHIPP and Cern gut luck in the search of new subatomic particles.

In the next pictures you see the Phoenix sysadmins dismantling the old hardware.

]]>
http://www.hpc-ch.org/wp/2010/06/08/chipp-phoenix-phase-c-now-fully-operational/feed/ 0
New Visualization Cluster Eiger Delivered at CSCS http://www.hpc-ch.org/wp/2010/05/08/new-visualization-cluster-eiger-delivered-at-cscs/ http://www.hpc-ch.org/wp/2010/05/08/new-visualization-cluster-eiger-delivered-at-cscs/#comments Sat, 08 May 2010 21:36:00 +0000 mdl http://www.hpc-ch.org/wp/?p=561 The Visualization/Research & Development Cluster EIGER is a new CSCS facility which extends the current resource portfolio. During the Q2/Q3 2010 it will be fully integrated into the CSCS Supercomputing ecosystem, and will be opened to the swiss scientific user community for hybrid multicore/ multi-GPU computing, visualization, data analysis, and general purpose pre/post processing activities.

The Eiger mountain in the Swiss Alps …

… and the Eiger cluster in the CSCS machine room being assembled (the interconnect cables are still missing).

The EIGER cluster is a tightly coupled computing cluster system, running Novell SUSE Linux Enterprise Server 11 Operating System release and includes 19 nodes based on the dual-socket six-cores AMD Opteron 2427 processor architecture running at 2.2 GHz, offering 24 GB of main system memory per node, for a total of 228 cpu cores and 552 GB aggregate memory. 4 out of 19 cluster nodes offers a larger main system memory capacity up to 48 GB.

Altair PBS Professional V 10.3 is the main batch queuing system installed and supported on the cluster in order to let end-users access in a shared or reserved mode any available visualization/computing resource.

Several class of nodes have been defined inside the cluster, covering special functionalities :

  • Class 0: Administration Node (1x)
  • Class 1: Login Node (1x)
  • Class 2: Visualization Nodes (7x)
  • Class 3: Fat Visualization Nodes (4x)
  • Class 4: Advanced Development Nodes (4x)
  • Class 5: Storage Nodes (2x)

Depending on the node class membership, cluster nodes are equipped with one of the two kind of GPUs family products :

  • NVIDIA GeForce GTX 285 2 GB => Class 2/3 – soon to be extended with NVIDIA GeForce GTX 480 1.5 GB
  • NVIDIA TESLA S1070 GPUs 4 GB => Class 4 – soon to be extended with 2 upcoming new NVIDIA TESLA/FERMI S2070 6 GB

As an high speed network interconnect, the cluster EIGER rely on a dedicated Infiniband QDR fabric infrastructure, supporting both parallel-MPI traffic and the internal parallel scratch file system I/O data traffic. In addition, a commodity 10 GbE LAN ensures interactive login access, home, project and application file sharing among the cluster nodes, and a standard 1 Gbe administration network is also reserved for cluster management purposes.

]]>
http://www.hpc-ch.org/wp/2010/05/08/new-visualization-cluster-eiger-delivered-at-cscs/feed/ 0
Installation of the Phoenix Upgrade for CHIPP http://www.hpc-ch.org/wp/2010/02/06/installation-of-the-phoenix-upgrade-for-chipp/ http://www.hpc-ch.org/wp/2010/02/06/installation-of-the-phoenix-upgrade-for-chipp/#comments Sat, 06 Feb 2010 07:40:14 +0000 mdl http://www.hpc-ch.org/wp/?p=326 In January 2010 started at CSCS the upgrade of the cluster “Phoenix” that is being used as tier 2 by the Swiss Institute of Particle Physics (CHIPP) in the LHC experiment at CERN.  The Swiss commitment in 3 of the 4 large LHC experiments (ATLAS, CMS and LHCb) mandates establishing their own Grid computing infrastructure for performing LHC physics data analysis in Switzerland.

The previous system based on SunBlade 8000 for the Compute Notes, SF_X4200 M2 machines with 2.8 GHz AMD processors for the Service Nodes and SF_X4500 for the Storage Nodes with our ZFS-Solaris Technology has been running now for 3 years and an upgrade and expansion was necessary.

The new HPC system is based on a Gigabit Ethernet network and an additional Infiniband network based on QDR technology with the Sun Datacenter Infiniband Switch 648 and uses Lustre as parallel file system. The worker nodes are base on Sun X6275 blade server, based on the new Intel X5500 processor generation (Nehalem, 2.53 GHz, Quad-Core, 8MBCache).

The hardware has been delivered beginning January, 2010 and will be functional in March, 2010. The old compute nodes will be decomissioned by mid year.

In the next picture you can follow the assembling of the system in the first month.

The truck delivering the hardware beginning of January at a sunny day in Manno.

The boxes with the delivered hardware in the CSCS computing room.

Unloading of the racks that can up to 1’080 kg heavy.

Christoph Grab (CHIPP)  inspects the location of the upgrade of Phoenix (which is next to the existing system).

The first QDR InfiniBand cables (in blue) are connected to the IB fabric. Because they are very fragile, the cables are suspended to the deck and not running in the raised floor.

The sysadmins of Phoenix Jason and Fotis are getting ready to take over the new system.

]]>
http://www.hpc-ch.org/wp/2010/02/06/installation-of-the-phoenix-upgrade-for-chipp/feed/ 0
A New Home for Palu: The ENTER Museum for Computer http://www.hpc-ch.org/wp/2010/02/03/a-new-home-for-palu-the-enter-museum-for-computer/ http://www.hpc-ch.org/wp/2010/02/03/a-new-home-for-palu-the-enter-museum-for-computer/#comments Wed, 03 Feb 2010 06:00:41 +0000 mdl http://www.hpc-ch.org/wp/?p=344 In a previous blog entry we reported about the Cray  XT3, Palu, being decommissioned at CSCS. In the meantime we were able to give a new home to the  three of the six computing racks at ENTER, the Museum for Computer and Technology close  to Solothurn.

Das ENTER ist das einzige Museum in der Schweiz, welches sich der gesamten Breite der Computer, Computerperipherie und Technik widmet. Vermutlich weltweit einzigartig ist die Anzahl noch funktionierender Computersysteme. Nebst den ca. 400 Computern werden auch 100 Taschenrechner, ca. 50 mechanische Rechenmaschinen, Telekommunikation vom Telegrafen, Zentralen über die Telefonie bis zum modernen Handy, Röhrenradios, Röhrenfernsehgeräte, Chiffriermaschinen und vieles andere gezeigt.

It is a good feeling to know that computing systems running once at CSCS can be viewed by the generations of people. The Cray XT3 was an important step for CSCS being the first massively parallel MIMD supercomputer used at CSCS. Palu was used both by researchers of Swiss universities and by MeteoSwiss for weather forecasts.

In the next photos you may see some steps of the shipping of the three racks (each with a weight of about 830 kg). In addition a SGI Origin 9500 has also been donated to the museum.

The first Cray XT3 rack exits the CSCS building:

The rack being loaded on the truck for transport:

The console of the Cray XT3 and an additional rack are loaded in the computer room for transport.

Farewell of the Cray by the director of CSCS, Prof. Thomas Schulthess

]]>
http://www.hpc-ch.org/wp/2010/02/03/a-new-home-for-palu-the-enter-museum-for-computer/feed/ 0
Decommission of «Palu» at CSCS http://www.hpc-ch.org/wp/2010/01/27/decommission-of-%c2%abpalu%c2%bb-at-cscs/ http://www.hpc-ch.org/wp/2010/01/27/decommission-of-%c2%abpalu%c2%bb-at-cscs/#comments Wed, 27 Jan 2010 17:25:56 +0000 mdl http://www.hpc-ch.org/wp/?p=317 On January 27th, 2010 the Cray XT3 has reached the end of its life here at CSCS, for the last time the system has been shut down and disassembled.

This computer has been named after Piz Palü in the Bernina range in Graubunden, Eastern Switzerland with an elevation 3901 m and his last configuration has been the follow:

  • 6 Cabinets containing 14 service processing elements (PEs), subdivided into 7 service blades, and 548 dual core nodes giving 1096 compute PEs subdivided into 137 compute blade
  • 3 GB of Ram (Compute nodes)

Palu started its life as production system on January 2006. This computer was the very first Cray XT machine to set foot on Europe. It was the first supercomputer based on the XT architecture thatrun Catamount operating system using Infiniband as high-speed interconnect. Palu was mainly used for massively parallel jobs.

Of the 6 cabinets of Palu, two will be disassembled by Cray to be used as spare parts, one will stay at CSCS and be the first exhibit of an own museum. The last three cabinets will be shipped next week to a compter museum near to Solothurn. Stay tuned on this blog to get additional information on the museum next week…

In the next photos you may see the technician of Cray starting the disassembling of Palu (this will need almost three working days).

Disassembling of the interconnect.

The interconnect cables and removing the power supply cables.

Another view of the interconnect being disassembled:

Palu_Interconnect

Disassembling part of the compute blades to be used as spare parts.

]]>
http://www.hpc-ch.org/wp/2010/01/27/decommission-of-%c2%abpalu%c2%bb-at-cscs/feed/ 0
Move of Dôle at CSCS to a New Location http://www.hpc-ch.org/wp/2010/01/26/move-of-dole-at-cscs-to-a-new-location/ http://www.hpc-ch.org/wp/2010/01/26/move-of-dole-at-cscs-to-a-new-location/#comments Tue, 26 Jan 2010 16:19:23 +0000 mdl http://www.hpc-ch.org/wp/?p=304 Today the HPC system called “Dôle” has been moved to a new location inside CSCS. Dôle is being used by MeteoSwiss as failover for Buin, the main production system for weather forecast.

Previously Dôle and Buin were placed next to each one. The replacement to two separate locations in the CSCS computer room will ensure a maximum of availability in case of any technical issue.

]]>
http://www.hpc-ch.org/wp/2010/01/26/move-of-dole-at-cscs-to-a-new-location/feed/ 0
HPC @ University of Bern: ubelix – Uni BErn LInuX cluster http://www.hpc-ch.org/wp/2010/01/22/hpc-university-of-bern-ubelix-%e2%80%93-uni-bern-linux-cluster/ http://www.hpc-ch.org/wp/2010/01/22/hpc-university-of-bern-ubelix-%e2%80%93-uni-bern-linux-cluster/#comments Fri, 22 Jan 2010 09:09:12 +0000 ra http://www.hpc-ch.org/wp/?p=261 See also the original slides of Andres Aeschlimann as pdf »

Purpose
This Grid HPC infrastructure is primarily designed to support the researchers at the Campus. They should use their time doing research and not be bothered by deploying a Grid HPC infrastructure.

Picture of ubelix

Some facts

  • first Linux Cluster was installed in 2001 (1 master and 32 single core nodes)
  • continuously expanded to ~1000 cores in >200 nodes today
  • Dual- and quadcore worker nodes
  • Mostly Opterons, increasing # of Intels (Nehalem)
  • several suppliers (mostly SUN, but currently also IBM and some Dell)
  • < 100kW
  • Gentoo Linux www.gentoo.org
  • Kernel 2.6.22/2.6.27
  • 2TB memory, 50TB disk
  • Lustre filesystem: 1.8.1
  • Sun Grid Engine 6.2
  • Gb Switch
  • Currently no Infiniband Switch

Internal (private) network

  • TCP/IP
  • Stackable Switches (~40Gbs)
  • „normal“ Gigabit Ethernet on the worker nodes
  • 10GE Ethernet for high throughput servers

Internal Network of ubelix

Lustre@ubelix

ubelix Lustre

Application portfolio (local users)

  • HE Physics
  • Astronomy
  • Computational and Molecular Population Genetics Lab
  • Space Research Physics
  • Computer Vision and Artificial Intelligence
  • Chemistry and Biochemistry

Applications from remote (SMSCG)

  • ATLAS: high energy physics application developed for the LHC experiment at CERN
  • RSA768: cryptographic application
  • NAMD and GROMACS: biochemistry applications
  • GAMESS: biochemistry application (work in progress)

Remote access to cluster

Other clusters @ UniBE

  • The LHEP UNIBE Atlas T3 2009 – A ROCKS Cluster with ~200 cores (Sun Fire X2200 IU dual quad cores) and ~50 TB on CentOS. Located in same room as ID UNIBE clusgter. Mainly serves local and remote ATLAS scientists. Backfilled with remote users and applications. Speciality: Access only via ARC clients, i.e. remote and local users habe the same interface. http://ce.lhep.unibe.ch
  • Theoretical Physics (~200 cores, with interconnect)
  • Climate Physics (~100 cores)
  • Space Physics (~100 cores)
  • Chemistry (~100 cores, with interconnect)
  • Computational and Molecular Population Genetics (~60 cores)
]]>
http://www.hpc-ch.org/wp/2010/01/22/hpc-university-of-bern-ubelix-%e2%80%93-uni-bern-linux-cluster/feed/ 0