Harnessing a supercomputer for ATLAS

The Vega supercomputer in Slovenia is the most recent HPC centre to process ATLAS experimental data

16 June, 2022

Andrej Filipčič (left) and Jan Jona Javoršek (right) from the Jožef Stefan Institute in Ljubljana, Slovenia, next to Vega at the Institute of Information Science in Maribor. (Image: CERN)

The ATLAS collaboration uses a global network of data centres – the Worldwide LHC Computing Grid – to perform data processing and analysis. These data centres are generally built from commodity hardware to run the whole spectrum of ATLAS data crunching, from reducing the raw data coming out of the detector down to a manageable size to producing plots for publication.

While the Grid’s distributed approach has proven very successful, the computing needs of the LHC experiments keep expanding, so the ATLAS collaboration has been exploring the potential of integrating high-performance computing (HPC) centres in the Grid’s distributed environment. HPC harnesses the power of purpose-built supercomputers constructed from specialised hardware, and is used widely in other scientific disciplines.

However, HPC poses significant challenges for ATLAS data processing. Access to supercomputer installations is typically subject to more restrictions than access to Grid sites, and their CPU architectures may not be suitable for ATLAS software. Their scheduling mechanisms favour very large jobs using many thousands of nodes, which is atypical of an ATLAS workflow. Finally, a supercomputer installation may be geographically distant from the storage hosting ATLAS data, which may pose network problems.

Despite these challenges, ATLAS collaborators have successfully exploited HPC centres over the last few years, including several near the top of the famous Top500 list of supercomputers. Technological barriers were overcome by isolating the main computation from the parts requiring network access, such as data transfer. Software issues were resolved by using container technology, which allows ATLAS software to run on any operating system, and by developing “edge services”, which enable computations to run in an offline mode without the need to contact external services.

The most recent HPC centre to process ATLAS data is Vega – the first new petascale EuroHPC JU machine, hosted at the Institute of Information Science in Maribor, Slovenia. Vega started operation in April 2021 and consists of 960 nodes, each of which contains 128 physical CPU cores, for a total of 122 880 physical or 245 760 logical cores. To put this in perspective, the total number of cores provided to ATLAS from Grid resources is around 300 000.
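The core counts above follow directly from the node figures, and a few lines of Python make the arithmetic explicit. This is only a back-of-the-envelope check: the factor of two hardware threads per physical core is an assumption inferred from the ratio of logical to physical cores quoted above, and the Grid figure is the approximate one given in the text.

```python
# Figures quoted in the article.
NODES = 960
PHYSICAL_CORES_PER_NODE = 128
GRID_CORES = 300_000  # approximate number of Grid cores available to ATLAS

# Assumption: two hardware threads per physical core, inferred from the
# 2:1 ratio of logical to physical cores quoted in the text.
THREADS_PER_CORE = 2

physical_cores = NODES * PHYSICAL_CORES_PER_NODE
logical_cores = physical_cores * THREADS_PER_CORE

# Size of Vega relative to the Grid resources used by ATLAS.
vega_to_grid = logical_cores / GRID_CORES

print(f"physical cores: {physical_cores}")
print(f"logical cores:  {logical_cores}")
print(f"Vega logical cores / Grid cores: {vega_to_grid:.2f}")
```

On these numbers, Vega's logical cores amount to roughly four fifths of the Grid cores available to ATLAS.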

Due to close connections with the community of ATLAS physicists in Slovenia, some of whom were heavily involved in the design and commissioning of Vega, the ATLAS collaboration was one of the first users to be granted official time allocations. This was to the benefit of both the ATLAS collaboration, which could take advantage of a significant extra resource, and Vega, which was supplied with a steady, well-understood stream of jobs to assist in the commissioning phase.

Vega was almost continually occupied with ATLAS jobs from the moment it was turned on; the periods when fewer jobs were running were due either to other users on Vega or to a lack of ATLAS jobs to submit. This huge additional computing power – essentially doubling ATLAS’s available resources – was invaluable, allowing several large-scale data-processing campaigns to run in parallel. As a result, the ATLAS collaboration heads towards the restart of the LHC with a fully refreshed Run 2 data set and corresponding simulations, many of which have been significantly extended in statistics thanks to the additional resources provided by Vega.

It is a testament to the robustness of ATLAS’s distributed computing systems that they could be scaled up to a single site equivalent in size to the entire Grid. While Vega will eventually be given over to other science projects, some fraction will continue to be dedicated to ATLAS. What’s more, the successful experience shows that ATLAS members (and their data) are ready to jump on the next available HPC centre and fully exploit its potential.
