BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20241120T082408Z
LOCATION:HG D 1.2
DTSTART;TZID=Europe/Stockholm:20240603T123000
DTEND;TZID=Europe/Stockholm:20240603T130000
UID:submissions.pasc-conference.org_PASC24_sess148_msa289@linklings.com
SUMMARY:EMOI: CSCS Extensible Monitoring and Observability Infrastructure
DESCRIPTION:Minisymposium\n\nJean-Guillaume Piccinali and Jonathan Coles (
 ETH Zurich / CSCS)\n\nThe Swiss National Supercomputing Centre (CSCS) is e
 xpanding its computational capabilities with the Alps architecture, a Cray
  HPE EX system incorporating around 5000 GH200 modules, in addition to the
  pre-existing nodes. This expansion poses challenges in monitoring due to 
 hardware heterogeneity, including AMD Rome CPUs, Mi250x and Mi300 GPUs, Nv
 idia A100, and the Arm-based Grace-Hopper GH200. Implementing measures to 
 decrease power usage can help reduce the operational costs and environment
 al challenges associated with supercomputers. To address these challenges,
  CSCS has developed an Extensible Monitoring and Observability Infrastruct
 ure (EMOI), designed to manage the substantial data influx and provide ins
 ightful analysis of the infrastructure's behavior. EMOI integrates with Cr
 ay System Management (CSM) and Cray System Monitoring Application (SMA), e
 mphasizing a Kafka-centric approach for enhanced interoperability.  We wil
 l delve into the structure and quality of collected datasets, focusing on 
 power consumption data.  We hope that our experience will be beneficial no
 t only to CSCS but also to other HPE/Cray sites facing similar challenges 
 in supercomputing infrastructure management.\n\nDomain: Climate, Weather, 
 and Earth Sciences, Applied Social Sciences and Humanities, Engineering\n\
 nSession Chairs: Jan Frederik Engels (DKRZ) and William Sawyer (ETH Zurich
  / CSCS)
END:VEVENT
END:VCALENDAR
