SIGHPC Systems Professionals Workshop

HPCSYSPROS23

Sunday November 12th 2023

9:00am - 12:30pm

Rooms 503-504

Held in conjunction with

SC23 Logo

and in cooperation with

SIGHPC Logo

 Quick Information

Supercomputing systems present complex challenges to personnel who design, deploy and maintain these systems. Standing up these systems and keeping them running require novel solutions that are unique to high performance computing. The success of any supercomputing center depends on stable and reliable systems, and HPC Systems Professionals are crucial to that success.

The Eighth Annual HPC Systems Professionals Workshop will bring together systems administrators, systems architects, and systems analysts in order to share best practices, discuss cutting-edge technologies, and advance the state-of-the-practice for HPC systems. This CFP requests that participants submit either papers, slide presentations, or 5-minute Lightning Talk proposals. Additionally reproducible artifacts (code segments, test suites, configuration management templates) which can be made available to the community for use are welcome for submissions either as a standalone submission or in addition to any paper or talk submissions.

 Proceedings

https://github.com/HPCSYSPROS/Workshop23

 Schedule

All times in Mountain Time

StartEndDescription
9:00 AM9:12 AMOpening Remarks, John Blaas (NCAR)
9:12 AM9:19 AM Clushible: Tidal Wave-Like Configuration with Ansible, Jared Baker (NCAR), John Blaas (NCAR), Jenett Tillotson (NCAR)
9:19 AM9:26 AM Embracing Batch on Kubernetes, Jason Kincl (Red Hat Inc), Patrick Bruszewski (Red Hat Inc)
9:26 AM9:43 AM Self-service Monitoring of HPC and Openstack Jobs for Users, Simon Guilbault (Université Laval)
9:43 AM10:00 AM ICE 2.0: Restructuring and Growing an Instructional HPC Cluster, J. Eric Coulter (Georgia Institute of Technology), Michael D. Weiner (Georgia Institute of Technology), Aaron Jezghani (Georgia Institute of Technology), Matthew Guidry (Georgia Institute of Technology), Ruben Lara (Georgia Institute of Technology), Fang Liu (Georgia Institute of Technology), Allan Metts (Georgia Institute of Technology), Ronald Rahaman (Georgia Institute of Technology), Kenneth Suda (Georgia Institute of Technology), Peter Wan (Georgia Institute of Technology), Gregory Willcox (Georgia Institute of Technology), Deirdre Womack (Georgia Institute of Technology), Dan Zhou (Georgia Institute of Technology)
10:00 AM10:30 AM Morning Coffee Break
10:30 AM11:20 AM MareNostrum 5: Site Report from BSC, Sergi Girona (BSC)
11:20 AM11:27 AM Democratizing Remote HPC Storage Access, Adam Focht (Pennsylvania State University)
11:27 AM11:34 AM What a GReaT Scheduling Opportunity, Gary Skouson, (Pennsylvania State University)
11:34 AM11:41 AM Overcoming Active Directory Woes with Plain Text Caches and Replacing Passwords, Jason St John (Guardant Health), Alex Younts (Guardant Health)
11:41 AM11:58 AM Heterogeneous Syslog Analysis: There Is Hope, Andres Quan (LANL), Leah Howell (LANL), Hugh Greenberg (LANL)
11:58 AM12:15 PM Report on Adaptable Open-Source Disaster Recovery Solution for Multi-Petabyte Storage Systems, Honwai Leong (DDN)
12:15 PM12:30 PM Chapter Updates and Closing Remarks, John Blaas (NCAR)

 Topics of Interest

Here are some topics of interest for this group. Note that these are here to indicate direction, not to disallow other related topics.

  • Cluster, configuration, or software management
  • Cybersecurity and data protection
  • Performance tuning/Benchmarking
  • Resource manager and job scheduler configuration
  • Monitoring/Mean-time-to-failure/ROI/Resource utilization
  • HPC storage solutions
  • Composable infrastructure and containers
  • Elastic workloads or optimizations for workload types
  • Virtualization/Clouds
  • Web-based cluster front ends
  • Designing and troubleshooting HPC interconnects

Example paper ideas might be:

  • Best practices for job scheduler configuration
  • Advantages of cluster automation
  • Managing software on HPC clusters

 Calendar

EventDate
Workshop Submissions OpenMay 1, 2023
Workshop Submission CloseAugust 4, 2023
Reviews Sent and Resubmissions OpenAugust 25, 2023
Resubmission ClosedSeptember 8, 2023
Final Program NotificationsSeptember 15th, 2023

 Organizing Committee

PositionNameAffiliation
Workshop ChairJohn Blaas NCAR
Program ChairMatt Bidwell NREL
Organizing Committee
Jay BlairASRC Federal
Subhasis Dasgupta University fo California San Diego
Bill GuytonShell
Mike HartmanStanford University
Adam HoughShell
Kyle HutsonKansas State University
Gary SkousonPenn State University

 Program Committee

NameAffiliation
Jonathon AndersonCIQ
Matt BidwellNational Renewable Energy Laboratory
Adam HoughShell
Andy KeenMichigan State University
David KingNational Center for Supercomputing Applications
Honwai LeongDDN
Ben NickellIdaho National Laboratory
John RobertsArgonne National Laboratory
Matt SmithThe University of British Columbia

Publication Information

All accepted papers and artifacts will be published on GitHub and archived with a DOI in Zenodo. You can view the previous years presentations here HPCSYSPROS SC22 Workshop Proceedings

 Contact Information

If you need to contact us, send email to SIGHPC SYSPROS.

 Links