Quick Information
Supercomputing systems present complex challenges to personnel who design, deploy and maintain these systems. Standing up these systems and keeping them running require novel solutions that are unique to high performance computing. The success of any supercomputing center depends on stable and reliable systems, and HPC Systems Professionals are crucial to that success.
The Eighth Annual HPC Systems Professionals Workshop will bring together systems administrators, systems architects, and systems analysts in order to share best practices, discuss cutting-edge technologies, and advance the state-of-the-practice for HPC systems. This CFP requests that participants submit either papers, slide presentations, or 5-minute Lightning Talk proposals. Additionally reproducible artifacts (code segments, test suites, configuration management templates) which can be made available to the community for use are welcome for submissions either as a standalone submission or in addition to any paper or talk submissions.
Proceedings
https://github.com/HPCSYSPROS/Workshop23
Schedule
All times in Mountain Time
Start | End | Description |
---|---|---|
9:00 AM | 9:12 AM | Opening Remarks, John Blaas (NCAR) |
9:12 AM | 9:19 AM | Clushible: Tidal Wave-Like Configuration with Ansible, Jared Baker (NCAR), John Blaas (NCAR), Jenett Tillotson (NCAR) |
9:19 AM | 9:26 AM | Embracing Batch on Kubernetes, Jason Kincl (Red Hat Inc), Patrick Bruszewski (Red Hat Inc) |
9:26 AM | 9:43 AM | Self-service Monitoring of HPC and Openstack Jobs for Users, Simon Guilbault (Université Laval) |
9:43 AM | 10:00 AM | ICE 2.0: Restructuring and Growing an Instructional HPC Cluster, J. Eric Coulter (Georgia Institute of Technology), Michael D. Weiner (Georgia Institute of Technology), Aaron Jezghani (Georgia Institute of Technology), Matthew Guidry (Georgia Institute of Technology), Ruben Lara (Georgia Institute of Technology), Fang Liu (Georgia Institute of Technology), Allan Metts (Georgia Institute of Technology), Ronald Rahaman (Georgia Institute of Technology), Kenneth Suda (Georgia Institute of Technology), Peter Wan (Georgia Institute of Technology), Gregory Willcox (Georgia Institute of Technology), Deirdre Womack (Georgia Institute of Technology), Dan Zhou (Georgia Institute of Technology) |
10:00 AM | 10:30 AM | Morning Coffee Break |
10:30 AM | 11:20 AM | MareNostrum 5: Site Report from BSC, Sergi Girona (BSC) |
11:20 AM | 11:27 AM | Democratizing Remote HPC Storage Access, Adam Focht (Pennsylvania State University) |
11:27 AM | 11:34 AM | What a GReaT Scheduling Opportunity, Gary Skouson, (Pennsylvania State University) |
11:34 AM | 11:41 AM | Overcoming Active Directory Woes with Plain Text Caches and Replacing Passwords, Jason St John (Guardant Health), Alex Younts (Guardant Health) |
11:41 AM | 11:58 AM | Heterogeneous Syslog Analysis: There Is Hope, Andres Quan (LANL), Leah Howell (LANL), Hugh Greenberg (LANL) |
11:58 AM | 12:15 PM | Report on Adaptable Open-Source Disaster Recovery Solution for Multi-Petabyte Storage Systems, Honwai Leong (DDN) |
12:15 PM | 12:30 PM | Chapter Updates and Closing Remarks, John Blaas (NCAR) |
Topics of Interest
Here are some topics of interest for this group. Note that these are here to indicate direction, not to disallow other related topics.
- Cluster, configuration, or software management
- Cybersecurity and data protection
- Performance tuning/Benchmarking
- Resource manager and job scheduler configuration
- Monitoring/Mean-time-to-failure/ROI/Resource utilization
- HPC storage solutions
- Composable infrastructure and containers
- Elastic workloads or optimizations for workload types
- Virtualization/Clouds
- Web-based cluster front ends
- Designing and troubleshooting HPC interconnects
Example paper ideas might be:
- Best practices for job scheduler configuration
- Advantages of cluster automation
- Managing software on HPC clusters
Calendar
Event | Date |
---|---|
Workshop Submissions Open | May 1, 2023 |
Workshop Submission Close | August 4, 2023 |
Reviews Sent and Resubmissions Open | August 25, 2023 |
Resubmission Closed | September 8, 2023 |
Final Program Notifications | September 15th, 2023 |
Organizing Committee
Position | Name | Affiliation |
---|---|---|
Workshop Chair | John Blaas | NCAR |
Program Chair | Matt Bidwell | NREL |
Organizing Committee | ||
Jay Blair | ASRC Federal | |
Subhasis Dasgupta | University fo California San Diego | |
Bill Guyton | Shell | |
Mike Hartman | Stanford University | |
Adam Hough | Shell | |
Kyle Hutson | Kansas State University | |
Gary Skouson | Penn State University |
Program Committee
Name | Affiliation |
---|---|
Jonathon Anderson | CIQ |
Matt Bidwell | National Renewable Energy Laboratory |
Adam Hough | Shell |
Andy Keen | Michigan State University |
David King | National Center for Supercomputing Applications |
Honwai Leong | DDN |
Ben Nickell | Idaho National Laboratory |
John Roberts | Argonne National Laboratory |
Matt Smith | The University of British Columbia |
Publication Information
All accepted papers and artifacts will be published on GitHub and archived with a DOI in Zenodo. You can view the previous years presentations here HPCSYSPROS SC22 Workshop Proceedings
Contact Information
If you need to contact us, send email to SIGHPC SYSPROS.
Links
- SC HPC Syspros Mailing List - you should join!
- Join our SIGHPC SYSPROS Slack team
- Email us with any questions