Analyzing HPC Support Tickets: Experience and Recommendations

Alexandra DeLucia,Elisabeth Moore
DOI: https://doi.org/10.48550/arXiv.2010.04321
2020-10-09
Distributed, Parallel, and Cluster Computing
Abstract:High performance computing (HPC) user support teams are the first line of defense against large-scale problems, as they are often the first to learn of problems reported by users. Developing tools to better assist support teams in solving user problems and tracking issue trends is critical for maintaining system health. Our work examines the Los Alamos National Laboratory HPC Consult Team's user support ticketing system and develops proof of concept tools to automate tasks such as category assignment and similar ticket recommendation. We also generate new categories for reporting and discuss ideas to improve future ticketing systems.
What problem does this paper attempt to address?