HPC System Acceptance: Controlled Chaos

Loading...
Thumbnail Image
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.

Date

2016-11-14

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Over the last six decades, Los Alamos National Laboratory (LANL) has acquired, accepted, and integrated over 100 new HPC systems, from MANIAC in 1952 to Trinity in 2016. These systems range from small clusters to large supercomputers. Each type of system has its own challenges and having a well established and proven test, acceptance, and integration plan is valuable to the site and vendor to expedite the process. The topic of systems acceptance itself is quite broad, and for the purposes of this paper, it will be mostly focused on the system’s software and hardware components. Some discussion will be given to performance testing as well, but the purpose of this paper is to help HPC System Administrators with the acceptance process.

Description

Keywords

Systems Integration; Systems Testing; Systems Acceptance; Lessons Learned

Citation

Journal

DOI

Link(s) to data and video for this item

Relation

Rights

This content is released under the Creative Commons Attribution 3.0 Unported license (http://creativecommons.org/licenses/by/3.0/). This license includes the following terms: You are free to share - to copy, distribute and transmit the work and to remix - to adapt the work under the following conditions: attribution - you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.

Type

Article