HPC for Everybody

Friday, November 03, 2017 - 11:00 am12:30 pm

Cory Lueninghoener, Wireless Couch Labs

Abstract: 

High performance computing (HPC) spans a variety of topics from standard system administration to large scale system design and development, and the topic is relevant to more than just the scientific computing community. The successful HPC admin or admin team needs to have some understanding of topics such as system architecture, scalability, parallel filesystems, networking, job scheduling, and software development tools to provide good support to their customers. This minitutorial will be a survey of tools, techniques, and concepts that can be used to get a new HPC capability started or give a boost to existing HPC installations, whether the customers are running scientific research, product design, safety simulations, or any other type of HPC job. The tutorial will include an organized slide deck with more detail than will be covered in the 90 minutes that can be used as reference material describing currently existing tools, what they are useful for, and where to find more information about them.

Cory Lueninghoener, Wireless Couch Labs

Cory Lueninghoener is a large-scale systems guy who has helped design, build, and manage some of the largest scientific computing resources in the world. During his time working with HPC systems at Argonne National Laboratory and Los Alamos National Laboratory, he worked with HPC platforms ranging in size from 100,000 to 900,000 processors. He is especially interested in turning large-scale system research into practice, and has also worked on configuration management and system management tools in the past. Cory was co-chair of LISA 2015 and is active in the large scale system engineering community.

BibTeX
@conference {207205,
author = {Cory Lueninghoener},
title = {{HPC} for Everybody},
year = {2017},
address = {San Francisco, CA},
publisher = {USENIX Association},
month = oct
}
Who should attend: 

People new to managing HPC resources; people interested in providing HPC resources on dedicated hardware or in cloud solutions; people currently providing HPC resources who are interested in other tools that exist.

Take back to work: 

An overview of tools, techniques, and concepts related to building small- and large-scale HPC systems. This will include an organized slide deck with more information than will be covered in the 90 minutes that can be used as reference material describing currently existing tools, what they are useful for, and where to find more information about them.

Topics include: 

High performance computing tools, techniques, and concepts