Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware
Supercomputers and High Performance Computing (HPC) clusters are not so exotic as people imagine these days. They give companies the power of computation like no one server can give alone. They make new drugs and materials discoveries, universe modeling and AI training, crash simulations and market research possible – all thanks to HPC clusters. Building or renting a HPC cluster is not so difficult either as cloud providers can give you resources to build one cheap and performative enough to use yourself, so If you are or want to become HPC cluster Sysadmin or manager, this book is for you.

Supercomputers for Linux SysAdmins delves into the world of modern HPC cluster architecture, hardware, software and resources management using a Linux/UNIX based approach. The number of HPC clusters is growing with an estimated 30 billion by 2030 but there are not enough sysadmins to run and manage them, this book serves to bridge this gap to help more Sysadmins and managers to transition into the exiting world of HPCs.

This book helps those with a strong foundational knowledge in Linux, to deal with supercomputers and HPC clusters. We start with the basic principles of supercomputer management, fundamentals of Linux and UNIX, Shell Scripting and systemd and well as other open source tools and frameworks, taking you thorough the security, monitoring and hardware requirements for supercomputers and HPC clusters.

You Will Learn:



• How to plan new supercomputers
• The main principles and technologies used in supercomputers and HPC clusters
• How to set up the software environments on new supercomputers
• To set up supercomputer and HPC cluster resources and jobs management
• To manage accounts, resource sharing and many more.

Who is it for:

The main audience of this book are regular UNIX/Linux sysadmins and managers, who should deal with HPC clusters on-prem or in cloud and those who are interested in supercomputers and HPC clusters and how to utilize them in their projects and teams.

1147389741
Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware
Supercomputers and High Performance Computing (HPC) clusters are not so exotic as people imagine these days. They give companies the power of computation like no one server can give alone. They make new drugs and materials discoveries, universe modeling and AI training, crash simulations and market research possible – all thanks to HPC clusters. Building or renting a HPC cluster is not so difficult either as cloud providers can give you resources to build one cheap and performative enough to use yourself, so If you are or want to become HPC cluster Sysadmin or manager, this book is for you.

Supercomputers for Linux SysAdmins delves into the world of modern HPC cluster architecture, hardware, software and resources management using a Linux/UNIX based approach. The number of HPC clusters is growing with an estimated 30 billion by 2030 but there are not enough sysadmins to run and manage them, this book serves to bridge this gap to help more Sysadmins and managers to transition into the exiting world of HPCs.

This book helps those with a strong foundational knowledge in Linux, to deal with supercomputers and HPC clusters. We start with the basic principles of supercomputer management, fundamentals of Linux and UNIX, Shell Scripting and systemd and well as other open source tools and frameworks, taking you thorough the security, monitoring and hardware requirements for supercomputers and HPC clusters.

You Will Learn:



• How to plan new supercomputers
• The main principles and technologies used in supercomputers and HPC clusters
• How to set up the software environments on new supercomputers
• To set up supercomputer and HPC cluster resources and jobs management
• To manage accounts, resource sharing and many more.

Who is it for:

The main audience of this book are regular UNIX/Linux sysadmins and managers, who should deal with HPC clusters on-prem or in cloud and those who are interested in supercomputers and HPC clusters and how to utilize them in their projects and teams.

59.99 Pre Order
Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware

Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware

by Sergey Zhumatiy
Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware

Supercomputers for Linux SysAdmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware

by Sergey Zhumatiy

Paperback(First Edition)

$59.99 
  • SHIP THIS ITEM
    Available for Pre-Order. This item will be released on December 11, 2025

Related collections and offers


Overview

Supercomputers and High Performance Computing (HPC) clusters are not so exotic as people imagine these days. They give companies the power of computation like no one server can give alone. They make new drugs and materials discoveries, universe modeling and AI training, crash simulations and market research possible – all thanks to HPC clusters. Building or renting a HPC cluster is not so difficult either as cloud providers can give you resources to build one cheap and performative enough to use yourself, so If you are or want to become HPC cluster Sysadmin or manager, this book is for you.

Supercomputers for Linux SysAdmins delves into the world of modern HPC cluster architecture, hardware, software and resources management using a Linux/UNIX based approach. The number of HPC clusters is growing with an estimated 30 billion by 2030 but there are not enough sysadmins to run and manage them, this book serves to bridge this gap to help more Sysadmins and managers to transition into the exiting world of HPCs.

This book helps those with a strong foundational knowledge in Linux, to deal with supercomputers and HPC clusters. We start with the basic principles of supercomputer management, fundamentals of Linux and UNIX, Shell Scripting and systemd and well as other open source tools and frameworks, taking you thorough the security, monitoring and hardware requirements for supercomputers and HPC clusters.

You Will Learn:



• How to plan new supercomputers
• The main principles and technologies used in supercomputers and HPC clusters
• How to set up the software environments on new supercomputers
• To set up supercomputer and HPC cluster resources and jobs management
• To manage accounts, resource sharing and many more.

Who is it for:

The main audience of this book are regular UNIX/Linux sysadmins and managers, who should deal with HPC clusters on-prem or in cloud and those who are interested in supercomputers and HPC clusters and how to utilize them in their projects and teams.


Product Details

ISBN-13: 9798868815997
Publisher: Apress
Publication date: 12/11/2025
Edition description: First Edition
Pages: 450
Product dimensions: 7.01(w) x 10.00(h) x (d)

About the Author

Sergey Zhumatiy has been managing supercomputers since 1999 starting out building and managing HPC clusters at Moscow State University and holds a PhD in computer science. Several supercomputers under his supervising, like Chebyshev, Lomonosov, Lomonosov-2, achieved top rankings in the top500 supercomputers list, and dominated the Russian top50 supercomputers list. Now he works as an HPC Architect and SysAdmin at NVIDIA.

Table of Contents

1: Introduction.- Chapter 2: What is "super"?.- Chapter 3: How to build and start it?.- Chapter 4: Supercomputer Hardware.- Chapter 5: InfiniBand.- 6: How a supercomputer does the job.- 7: UNIX and Linux – the basics.- Chapter 8: UNIX and Linux – working techniques.- Chapter 9: Network File Systems.- Chapter 10: Remote Management.- 11: Users – Accounting, Management.- 12: Users – quotas, access rights.- 13: Job management systems.- 14: OpenPBS, Torque.- 15: Slurm.- 16: Compilers and Environments- for Parallel Technologies.- 17: Parallel Computing Support Libraries.- 18: Booting and init.- 19: Node Setup, Software Installation.- 20: Out-of-the-Box Stacks and Deployment Systems.- 21: Cluster Management Systems – xCAT and Others.- 22: Communicating with Users.- 23: One-two-three instructions.- 24: Shell Scripts – basics and common mistakes.- 25: Systemd – A Short Course.- 26: Conclusion.

From the B&N Reads Blog

Customer Reviews