Blog

Fragments of code, concepts, and curiosity.

SLURM in the Wild: A Practical Guide for Academic Labs

A complete guide from basic concepts to production deployment, covering multi-node setup, GPU scheduling, advanced monitoring, and the hard-learned lessons from scaling a research lab from 2 to 30+ users across heterogeneous hardware.

50 min read
Infrastructure
GPU Computing