Computing at Scale

Jobs and Scheduling

Overview

Teaching: 15 min
Exercises: 0 min
Questions
  • How is a large-scale compute system organized?

  • What is a job?

define a job (input, calculation, output)

define structure of large scale system (head node + worker nodes)

describe the role of a scheduler

describe the very basics of how schedulers work (submit file, queue of tasks, allocation of resources)

On our system

Our high throughput system uses HTCondor as the batch scheduler.

more stuff about HTC

Key Points