Overview
Teaching: 15 min
Exercises: 0 minQuestions
How is a large-scale compute system organized?
What is a job?
define a job (input, calculation, output)
define structure of large scale system (head node + worker nodes)
describe the role of a scheduler
describe the very basics of how schedulers work (submit file, queue of tasks, allocation of resources)
Our high throughput system uses HTCondor
as the batch scheduler.
more stuff about HTC
Key Points
A job consists of a computational task, usually defined by input data and a piece of software, producing output data.
Most large scale systems consist of a head node for logging in and submitting jobs, where jobs are performed on worker nodes.
A batch scheduler controls where and when jobs run on the worker nodes.