11.2 Cluster Using Rules

1. HTCondor Job System

INFO

At present, parallel (such as multithreading and MPI) and GPU jobs are not supported in IHEP HTCondor cluster. Please refer to the Slurm Job System in case of any requirements for parallel jobs.

In HTCondor, jobs from different experiments are partitioned by user groups, which is similar to the job queue. User's primary OS group is used by default with HepJob. However, we can appoint a different one via the parameter of hep_sub.

	Limitation	Description
Jobs Number	10,000 Jobs/User	including the queueing, running and hold jobs, but without finished jobs.
Default Wall Time	100 hours	The recommended job walltime is between 0.5 to 4 hours. Please config your jobs in reasonable grainsize.
Long Wall Time	30 days	Jobs with long walltime limitation have a lower priority. Moreover, there are only a few nodes that accept long jobs. So, be cautious to submit long jobs which might be waiting for a long time in the queue.
Default Memory	according to the physical memory of computing nodes	The memory of IHEP cluster is enough for most user jobs. Generally it's unnecessary to concern about it.
Big Memory	can be specified in HepJob	There are only a few big memory nodes, and some special service jobs have higher priority to be scheduled on these nodes. So users' big memory jobs are not guaranteed.
Disk Space		Please refer to the contents for each experiment.

2. Slurm Job System

3. Hadoop Job System

results matching ""

No results matching ""