User Tools

Site Tools


tutorial:torque

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
tutorial:torque [2023/10/25 13:48]
mjm519 [Submitting a Small or Large Memory Job]
tutorial:torque [2024/01/17 12:55]
mjm519 [Queues]
Line 31: Line 31:
 | polyp1--polyp15  | 16 AMD Opteron(tm) Processor 6128 | 32 GB | --- | | polyp1--polyp15  | 16 AMD Opteron(tm) Processor 6128 | 32 GB | --- |
 | polyp30 | 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz | 128 GB | 2x K80 (4GPUs) | | polyp30 | 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz | 128 GB | 2x K80 (4GPUs) |
 +
 +
 +Configured Resources as provided in the Maui scheduler. This is pulled from Torque: 
 +                        PROCS: 16  
 +                        MEM: 31G  
 +                        SWAP: 63G  
  
 ===== Submitting Jobs ===== ===== Submitting Jobs =====
Line 124: Line 130:
  
 You can see limits using this command ''qstat -f -Q'' You can see limits using this command ''qstat -f -Q''
-^ Queue ^ Wall Time ^ +^ Queue       ^ Wall Time  
-batch  | 01:00:00  +urgent      | unlimited  
-| short  | 02:00:00  +| batch       | 01:00:00   
-| medium | 04:00:00  +| short       | 02:00:00   
-| long  | 72:00:00  +| medium      | 04:00:00   
-| very long  | 240:00:00 |+| long        | 72:00:00   
 +| very long   | 240:00:00  | 
 +| background  | unlimited 
 +| AMPL        |            | 
 +| MOSEK                  |
  
 +
 +Notes:
 +  * Urgent queue has no limits and jobs have a higher priority over all other jobs in the queues. Please be respectful of others if using this queue to complete time sensitive or critical jobs.
 +  * background queue has no limits and jobs have a lower priority over all other jobs in the queues.
 ===== Examples ===== ===== Examples =====
  
Line 145: Line 159:
 <code bash large.pbs>qsub  -l pmem=20gb  test.pbs</code> <code bash large.pbs>qsub  -l pmem=20gb  test.pbs</code>
  
-To see what resources have been assigned by the batch queuing system run this command:+To see what resources have been assigned by the batch queuing system run the ulimit command (bash) or limit comamnd: 
 +<code bash pbs job submission command>qsub -I -l nodes=1:ppn=1 -l pmem=30GB:vmem=4GB -q short -N test -e TEST.err -o TEST.out -w e</code>
 <code bash ulimit>user@polyp13:~$ ulimit -a <code bash ulimit>user@polyp13:~$ ulimit -a
 core file size          (blocks, -c) 0 core file size          (blocks, -c) 0
Line 163: Line 178:
 virtual memory          (kbytes, -v) unlimited virtual memory          (kbytes, -v) unlimited
 file locks                      (-x) unlimited</code> file locks                      (-x) unlimited</code>
 +
 +**[[https://www.geeksforgeeks.org/ulimit-soft-limits-and-hard-limits-in-linux|For more information on the ulimit command review this link.]]**
 ==== Running MATLAB ==== ==== Running MATLAB ====
  
Line 339: Line 356:
 </code> </code>
 and then schedule your jobs with Torque to perform experiments on GPU 1. and then schedule your jobs with Torque to perform experiments on GPU 1.
 +
 +
 +====== MOAB Scheduler ======
 +PBS Torque is used to schedule and run jobs on our cluster. Two PBS processes are required to run jobs. On the PBS server, the pbs_server process runs to accept your job and add it to the queue. It will also dispatch the job to the nodes to run under the pbs_mom process.
 +
 +
 +==== Useful MOAB Commands ====
 +1. [[https://docs.adaptivecomputing.com/maui/commands/showq.php|showq]] - Displays information about active, eligible, blocked, and/or recently completed jobs.
 +
 +2. [[https://docs.adaptivecomputing.com/maui/commands/showstart.php|showstart]] - Displays the estimated start time of a job based a number of analysis types.
 +
 +3. [[https://docs.adaptivecomputing.com/maui/commands/checkjob.php|checkjob]] - Allows end users to view the status of their own jobs.
 +
 +====Useful External Resources====
 +[[https://www.icer.msu.edu/sites/default/files/files/understand_job_scheduler_v2.pdf|MSU -Understand job scheduler and resource manager]] - Describes the batch queuing system and has some useful diagrams explaining the interrelationship between the scheduler and the server.
 +
 +[[https://wvuhpc.github.io/2019-Intro-HPC/07-jobs/index.html|WVU - Job Submission (Torque and Moab)]] - Lists frequently used commands for Torque and Moab. Also includes information on Prologue and Epilogue scripts.
 +
 +[[http://docs.adaptivecomputing.com/mwm/7-1-3/help.htm#pbsintegration.html|Moab-TORQUE/PBS Integration Guide]] - Guide for Administrators and integrators on the deployment and integration of PBS Torque and Moab into a computer system
 +
 +[[https://silas.net.br/tech/hpc/torque.html|Torque Notes]] - Information about the processes involved in using torque and debugging information.
 +
 +
tutorial/torque.txt · Last modified: 2024/02/28 13:12 by mjm519