tutorial:torque

Differences

This shows you the differences between two versions of the page.

tutorial:torque [2021/06/17 08:25]
mjm519 [Running MATLAB]
tutorial:torque [2023/12/08 13:16]
mjm519 [Hardware]
Line 31: Line 31:
 | polyp1--polyp15  | 16 AMD Opteron(tm) Processor 6128 | 32 GB | --- | | polyp1--polyp15  | 16 AMD Opteron(tm) Processor 6128 | 32 GB | --- |
 | polyp30 | 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz | 128 GB | 2x K80 (4GPUs) | | polyp30 | 24 Intel(R) Xeon(R) CPU E5-2620 v3 @ 2.40GHz | 128 GB | 2x K80 (4GPUs) |
 +
 +
 +Configured resources as reported by the Maui scheduler, which pulls these values from Torque: 
 +                        PROCS: 16  
 +                        MEM: 31G  
 +                        SWAP: 63G  
  
 ===== Submitting Jobs ===== ===== Submitting Jobs =====
Line 44: Line 50:
 #PBS -o /home/mat614/TEST.out #PBS -o /home/mat614/TEST.out
 #PBS -l nodes=1:ppn=4  #PBS -l nodes=1:ppn=4 
 +#PBS -l pmem=2GB:vmem=1GB
 #PBS -q batch #PBS -q batch
  
Line 123: Line 130:
  
 You can see limits using this command ''qstat -f -Q'' You can see limits using this command ''qstat -f -Q''
-^ Queue ^ Wall Time ^ +^ Queue      ^ Wall Time  ^  
-| batch  | 01:00:00  +| batch      | 01:00:00   |  
-| short  | 02:00:00  +| short      | 02:00:00   |  
-| medium | 04:00:00  +| medium     | 04:00:00   |  
-| long  | 72:00:00  +| long       | 72:00:00   |  
-| very long  | 240:00:00 |+| very long  | 240:00:00  |  
 +| AMPL       |            |  
 +| MOSEK      |            |  
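
As a rough illustration of how the wall-time table can guide queue choice, the sketch below picks the first queue whose limit covers a requested run time. The helper names ''to_seconds'' and ''pick_queue'' are hypothetical and not part of Torque. The queue names and limits are copied from the table as written, so the real queue identifiers (for example the one shown as "very long") should be confirmed with ''qstat -f -Q''.

```shell
# Hypothetical helpers, not part of Torque: choose a queue from the table above.

# Convert HH:MM:SS to seconds (awk reads "08" as decimal 8, not octal).
to_seconds() {
    echo "$1" | awk -F: '{ print $1 * 3600 + $2 * 60 + $3 }'
}

# Print the first queue, shortest limit first, whose wall time covers the request.
pick_queue() {
    need=$(to_seconds "$1")
    # name=limit pairs copied from the table; the names are assumptions
    for entry in "batch=01:00:00" "short=02:00:00" "medium=04:00:00" \
                 "long=72:00:00" "very long=240:00:00"; do
        name=${entry%%=*}
        limit=${entry#*=}
        if [ "$need" -le "$(to_seconds "$limit")" ]; then
            echo "$name"
            return 0
        fi
    done
    echo "no queue allows $1" >&2
    return 1
}
```

For a job that needs about three hours, ''pick_queue 03:00:00'' prints ''medium'', which could then be passed to ''qsub -q''. The AMPL and MOSEK queues are left out because the table lists no wall time for them.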
  
 ===== Examples ===== ===== Examples =====
Line 144: Line 154:
 <code bash large.pbs>qsub  -l pmem=20gb  test.pbs</code> <code bash large.pbs>qsub  -l pmem=20gb  test.pbs</code>
  
 +To see what resources the batch queuing system has assigned, run the ''ulimit'' command (bash) or the ''limit'' command (csh/tcsh) from within the job:
 +<code bash pbs job submission command>qsub -I -l nodes=1:ppn=1 -l pmem=30GB:vmem=4GB -q short -N test -e TEST.err -o TEST.out -w e</code>
 +<code bash ulimit>user@polyp13:~$ ulimit -a
 +core file size          (blocks, -c) 0
 +data seg size           (kbytes, -d) 31457280
 +scheduling priority             (-e) 0
 +file size               (blocks, -f) unlimited
 +pending signals                 (-i) 128344
 +max locked memory       (kbytes, -l) unlimited
 +max memory size         (kbytes, -m) 31457280
 +open files                      (-n) 65536
 +pipe size            (512 bytes, -p) 8
 +POSIX message queues     (bytes, -q) 819200
 +real-time priority              (-r) 0
 +stack size              (kbytes, -s) unlimited
 +cpu time               (seconds, -t) unlimited
 +max user processes              (-u) 128344
 +virtual memory          (kbytes, -v) unlimited
 +file locks                      (-x) unlimited</code>
 +
 +**[[https://www.geeksforgeeks.org/ulimit-soft-limits-and-hard-limits-in-linux|For more information on the ulimit command review this link.]]**
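
The ''ulimit'' output above reports memory limits in kbytes, which is awkward to compare with the ''pmem'' value given to ''qsub''. The following is a minimal sketch of that conversion. The helper name ''kb_to_gb'' is hypothetical, and it assumes 1 GB = 1024 * 1024 kbytes.

```shell
# Hypothetical helper: convert a ulimit value in kbytes to whole GB.
kb_to_gb() {
    case "$1" in
        unlimited) echo "unlimited" ;;            # pass "unlimited" through as-is
        *) echo $(( $1 / 1024 / 1024 )) ;;        # kbytes -> MB -> GB
    esac
}

# The "max memory size" value from the listing above
kb_to_gb 31457280
```

For 31457280 kbytes this gives 30, which matches the ''pmem=30GB'' request in the submission command. Inside a job, ''kb_to_gb "$(ulimit -m)"'' would report the live limit.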
 ==== Running MATLAB ==== ==== Running MATLAB ====
  
Line 320: Line 351:
 </code> </code>
 and then schedule your jobs with Torque to perform experiments on GPU 1. and then schedule your jobs with Torque to perform experiments on GPU 1.
 +
 +
 +====== MOAB Scheduler ======
 +PBS Torque is used to schedule and run jobs on our cluster. Two PBS processes are required to run jobs. On the PBS server, the pbs_server process accepts your job and adds it to the queue. It also dispatches the job to the compute nodes, where it runs under the pbs_mom process.
 +
 +
 +==== Useful MOAB Commands ====
 +1. [[https://docs.adaptivecomputing.com/maui/commands/showq.php|showq]] - Displays information about active, eligible, blocked, and/or recently completed jobs.
 +
 +2. [[https://docs.adaptivecomputing.com/maui/commands/showstart.php|showstart]] - Displays the estimated start time of a job based on a number of analysis types.
 +
 +3. [[https://docs.adaptivecomputing.com/maui/commands/checkjob.php|checkjob]] - Allows end users to view the status of their own jobs.
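
The three commands above are often run together when checking on a submitted job. The sketch below wraps them in a hypothetical function, ''inspect_job''. The optional second argument is an assumption added for illustration: passing ''echo'' prints the commands instead of running them, which is useful on a machine without the scheduler tools installed. The job ID ''12345'' is a placeholder.

```shell
# Hypothetical wrapper around the three scheduler commands listed above.
inspect_job() {
    jobid=$1
    run=${2:-}                 # pass "echo" to print the commands instead of running them
    $run showq                 # overview of active, eligible, and blocked jobs
    $run showstart "$jobid"    # estimated start time for this job
    $run checkjob "$jobid"     # detailed status of this job
}

# Dry run with a placeholder job ID
inspect_job 12345 echo
```

On the cluster, ''inspect_job <jobid>'' with the ID returned by ''qsub'' runs the commands for real.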
 +
 +====Useful External Resources====
 +[[https://www.icer.msu.edu/sites/default/files/files/understand_job_scheduler_v2.pdf|MSU - Understand job scheduler and resource manager]] - Describes the batch queuing system and has some useful diagrams explaining the interrelationship between the scheduler and the server.
 +
 +[[https://wvuhpc.github.io/2019-Intro-HPC/07-jobs/index.html|WVU - Job Submission (Torque and Moab)]] - Lists frequently used commands for Torque and Moab. Also includes information on Prologue and Epilogue scripts.
 +
 +[[http://docs.adaptivecomputing.com/mwm/7-1-3/help.htm#pbsintegration.html|Moab-TORQUE/PBS Integration Guide]] - Guide for administrators and integrators on deploying and integrating PBS Torque and Moab.
 +
 +[[https://silas.net.br/tech/hpc/torque.html|Torque Notes]] - Information about the processes involved in using Torque, plus debugging information.
 +
 +
tutorial/torque.txt · Last modified: 2024/02/28 13:12 by mjm519