NERSCPowering Scientific Discovery Since 1974

Hopper batch walltime increase and new xfer queue

June 29, 2011 by Helen He (0 Comments)

There are two batch queue configuration changes on Hopper:

1) User jobs using fewer than 4,096 nodes can now run for up to 36 hrs.  In particular, the max wall time limits for reg_small (1-683 nodes), reg_med (684-2,048 nodes), and reg_big (2,049-4,096 nodes) jobs have been increased from 24 to 36 hrs.

2) A new "xfer" queue has been introduced. Users can use the xfer job to pre-stage input files for a large simulation and archive output files afterwards without the penalty of being charged for the large amount of compute nodes during the file transfers.  

The xfer queue on Hopper is being implemented on a local batch server (hopper06), which is separate from the main batch server on the main Hopper XE.  So there are some special notes about the xfer jobs, for example: their job ids are in a different format (such as 6000345.hopper06), and since xfer jobs do not use any compute nodes, the torque keyword "mppwidth" should not be specified in an xfer batch script.

The details of the batch queue configuration can be found at:
http://www.nersc.gov/users/computational-systems/hopper/running-jobs/queues-and-policies/

And the details of submitting an "xfer" job can be found at:
http://www.nersc.gov/users/computational-systems/hopper/running-jobs/batch-jobs/#xfer


Post your comment

You cannot post comments until you have logged in. Login Here.

Comments

No one has commented on this page yet.

RSS feed for comments on this page | RSS feed for all comments