NERSCPowering Scientific Discovery Since 1974

Open Issues

Hopper /scratch file system slow

August 6, 2014 | 0 Comments

Symptom:

Users have reported scripts hung when involving file copies from/to the /scratch file system, or jobs running in /scratch are slower than before since late last week. 

0 comments | Read the full post

Resolved: "error while loading shared libraries: libalpslli.so.0" with serial codes on login nodes

December 13, 2013 by Helen He | 0 Comments

Symptom:

Dynamic executables built with compiler wrappers running directly on the external login nodes are getting the following error message:

0 comments | Read the full post

Resolved: Running jobs error: "inet_arp_address_lookup"

September 22, 2013 by Helen He | 0 Comments

Symptom:

After the Hopper August 14 maintenance, users reporting get the error message similar as follows occassionaly:

0 comments | Read the full post

Resolved: qstat -a does not show correct hours in elpsed time for running jobs

July 12, 2013 by Helen He | 0 Comments

Sympton:

After the Torque/Moab upgrade on 6/19, the elapsed run time display from "qstat -a" does not show correct values for hours field for running jobs.  The minutes and seconds numbers are correct.  A job uses less than one hour displays correct elapsed time such as 50:03, but once a job's running time is over 59 min, the hours field is always displayed as 0.

0 comments | Read the full post

Unable to allocate hugepages in running jobs

January 14, 2013 by Helen He | 0 Comments

 

Symptom

User job sometimes get an error message similar to the following, usually at the start of a batch job, causing the job to abort:

0 comments | Read the full post

1 2 3