Project File System
The project file system is a global file system available on all NERSC computational systems. It allows sharing of data between users, systems, and/or (via science gateways) the "outside world". Default project directories are provided to every MPP project. Additional project directories can be provided upon request.
No, files in project directories are not subject to purging.
Daily backups are performed for project directories whose quotas are equal to or less than 5 TB; such backups are kept for 90 days.
If you require a restoration of lost data, please contact NERSC Consulting with pathnames and timestamps of the missing data. Such restore requests may take a few days to complete, restore time being dependent on the amount of data being retrieved. No backups are performed on project directories whose quotas are greater than 5 TB. IMPORTANT: All NERSC users should back up important files to HPSS on a regular basis. Ultimately, it is your responsiblity to protect from data loss.
There must be a project directory administrator associated with each project directory. This user must have a NIM role of PI, PI Proxy, or Project Manager.
Access control for project directories is based on Unix groups. The project directory administrator is responsible for managing membership in the associated group.
Default project directory quotas are 1 TB and 1,000,000 inodes. If your directory needs more than that, fill out the Disk Quota Increase Form.
To check your current usage and quota in a project directory, use the prjquota command, specifying the name of the project directory. For example, if you have access to a project directory named "bigsci":
% prjquota bigsci
------ Space (GB) ------- ----------- Inode -----------
Project Usage Quota InDoubt Usage Quota InDoubt
-------- ------- ------- ------- ------- ------- -------
bigsci 1455 3072 0 307423 500000 20
In the above example, the project directory "bigsci" has used about 1.5 TB of its 3 TB block quota, and about 307000 inodes out of its 500000 inode quota.
The system has a peak aggregate bandwidth of 40 GB/sec bandwidth for streaming I/O, although actual performance for user applications will depend on a variety of factors. Because NGF is a distributed network filesystem, performance typically will be less than that of filesystems that are local to a specific compute platform. This is usually an issue only for applications whose overall performance is sensitive to I/O performance.
Project directories are created in /project/projectdirs. The name of a "default" project directory is the same as its associated MPP repository. There is also a Unix group with the same name; all members of the repostiory are also members of the group. Access to the project directory is controlled by membership in this group.
Occasionally there are cases where the above model is too limiting. For example, some large projects might want a project directory to be accessible by members of multiple repositories. Also, some long-term projects outlive the specific repositories that constitute them. In these cases, a project directory administrator may request the creation of a "designer" project directory with a specific name. This will result in the creation of a new Unix group with that name, consisting soley of the project directory administrator, followed by the creation of the project directory itself. The project directory administrator must then use NIM to add users to the newly-created Unix group (this is a very simple operation). Only these users will be able to access the project directory. To request a designer project directory, please use the Project Directory Request Form.
Project directories will remain in existence as long as the owning project is active. Projects typically "end" at the end of a NERSC Allocation Year. This happens when the PI chooses not to renew the project, or DOE chooses not to provide an allocation for a renewal request. In either case, the following steps will occur following the termination of the project:
- Day 0
Users of the affected project directory will be notified that the project directory will be archived, and then removed from the file system, in 90 days.
- Day 30
The project directory will become read-only. Users will still be able to retrieve data from the project directory, but will not be able to store data into the directory.
- Day 60
The full pathname to the project directory will be modified. This will most likely cause automated scripts to fail.
- Day 90
User access to the directory will be terminated. The directory will then be archived in HPSS, under ownership of the PI, and subsequently removed from the file system.