Proposed Questions for the AY 2008 NERSC Allocations Request Form

Proposed changes are in blue. Questions and instructions are in green.

Project Overview

  1. Principal Investigator
  2. Other Authorized Preparers
  3. Senior Investigators
  4. Project Title
  5. Project Name
  6. Project Class (DOE Base, SciDAC, Startup, Guests, Overhead)
  7. Sponsoring Site
  8. Science Category
  9. DOE Office and Program

DOE Funding

  1. Is this project funded by the DOE Office of Science?
  2. Does this project make use of national security information? (yes | no)

    Does this project collect and store Social Security Numbers, Personally Identifiable Health Information (names or other identifiers matched with health data), Driver's License Numbers, or Financial Account Numbers? (yes | no)

    If this project is given a NERSC award ... (keep these two questions)

Resources Requested

  1. Computational Resources Requested
                                           Hours       Hours
    Center   Resource  Alloc Type   Repo   Used 2007   Requested 2008  
    
    
  2. Mass Storage Resources Requested
                                           SRUs        SRUs
    Center   Resource  Alloc Type   Repo   Used 2007   Requested 2008 
    
    NERSC    HPSS
    
  3. Justification for Resources Requested

PDSF

  1. PDSF Repos

    [existing PDSF repos renewed here]

    Do you need a new PDSF project account? Y/N
    If yes, please enter your experiment name (3-8 characters):

Project Description

Can we allow this whole section to be submitted as an upload?
  1. Project Description
    Describe the research you are performing or will perform with NERSC resources. Your responses may be included in the NERSC Annual Report. Please update for the next year. (4000 character maximum.)
  2. Accomplishments
    Describe accomplishments made on this project in the previous year. State how the use of NERSC resources allowed you to meet your research objectives. Your response may be included in the NERSC Annual Report. (4000 character maximum.)
  3. Relevant Publications: enter in the order authors, title, journal

    [NOTE: can we collect publications in a systematic way so that it is easy to produce the publications list?]

Code Descriptions

19.1 Code and Application Descriptions
You should complete a copy of this page for each of your project's most frequently used codes and applications.

19.2 Code and Application Performance
Provide code performance data for typical processor counts used in production this past year. For machines with more than one processor per node, enter # of procs as the number of nodes used times the number of processors per node (for example, 16 nodes with 8 processors each is 128 procs).

code              # of                 Memory    how info was
name   machine    procs    Gflop/sec   (Gbytes)  collected / comments

____   _______    _____    ______      _______   ____________________
____   _______    _____    ______      _______   ____________________

Data / Analytics / Networking

  1. Storage and Data Movement
    Describe your storage and data movement requirements:
  2. Data and I/O
    • How much data does your project typically output in the course of the year?
    • What are your data characteristics?
      Please characterize the resolution and complexity of data produced by your simulation or collected and stored by your experiment. Describe characteristics such as:
      • Number of grid cells or zones
      • Typical number of variables per grid cell or data point
      • Grid dimensionality: 2D, 3D, 4D, higher
      • Grid meshing techniques used:
          __ no mesh
          __ structured rectilinear mesh
          __ structured curvilinear mesh
          __ unstructured mesh
          __ hierarchical / adaptive mesh refinement
          __ hybrid
        
        [NOTE: implement above as a checklist.]
    • How do you share data among your team members and how might you like to do it in the future if the present approach does not meet your needs? Does your project provide data to the public?
    • Do you do data subsetting? Do you (or would you like to) make use of any database systems or other similar technology for performing subset extraction?
    • How do you perform I/O? Do you write concurrently from all processors to shared files (parallel I/O) or one file per processor or node? Have you experienced problems or constraints due to I/O?
    • What file formats / data libraries do you use?
      • POSIX
      • F77 unformatted binary
      • MPI-IO
      • Parallel NetCDF
      • NetCDF
      • HDF5
      • H5Part
      • HDF4
      • FITS
      • NCARU
      • Global Arrays
      • community file format, describe: ______________________________
      • custom file format, describe: ______________________________
      • database format, describe: ______________________________
      • other, describe: ______________________________
      [NOTE: implement above as a checklist.]
    • REMOVE THIS QUESTION -- If you have a data intensive computing project (defined as a project which needs access to more than 500 Gigabytes of storage) please provide an overview of how your project produces, processes, manages and transfers data.
  3. Analytics Applications
    Which applications/libraries do you use or intend to use for data analysis and visualization at NERSC?
    • AVS
    • AVS/Express
    • EnSight
    • ferret
    • Garlic
    • GIMP
    • Gnuplot
    • Grace
    • GrADS
    • Gsharp
    • IDL
    • ImageMagick
    • Maple
    • Mathematica
    • MATLAB
    • MySQL
    • NCAR
    • ParaView
    • Perl
    • pgplot
    • PostgreSQL
    • Python
    • R
    • Rasmol
    • ROOT
    • Tcl/Tk
    • VisIt
    • VMD
    • Other, describe: __________________________________
    [NOTE: implement above as a checklist.]
  4. Networking
    If current network bandwidth between your site and NERSC does not meet your needs, please explain.

Other Info / Finish

  1. Other HPC Support
    List High Performance Computing support from outside NERSC for this project you currently have or expect in the next year.
  2. Additional Information
    Enter any additional information you would like the review committees to consider in support of this request.
  3. Feedback and Project Requirements
    Use this space to provide feedback on the allocations process or make other comments. If your project has special requirements that you would like NERSC to consider, describe them here.
  4. Current Request Status

algorithms checklist

Structured Mesh     Direct Solver
Unstructured Mesh   Dense Linear Algebra
Adaptive Mesh       Sparse Linear Algebra
No Mesh             Heavy use of integer/logical arithmetic
Iterative Solver    Monte Carlo
Particle Methods    FFT
Multigrid           Other: _____________________
Replace above with:
Particle methods
  __ Particle-in-Cell (PIC)
  __ Particle-in-Mesh
  __ N-Body
  __ Fast Multipole
  __ Barnes-Hut
  __ Other: _________________________________

Linear equation solvers
   __ dense solver
   __ sparse direct solver
   __ iterative method
   __ multigrid/multilevel

FFT
   __ Local (no inter-node communication is required)
   __ Global (spans multiple processors and/or nodes)

Eigenvalue solvers
   __ dense eigensolver                  
   __ sparse eigensolver                  

Other methods
   __ Monte Carlo
   __ Genomic search
   __ Nonlinear equations solver 
   __ Ordinary differential equation solver
   __ Numerical optimization
   __ Quadrature rules
   __ Interpolation, spline approximation 
   __ Use of extended precision floating-point arithmetic
   __ Heavy use of integer/logical arithmetic
   __ Other algorithms: ________________________________



languages checklist

Assembler   C           C++         Fortran90
Fortran95   Java        UPC         Python
MPI         LAPI        OpenMP      Other _____________

numerical and performance libraries checklist

Aztec       Chombo      DIMEMAS     ESSL        FFTW        HPMLIB      HYPRE
IMSL        LAPACK      MASS        METIS       NAG         Overture    PAPI
PARAVER     PARPACK     PESSL       PETSc       PWSMP       ScaLAPACK   SLEPc
SUNDIALS    SuperLU     TAO         TAU         Trilinos    VAMPIR      WSMP
Other: ______________________

limits to performance checklist

Algorithm scaling           All-to-all performance (large messages)
Load imbalance              All-reduce performance (small messages)
Small problem size          Broadcast performance
Point-to-point latency      Barrier performance
Point-to-point bandwidth    MPI I/O
Available disk space        POSIX I/O
Bug                         External network
Unknown                     Single node memory performance
Other: ________________