The latest Energy Research Supercomputer Users Group (ERSUG) meeting was held at Pacific Northwest Laboratory in Richland, Washington, on January 12 - 13, 1995. Some of the talks are summarized below.
Tom discussed the formation of the Distributed Computing Coordination Committee (DCCC) under the auspices of the ESnet Steering Committee. He stressed the importance of this issue, forecasting that most cycles used by ER principal investigators would be done in a Distributed Computing Environment within a very few years. The DCCC effort needs the input and guidance from ER users, and ERSUG is the only organization that represents them. He warned that if ERSUG was not proactive here, ERSUG users would have to accept what others decide to provide for them.
Ultimately, NERSC seeks to integrate the remote user's local environment with the centralized NERSC environment. This could be viewed as a unification of distributed services. An emphasis on service unification is essential because individual components of the service structure--in particular the soon-to-arrive, high-end massively parallel processing (MPP) computational platforms--will require this environment in order to be used to fullest potential.
In determining its major goals, NERSC must take into account two variables: the first is the technological change occurring in the world, and the second is the users' reactions to this change. The dominant components of technological change come from (1) the microprocessor revolution, (2) the adoption of UNIX and other standardized protocols (such as X) worldwide, (3) the exponential increase in wide area network (WAN) and local area network (LAN) bandwidths, and (4) the huge gains in tertiary storage capabilities.
The impact of the microprocessor has already been felt at NERSC. Used as a high-end workstation in the form of the Supercomputing Auxiliary Service (SAS), it helped the Center to regain some of the functionality lost through the adoption of UNIX on the supercomputer. SAS provides a rich set of tools and pre- and post-processing capabilities. This union between microprocessor and supercomputer represents an initial step to a multicomponent UPE.
Of even greater importance to NERSC is that some of the offspring of the microcomputers have largely surpassed the vector supercomputer in capability. These have evolved into three distinct species: MPP, the symmetric multiprocessor (SMP), and the workstation (or PC) cluster featuring a high-performance asynchronous transfer mode (ATM) interconnect. The latter is an interesting, but still unproven, technology in the research stage.
While the basic mission of NERSC remains essentially unchanged, the new technologies demand that the services offered evolve to become more tightly coupled, sophisticated, and global. The UPE will coordinate the services seen by the user across the NERSC infrastructure. In the later stages of its implementation, the UPE will extend beyond the centralized environment as we seek to unify the user-local environment with the NERSC centralized environment.
Some aspects of this environment are described below to give users a sense of how the UPE will affect their work:
SMPs also have an interesting architecture that may lend itself very well to the UPE envisioned at NERSC. The hardware architecture is close enough to that of the CRAYs to enable running and debugging micro/macrotasking applications. The basic microprocessor and node architecture, and the operating system, are in many ways similar to that of MPPs. Thus, writing, porting, and debugging parallel applications on the SMPs may be easier than on the current MPPs, given the MPP's lack of ability to timeshare.
At the ERSUG meeting we heard strong support for the NERSC study, and we are currently running one SMP machine for our study. Many NERSC users are active in this process, and to date we can report that people are pleased with the scalar performance and interactivity of the machine while loaded. So, early indications are good. We hope to report more in the near future.
The NERSC staff will continue to collaborate with research scientists to develop parallel versions of serial applications. Activities include analyzing existing serial applications and converting them to parallel, developing new algorithms better suited to parallelization, tuning parallel applications to minimize communication overhead and maximize single-processor performance, developing distributed applications that permit the interactive program control and/or real-time visualization of data for a parallel program, facilitating sharing of ideas between different research groups working on parallel code development, and developing tools to help users in the transition to parallel computing. Research scientists interested in collaborating with NERSC staff in the development of a parallel application or in need of tools to help in the conversion effort should send e-mail to consult@nersc.gov.
Another way that research scientists can get started is via the MPP Access Program. During Round 1 of the program, nine proposals were awarded allocations on four parallel platforms, including the T3D, CM-5, Paragon, and KSR1. During Round 2 of the program, 21 proposals were awarded allocations on the T3D. Details about the projects awarded time in Round 2 will appear in a future Buffer article. The next round of proposals will be due in the latter part of August for allocations starting October 1, 1995.
Finally, on June 14 - 30, NERSC will host the Summer Workshop on Massively Parallel Processing. The first three days of the workshop will include general tutorials and invited lectures on parallel computing. The second week will focus on advanced topics and hands-on experience with an MP computer system. During the third week, NERSC will provide computing facilities and consulting support for participants to work on their own parallel scientific computing projects.
The team identified performance metrics and gathered the data that would allow an analysis to be done. The key values were velocity (CPU time/wall-clock time once job begins execution), wait time (time between submission and initial execution), and held time (time between when a job is checkpointed because the allocation was depleted and when a new allocation is infused this affects velocity). After spending several months collecting data, we learned that it is extremely difficult to reconstruct the C90 environment at any moment using the information logged by UNICOS and NQS. We made many iterations refining NQSTAT because of the variations in the data we encountered. The result of that process, however, was a much-improved tool that NERSC customers can run.
Some of the observations and conclusions presented at the January ERSUG meeting were that the NQS data is very noisy, and post-analysis is extremely labor intensive. We saw that NQS was performing consistently with how it was tuned to perform, but in the class of large jobs, PNL's velocities were, in fact, lower than those of other comparable jobs. At that time our analysis was not complete. Since then we learned that the problems seem to be confined to the Gaussian and Crystal codes. Bruce Curtis was able to make some optimizations that resulted in significant improvements for the specific test cases we had. This information was communicated to PNL.
A recent milestone is the acquisition of a new Base Storage System modeled after proven National Storage Laboratory (NSL) technologies. This system includes a high-performance parallel interface (HiPPI)- and fiber distributed data interface (FDDI)-connected high-performance RS/6000 server with a 100-GB SCSI-2 fast/wide disk cache, a high-capacity IBM 3494 robotic system with support for new 3590 high-performance tape, and a commercial version of NSL-UniTree storage software. The NERSC Base Storage System is expected to be operational in early May 1995 and is similar to the storage system recently installed at Livermore Computing's Facility for Advanced Scalable Computing Technology (FAST).
Planned FY95 upgrades for the Base System include a new 95-GB HiPPI disk array and new high-capacity, high-speed tape drives (10-25 GB per cartridge uncompressed, and up to 20-MB-per-second data transfers for compressible data). Plans for acquiring a large Fully Configured Storage System (FCSS) in FY96 are also under way. The FCSS is planned as a large upgrade or augmentation to the Base System and is expected to completely replace the current CFS (Common File System) and IBM 4381 storage environment. The FCSS will include new disk and archival subsystems, and run a commercial high-performance, scalable storage management software product such as the NSL's High-Performance Storage System (HPSS).
These services will continue to be important, but there are new focus areas which have been introduced to keep NERSC on the leading edge of advances in technology. In this age of information and technological revolution, changes continually occur. NERSC is addressing these changes in several ways. We are working on providing a single interface to all information delivery systems. We are developing training techniques for utilizing new media, and providing technical expertise for collaboration and coordination with NERSC scientists and researchers.
The single-user-interface information delivery system is based on the Standard Generalized Markup Language (SGML) standard. The goal of this system is to deliver the information anywhere, anytime, using the appropriate presentation media. A WWW interface is the immediate vehicle by which this information will be integrated and presented. Some of the information and delivery systems being integrated over the diverse hardware and software platforms are CrayDoc, MAN pages, the NERSC documentation database, the REMEDY trouble ticket system database, bulletin boards, vendor help packages, and other WWW databases. We plan to present training classes using video-on-demand technology, video teleconferencing, and asynchronous access to the classes. The asynchronous access to stored classes can be presented either by file transfer and/or real-time playback by connecting to one of the prototype high-speed networks in the San Francisco Bay Area, via integrated services digital network (ISDN) or LAN connection to the Internet, or via modem connection over standard telephone lines.
Providing technical expertise for collaboration and coordination with NERSC scientists and research ers is critical. The User Services Group and other NERSC staff are working with researchers in such areas as parallel programming, code optimization, distributed computing, and visualization of scientific data. These efforts benefit both NERSC and the ER programs. We will continue to provide our traditional services and to build on new ones to satisfy customer needs and to promote their science.
ESnet Decnet Working Group (EDWG)--Chair: Phil DeMar, demar@fnal.gov. Coordinates DECnet functionality in the ESnet community in support of ESnet operations. Negotiates DECnet issues with the global DECnet community. EDWG is responsible for the migration of DECnet Phase IV to DECnet Phase V.
IPng Working Group--Chair: Bob Fink, rlfink@lbl.gov. Develops strategies and pilots for the implementation of the next generation of IP networking both for local sites and in concert with ESnet Management for the ESnet community.
Network Monitoring Task Force (NMTF)--Chair: Les Cottrell, cottrell@slac.stanford.edu. Acts as a focus group/forum for ESnet community sites in the area of network monitoring. Shares network monitoring information among participants (plans, experiences with hardware/tools/ applications, requirements, threshold metrics, and performance objectives).
Local Asynchronous Transfer Mode TaskForce (LATM)--Chair: Bob McMahon, mcmahon@anl.gov. Has successfully run a pilot project on implementing ATM in the LAN environment. Is now coordinating LAN ATM information within the ESnet community as ATM is further deployed.
Remote Conferencing Working Group (RCWG)--Chair: Kipp Kippenhan, kippenhan@fnal.gov. Advances collaborative video conferencing for both conference rooms and the desktop. Both ISDN and Internet multicast backbone (Mbone) style technologies are used. Furthermore, they are being developed to be interoperable. The RCWG works with the ESnet Video Conferencing Service at NERSC to coordinate video activities in the ESnet community.
Key Distribution Task Force (KDTF)--Chair: Bill Johnston, johnston@george.lbl.gov. Coordinates issues related to the deployment of secure keys to be used by e-mail technologies such as PEM and PGP, and authentication services such as digital signature. If appropriate, recommends strategies for such deployment. The KDTF is charged to ensure that their efforts are compatible with those of the Internet Engineering Task Force (IETF) and Federal interagency key distribution task forces.
Application Working Group (AWG)--Chair: Dick Kouzes, rt_kouzes@gate.pnl.gov. Develops strategies, tools, and pilot projects for collaboration in areas such as: (1) National Information Infrastructure (NII) focused projects; (2) information services, including data storage and retrieval, project documentation, and multimedia lab notebook and calendar; (3) distributed collaboration tools, including multimedia communications and software development; (4) collaboration on social organization issues, including effective standard operating procedures.
Architecture Task Force (ATF)--Chair: Arthurine Breckenridge, arbreck@sandia.gov. ATF is recommending a high-level architecture for a distributed collaboration environment that will eventually provide production-level support of research efforts in DOE. The architecture is being developed to complement, and possibly to help define, the DOE NII activities. It will also address the non-technical (social, political, and budgetary) issues to facilitate the establishment of such an environment.
Distributed Computing Environment Working Group (DCEWG)--Chair: Barry Howard, howard@nersc.gov. Examines and identifies the recommended appropriate elements of a distributed computing environment, including such components as OSF/DCE, the Common Desktop Environment (CDE), the Common Object Request Broker Architecture (CORBA), and Load Sharing. Responsible for recommending strategies and pilots for implementing these components.
Distributed System Management Working Group (DSMWG)--Chair: John Volmer, volmer@anl.gov. Develops strategies, tools, and pilot projects for effectively providing systems management to distributed heterogeneous systems. Will also interact with the DCEWG for the effective systems management of DCEWG layer tools.
Group Communications Working Group (GCWG)--Chair: Allen Sturtevant, aps@es.net. Develops interoperable communications methods and strategies for documents and other forms of group communications exchange, including FTP, Gopher, and WWW servers, MIME e-mail extensions, graphics formats, and database exchanges. The GCWG's objective is to support the ability to have seamless communications between individuals and groups across heterogeneous platforms and information environments.