POE supports the following user authentication methods, which are based on the SP security services methods, and are set using the PSSP chauthts command:
AIX authentication will be used, based on entries in the /etc/hosts.equiv or .rhosts files. This is the default mechanism.
DCE authentication will be used, which requires the following:
DCE authentication is attempted first. If this attempt is not successful, AIX authentication is tried.
No security methods are enabled. POE defaults to use AIX authentication, as it does with compatibility.
When POE is used in a standalone pSeries or RS/6000 workstation environment without the ssp.clients fileset installed, only AIX authentication can be used. See DCE User Authorization on a Standalone pSeries or RS/6000 Workstation for more information.
The lsauthts command can be used to check the authentication method in use. See IBM Parallel System Support Programs for AIX: Command and Technical Reference for more information on the chauthts and lsauthts commands.
DPCL supports the following user authentication methods, which are based on the SP security services methods, and are set using the PSSP chauthts command:
AIX authentication will be used, based on entries in the /etc/hosts.equiv or .rhosts files. This is the default mechanism.
DCE authentication will be used, which requires the following:
DCE authentication is attempted first. If this attempt is not successful, AIX authentication is tried.
AIX authentication will be used, based on entries in the /etc/hosts.equiv or .rhosts files.
The /etc/hosts.equiv file, the .rhosts file, or both, are used to specify DPCL user authorization when AIX authentication is used.
If the combination of the home node machine and user name:
For more information on .rhosts and /etc/host.equiv, see the chapter on managing jobs in IBM AIX 5L Version 5.1 Files Reference.
If AIX user authorization, or compatibility, (the default) is used as a security mechanism on the system, each node needs to be set up so that each userid is authorized to access that node or remote link from the initiating home node. Use the /etc/hosts.equiv file and/or the .rhosts file to specify this user ID authorization, as explained below.
If the combination of the home node machine and user name:
For more information on .rhosts and /etc/host.equiv, see the chapter on managing jobs in IBM AIX 5L Version 5.1 Files Reference.
When you enable DCE authentication as the SP security method of choice, POE will expect a valid set of DCE credentials in order to submit parallel jobs.
When both DCE and compatibility methods are enabled, POE attempts DCE authentication first. If DCE authentication is unsuccessful, POE will then use AIX authentication.
In order to use DCE with POE, you need the following:
POE requires that a DCE service principal is defined for the partition manager daemon (PMD), to control the use of the service with DCE. The PMD service is defined in the spsec_defaults file as ppe/pmdv3, with these attributes:
You need to define this PMD service principal for each host. This means that every POE remote node must have a PMD service principal defined for its host name.
The SP security installation and configuration should set up the PMD service principal automatically. See PSSP Administration Guide for more information.
For parallel jobs that involve a large number of tasks, see PSSP Planning for information about optimizing the replication of DCE servers.
In order to use DCE with DPCL, the system administrator must have properly set up the DPCL service principal as part of the SP security administration and configuration steps.
DPCL requires that a DCE service principal is defined for the DPCL daemon to control the use of the service with DCE. The DPCL service is defined in the spsec_defaults file as ppe/dpcl, with these attributes:
This DPCL service principal needs to be defined for each host; that is, every DPCL remote node must have a DPCL service principal defined for its host name.
The SP security installation and configuration should set up the DPCL service principal automatically. See PSSP Administration Guide for more information.
The POE remote nodes are the locations from where the parallel job was ultimately submitted. Therefore, it is on the remote nodes that it is decided which authentication method is to be used.
POE supports a mixed environment of DCE and non-DCE nodes only under certain conditions. When running on a mix of nodes with and without DCE enabled, the following conditions apply:
In this case, some nodes may fail to successfully authenticate with DCE, in which case the job will try to authenticate with AIX. When the parallel job starts, it may be running with some nodes authenticated to DCE and not others. This may result in a problem if the application needs to use resources under DCE control.
If it is absolutely critical that an application start with successful DCE authentication, enable nodes with DCE only as the authentication method, to ensure that DCE will be used.
The destination remote node (server) sets the security policy, and
ultimately enforces the security method. The following table shows the
various combinations of supported methods, and how POE security will
work:
| Home Node | Remote Node | Result |
|---|---|---|
| DCE | compatibility | DCE is ignored, AIX authentication is used. |
| DCE | DCE | DCE authentication is used. |
| DCE | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| DCE | none | DCE is ignored and POE defaults to use AIX authentication. |
| compatibility | compatibility | AIX authentication is used. |
| compatibility | DCE | DCE authentication is used; POE will fail. |
| compatibility | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| compatibility | none | POE defaults to use AIX authentication. |
| DCE compatibility | compatibility | AIX authentication is used. |
| DCE compatibility | DCE | DCE authentication is used. |
| DCE compatibility | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| DCE compatibility | none | POE defaults to use AIX authentication. |
| none | compatibility | AIX authentication is used. |
| none | DCE | DCE authentication is used; POE will fail. |
| none | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| none | none | POE defaults to use AIX authentication. |
When a DPCL client is run, it is the remote nodes where the DPCL daemons run that determine which authentication method is to be used on that node.
DPCL supports a mixed environment of DCE and non-DCE nodes. As the DPCL client goes through the authentication process as part of connecting to each DPCL daemon, it will use the authentication process required by that DPCL daemon. In a mixed environment of DCE and non-DCE nodes, the following conditions apply:
In this case, some nodes may fail to successfully authenticate with DCE, in which case the job will try to authenticate with AIX. When the parallel job starts, it may be running with some nodes authenticated to DCE and not others. This may result in a problem if the application needs to use resources under DCE control.
If it is absolutely critical that an application start with successful DCE authentication, nodes should be enabled with DCE only as the authentication method, to ensure that DCE will be used.
The following table shows the various combinations of supported methods and
how security will work.
| Home Node | Remote Node | Result |
|---|---|---|
| DCE | compatibility | DCE is ignored, AIX authentication is used. |
| DCE | DCE | DCE authentication is used. |
| DCE | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| DCE | none | AIX authentication is used. |
| compatibility | compatibility | AIX authentication is used. |
| compatibility | DCE | DCE authentication is used; DPCL will fail. |
| compatibility | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| compatibility | none | AIX authentication is used. |
| DCE compatibility | compatibility | AIX authentication is used. |
| DCE compatibility | DCE | DCE authentication is used. |
| DCE compatibility | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| DCE compatibility | none | AIX authentication is used. |
| none | compatibility | AIX authentication is used. |
| none | DCE | DCE authentication is used; DPCL will fail. |
| none | DCE compatibility | DCE authentication is attempted first; AIX is used if DCE is unsuccessful. |
| none | none | AIX authentication is used. |
In order to have DCE authentication for POE users on standalone RS/6000 workstations, either as a cluster or connected to an SP, the following must be in place on each workstation:
When DCE is used to submit a parallel job, it will obtain a new set of credentials prior to submitting the job. This is important for long-running applications, because once a job is submitted, the credentials cannot be refreshed or renewed. As a result, the credentials lifetime should be long enough to outlast the longest-running application.
Careful planning is recommended to avoid premature expiration of DCE credentials before a job can complete. The PSSP Administration Guide describes how to alter the default credentials lifetime.
Once credentials have expired, they should be removed using the rmxcred command, to prevent filling up the /var file system.
When POE is installed, it modifies entries in /etc/services and in /etc/inetd.conf to install the partition manager daemon. In doing so, it requires an available port number that must be the same number on all nodes on which POE is to be installed and running. You need to ensure such a port number is available.
A POE application may require additional IP buffers (mbufs) under any of the following circumstances:
The need for additional IP buffers is usually evident when repeated requests for memory are denied. Using the netstat -m command can tell you when such a condition exists. In such a case, it may be necessary to use the no command to change the network option system parameters on the home node or on the SP nodes being used in the partition. You can use the no command to initially check the values as well.
The number of IP buffers allocated in the kernel is controlled by the thewall parameter of the no command. Increasing the value of the thewall parameter increases the number of IP buffers.
Notes:
On SP nodes, you can use the dsh command to execute the no command on each node of an SP. See the section on tuning in IBM Parallel System Support Programs for AIX: Administration Guide for more information on dsh.
For non-SP nodes, you can also set the values at system boot time by adding the appropriate call to the no command in either /etc/rc.net or /etc/rc.tcpip.
For more information on mbufs, see IBM AIX 5L Version 5.1 Performance Management Guide.
POE Version 3 and POE Version 2 have limited compatibility. POE Version 3 can run on a combination of POE Version 3 and POE Version 2.4 remote nodes as long as the home node (where POE is started from) is running POE Version 3. Earlier versions of POE, such as Version 2.2 and 2.3, are not supported in a mixed environment. You must also be at the supported level of AIX and PSSP for the particular POE version within a partition to submit PE jobs. When the POE home node is at a Version 2.4 level, it can only work with remote nodes that are also running POE Version 2.4. POE Version 2.4 home nodes cannot run with POE Version 3.2 remote nodes.
When POE Version 3.2 is installed on top of a POE Version 2 node, POE Version 3 completely replaces the previous version of POE. All Version 2 commands and executables are removed and replaced with Version 3 levels.
See Chapter 3, Migrating to PE 3.2 for more information.
As part of the Version 3 installation, the Partition Manager daemon (pmd) and POE executables have different names than their Version 2 counterparts. Also, different TCP/IP port numbers and daemon service names are used.
The following table summarizes the differences and can be used to tell
which version of POE you have if you are not sure.
| Type of Name or Number | POE Version 2 | POE Version 3 |
|---|---|---|
| Service name in /etc/services | pmv2 | pmv3 |
| Daemon name in /etc/inetd.conf | pmdv2 | pmdv3 |
| Default port number | 6125 | 6126 |
| pmd executable name | pmdv2 | pmdv3 |