Parallel Environment for AIX 5L V4.1 Hitchhiker's Guide SA22-7947-00
Figures
About this book
Hitching a lift on the Vogon constructor ship
What is the IBM Parallel Environment for AIX?
What is the Parallel Operating Environment?
Before you start
Running POE
Who is in control?
The answer is 42
Message passing
Data decomposition
Functional decomposition
Duplication versus redundancy
Protocols supported
Thread debugging implications
Checkpointing and restarting a parallel program
Limitations
How checkpointing and restarting works
A checkpoint/restart scenario
Don't panic
Messages
Message catalog errors
Finding PE messages
Logging POE errors to a file
Message format
Diagnosing problems using the Install Verification Program
Cannot compile a parallel program
Cannot start a parallel job
Cannot execute a parallel program
The program runs but...
The parallel debugger is your friend
It core dumps
No output at all
It hangs
Let us attach the debugger
Other hangups
Bad output
Debugging and threads
So long and thanks for all the fish
Tuning the performance of a parallel application
How much communication is enough?
Tuning the performance of threaded programs
Why is this so slow?
Profile it
Parallelize it
Wrong answer!
Here is the fix!
It is still not fast enough!
Tuning summary
Babel fish
Point-to-point communication
SEND (Non-Blocking)
RECEIVE (Non-Blocking)
SEND (Blocking)
RECEIVE (Blocking)
SEND/RECEIVE (Blocking)
STATUS
WAIT
TASK_SET
TASK_QUERY
ENVIRON
STOPALL
PACK
UNPACK
VSEND (Blocking)
VRECV (Blocking)
PROBE
Collective communications
BROADCAST
COMBINE
CONCAT
GATHER
INDEX
PREFIX
REDUCE
SCATTER
SHIFT
SYNC
GETLABEL
GETMEMBERS
GETRANK
GETSIZE
GETTASKID
GROUP
PARTITION
Reduction functions
User-defined reduction functions
Global variables and constants
Last error code
Wildcards
General notes
Task identifiers
Message length
Creating MPI objects
Using wildcard receives
Reduction functions
Error handling
Before and after using MPI functions
Using message passing handlers
Mostly harmless
What is a safe program?
Safety and threaded programs
Using threaded programs with non-thread-safe libraries
Message ordering
Program progress when two processes initiate two matching sends and receives
Communication fairness
Resource limitations
Appendix A. A sample program to illustrate messages
Figuring out what all of this means
Appendix B. Parallel Environment internals
What happens when I compile my applications?
How do my applications start?
How does POE talk to the nodes?
How are signals handled?
What happens when my application ends?
Notices
Trademarks
Acknowledgements
Glossary
Bibliography
Information formats and finding them on the Web
System planning publications
RS/6000 SP Switch Router publications
Related hardware publications
RS/6000 SP software publications
AIX publications
Reliable Scalable Cluster Technology (RSCT) publications
Cluster Systems Management (CSM) publications
DCE publications
Redbooks
Non-IBM publications
Index