I have seen RELION zombies many times. One of the worker MPI ranks fails, and the master MPI rank waits fruitlessly for it to return. Look in the error log for some kind of MPI failure. There is nothing to do but cancel the job and continue from where it left off. The worker rank typically fails because it runs out of memory, either on the CPU or on the GPU.
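
If the error log is long, a small script can help find the failure. The following is only a rough sketch: the function name and the pattern list are my own illustrative assumptions, not messages RELION is guaranteed to print, so adjust them to whatever your MPI library, CUDA runtime, and scheduler actually write.

```python
import re

# Hypothetical patterns (assumptions for illustration only); edit them to
# match the messages your MPI library, CUDA runtime, and scheduler emit.
FAILURE_PATTERNS = [
    r"MPI_ABORT",
    r"out of memory",
    r"bad_alloc",
    r"oom[-_ ]kill",
    r"segmentation fault",
]
_FAILURE_RE = re.compile("|".join(FAILURE_PATTERNS), re.IGNORECASE)


def find_failure_lines(log_text):
    """Return (line_number, line) pairs for lines matching any pattern."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        if _FAILURE_RE.search(line):
            hits.append((number, line.strip()))
    return hits
```

You would call it on the contents of the job's error log, for example `find_failure_lines(open(path_to_error_log).read())`, where `path_to_error_log` is a placeholder for wherever your job writes its standard error.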
David Hoover
HPC @ NIH