hi,
try to install ganglia with gpu support:
https://developer.nvidia.com/ganglia-monitoring-system
there you can see e.g. if the gpu runs out of memory, is waiting for
incoming data or which steps (e.g. motioncorr) can handle parallel
processes on one gpu.
cheers,
wolfgang
On 02/22/2017 12:58 PM, Oliver Clarke wrote:
> Hi,
>
> I’m running Relion 2 on a workstation with 2X Titan-X cards and 256G RAM.
>
> Everything has been running smoothly, but in the last couple of weeks I have had problems with 2D class averaging - multiple runs on two different datasets have just silently stalled at some point during the expectation step, without crashing or writing any error messages to the log - it’ll just sit at for example 8/20min forever unless killed.
>
> If I restart the run or continue from the previous iteration, there is usually no issue on the second time around.
>
> Does anyone have any suggestions as to how to diagnose this, e.g. whether it is a software or a hardware issue (I suspect the latter as it is sporadic even with the same input and only started happening recently, maybe a faulty GPU)?
>
> Cheers
> Oli
--
Universitätsklinikum Hamburg-Eppendorf (UKE)
@ Centre for Structral Systems Biology (CSSB)
@ Institute of Molecular Biotechnology (IMBA)
Dr. Bohr-Gasse 3-7 (Room 6.14)
1030 Vienna, Austria
Tel.: +43 (1) 790 44-4649
Email: [log in to unmask]
http://www.cssb-hamburg.de/
--
_____________________________________________________________________
Universitätsklinikum Hamburg-Eppendorf; Körperschaft des öffentlichen Rechts; Gerichtsstand: Hamburg | www.uke.de
Vorstandsmitglieder: Prof. Dr. Burkhard Göke (Vorsitzender), Prof. Dr. Dr. Uwe Koch-Gromus, Joachim Prölß, Rainer Schoppik
_____________________________________________________________________
SAVE PAPER - THINK BEFORE PRINTING
|