Hello again,
> I think this is more or less what was happening, the multicore queue
> jobs get on top, run out of space so can't be started and then maui gets
> all lazy with the scheduling of the "stuck" jobs. After increasing my
> RESERVATIONDEPTH and setting MAXIJOBS=1 for the multicore queue
> scheduling seems to be working as expected for the first time. Whether
> this is due to these changes or some other factor (maybe my tears
> soaking into my keyboard invoked mercy from the dark gods of cluster
> computing?). There are still many improvements I want to try (liek
> Stuart's suggestion at partitioning my nodes, but at least now I have a
> baseline that works!
Well I kind of spoke too soon yesterday - after filling the queues just
long enough for me to get my hopes up maui then started playing silly
buggers again. It's behaving better, keeping things 80%-90% full (with
peaks and troughs in free job slot utilisation) , but that's still too
much waste.
Once more unto maui.cfg I go (I hope to implement some of Stuart's
priority and weighting suggestions), any further advice would of course
be appreciated!
Have a good weekend all!
Matt
|