Hi
we observe that many jobs stay in running state forever.
But the job was done successfully and output copied back to WMS. What
could be the reason?
example:
gridka24 $ glite-wms-job-status
https://lb-1-fzk.gridka.de:9000/HToevj_pLkQDZcEKfXQQCw
...Current Status: Running...
but:
gridka24 $ glite-wms-job-logging-info -v 2
https://lb-1-fzk.gridka.de:9000/HToevj_pLkQDZcEKfXQQCw
Event: Done
- Arrived = Wed Dec 1 15:20:05 2010 CET
- Exit code = 0
- Host = c01-016-117.gridka.de
- Reason = job completed
@LB mysql db:
| HToevj_pLkQDZcEKfXQQCw | 14 | DG.LLLID=2430000
DG.USER="/O=GermanGrid/OU=Uni Karlsruhe/CN=Andreas Oehler"
DATE=20101201142005.522378 HOST="c01-016-117.gridka.de" PROG=edg-wms
LVL=SYSTEM DG.PRIORITY=4 DG.SOURCE="LRMS" DG.SRC_INSTANCE=""
DG.EVNT="Done"
DG.JOBID="https://lb-1-fzk.gridka.de:9000/HToevj_pLkQDZcEKfXQQCw"
DG.SEQCODE="UI=000000:NS=0000000004:WM=000004:BH=0000000000:JSS=000002:LM=000002:LRMS=000005:APP=000000:LBS=000000"
DG.DONE.STATUS_CODE="OK" DG.DONE.REASON="job completed"
DG.DONE.EXIT_CODE="0"
@WMS:
# cat
/var/glite/SandboxDir/HT/https_3a_2f_2flb-1-fzk.gridka.de_3a9000_2fHToevj_5fpLkQDZcEKfXQQCw/output/gc.stdout
<some output, job done correct>
LB and WMS are different hosts and have latest updates.
I tried to restart interlogd processes.. no effect.
Regards
Dimitri
--
Dimitri Nilsen, Dipl.-Ing(FH)
Karlsruhe Institute of Technology (KIT)
Steinbuch Centre for Computing
Postfach 3640
76344 Eggenstein-Leopoldshafen, Germany
Tel.: +49 7247 82-8607
Fax.: +49 7247 82-4972
Email: [log in to unmask]
|