On Fri, 24 Jun 2005, Yan Ben-Hammou wrote:
> Hi,
> when I submit a job from the Marseille UI to our site, the log file in the
> atlas001 directory (atlas is my VO) has a
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_TWO_PHASE error.
> Does anyone have an idea?
See this Wiki entry:
http://goc.grid.sinica.edu.tw/gocwiki/Globus_error_79%3a_connecting_to_the_job_manager_failed%2e
> The full log file is included below:
>
>
> 6/24 15:40:44 JM: Security context imported
> 6/24 15:40:44 JM: Adding new callback contact
> (url=https://marseilleui.mrs.grid.cnrs.fr:21001/, mask=1048575)
> 6/24 15:40:44 JM: Added successfully
> 6/24 15:40:44 Pre-parsed RSL string: &("executable" = "testjob.jdl"
> )("stdout" = "x-gass-cache://" # $("GLOBUS_GRAM_JO
> B_CONTACT") # "stdout" "anExtraTag" )("stderr" = "x-gass-cache://" #
> $("GLOBUS_GRAM_JOB_CONTACT") # "stderr" "anExtraT
> ag" )
> 6/24 15:40:44
> <<<<<Job Request RSL
> &("executable" = "testjob.jdl" )("stdout" = "x-gass-cache://" #
> $("GLOBUS_GRAM_JOB_CONTACT") # "stdout" "anExtraTag" )
> ("stderr" = "x-gass-cache://" # $("GLOBUS_GRAM_JOB_CONTACT") # "stderr"
> "anExtraTag" )
> >>>>>Job Request RSL
> 6/24 15:40:44
> <<<<<Job Request RSL (canonical)
> &("executable" = "testjob.jdl" )("stdout" = "x-gass-cache://" #
> $("GLOBUS_GRAM_JOB_CONTACT") # "stdout" "anExtraTag" )
> ("stderr" = "x-gass-cache://" # $("GLOBUS_GRAM_JOB_CONTACT") # "stderr"
> "anExtraTag" )
> >>>>>Job Request RSL (canonical)
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_MAKE_SCRATCHDIR
> 6/24 15:40:44
> <<<<<Job RSL
> &("environment" = ("HOME" "/home/atlas001" ) ("LOGNAME" "atlas001" )
> )("executable" = "testjob.jdl" )("stdout" = "x-ga
> ss-cache://" # $("GLOBUS_GRAM_JOB_CONTACT") # "stdout" "anExtraTag"
> )("stderr" = "x-gass-cache://" # $("GLOBUS_GRAM_JO
> B_CONTACT") # "stderr" "anExtraTag" )
> >>>>>Job RSL
> 6/24 15:40:44
> <<<<<Job RSL (post-eval)
> &("environment" = ("HOME" "/home/atlas001" ) ("LOGNAME" "atlas001" )
> )("executable" = "testjob.jdl" )("stdout" = "x-ga
> ss-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout"
> "anExtraTag" )("stderr" = "x-gass-cache://https:
> //lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr" "anExtraTag" )
> >>>>>Job RSL (post-eval)
> 6/24 15:40:44
> <<<<<Job RSL (post-validation)
> &("directory" = $("HOME") )("stdin" = "/dev/null" )("count" = "1"
> )("job_type" = "multiple" )("gram_my_job" = "collect
> ive" )("dry_run" = "no" )("environment" = ("HOME" "/home/atlas001" )
> ("LOGNAME" "atlas001" ) )("executable" = "testjob
> .jdl" )("stdout" =
> "x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout"
> "anExtraTag" )("stderr"
> =
> "x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr"
> "anExtraTag" )
> >>>>>Job RSL (post-validation)
> 6/24 15:40:44
> <<<<<Job RSL (post-validation-eval)
> &("directory" = "/home/atlas001" )("stdin" = "/dev/null" )("count" = "1"
> )("job_type" = "multiple" )("gram_my_job" = "
> collective" )("dry_run" = "no" )("environment" = ("HOME" "/home/atlas001"
> ) ("LOGNAME" "atlas001" ) )("executable" = "
> testjob.jdl" )("stdout" =
> "x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout"
> "anExtraTag" )("s
> tderr" =
> "x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr"
> "anExtraTag" )
> >>>>>Job RSL (post-validation-eval)
> 6/24 15:40:44 JMI: Getting RSL output value
> 6/24 15:40:44 JMI: Processing output positions
> 6/24 15:40:44 JMI: Getting RSL output value
> 6/24 15:40:44 JMI: Processing output positions
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_REMOTE_IO_FILE_CREATE
> 6/24 15:40:44 JM: Opening output destinations
> 6/24 15:40:44 JM: stdout goes to
> x-gass-cache://lcfgng.cs.tau.ac.il/20367.1119616844/dev/stdout
> 6/24 15:40:44 JM: stderr goes to
> x-gass-cache://lcfgng.cs.tau.ac.il/20367.1119616844/dev/stderr
> 6/24 15:40:44 JM: Opening
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout
> 6/24 15:40:44 JM:
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout
> maps to /home/atlas001/.glo
> bus/.gass_cache/local/md5/c2/4e73e51ff30968799cff270424d543/md5/15/a822d0a5e4829a1abf0e110b771e3a/data
> 6/24 15:40:44 JM: Opened file handle 0x80bf988.
> 6/24 15:40:44 JM: exiting
> globus_l_gram_job_manager_output_destination_open()
> 6/24 15:40:44 JM: Opening
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr
> 6/24 15:40:44 JM:
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr
> maps to /home/atlas001/.glo
> bus/.gass_cache/local/md5/c2/4e73e51ff30968799cff270424d543/md5/38/0183cdc81f294011ea9477afdd8d99/data
> 6/24 15:40:44 JM: Opened file handle 0x80afc90.
> 6/24 15:40:44 JM: exiting
> globus_l_gram_job_manager_output_destination_open()
> 6/24 15:40:44 stdout or stderr is being used, starting to poll
> 6/24 15:40:44 no opens in progress, registering state machine callback
> 6/24 15:40:44 JM: Finished opening output destinations
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_OPEN_OUTPUT
> 6/24 15:40:44 JM: GSSAPI type is GSI.. relocating proxy
> 6/24 15:40:44 JMI: testing job manager scripts for type fork exist and
> permissions are ok.
> 6/24 15:40:44 JMI: completed script validation: job manager type is fork.
> 6/24 15:40:44 JMI: in globus_gram_job_manager_script_proxy_relocate()
> 6/24 15:40:44 JMI: cmd = proxy_relocate
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: New Perl JobManager created.
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: proxy_relocate(enter)
> 6/24 15:40:44 JMI: while return_buf = GRAM_SCRIPT_X509_USER_PROXY =
> /home/atlas001/.globus/.gass_cache/local/md5/f9/dd
> 3891e7a5db2bd6bc171613794fa9/md5/54/6b6072ebfcbf0a0aecf96c0077cd10/data
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_PROXY_RELOCATE
> 6/24 15:40:44 JM: Relocated Proxy to
> /home/atlas001/.globus/.gass_cache/local/md5/f9/dd3891e7a5db2bd6bc171613794fa9/md
> 5/54/6b6072ebfcbf0a0aecf96c0077cd10/data
> 6/24 15:40:44 JM: before sending to client: rc=0 (Success)
> 6/24 15:40:44 Job Manager State Machine (exiting):
> GLOBUS_GRAM_JOB_MANAGER_STATE_TWO_PHASE
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_TWO_PHASE
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_TWO_PHASE_COMMITTED
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_STAGE_IN
> 6/24 15:40:44 JMI: testing job manager scripts for type fork exist and
> permissions are ok.
> 6/24 15:40:44 JMI: completed script validation: job manager type is fork.
> 6/24 15:40:44 JMI: in globus_gram_job_manager_submit()
> 6/24 15:40:44 JMI: local stdout filename =
> /home/atlas001/.globus/.gass_cache/local/md5/f9/dd3891e7a5db2bd6bc171613794
> fa9/md5/cf/973154c21276dc3796d3afe57a6e1f/data.
> 6/24 15:40:44 JMI: local stderr filename =
> /home/atlas001/.globus/.gass_cache/local/md5/f9/dd3891e7a5db2bd6bc171613794
> fa9/md5/6e/ce2b9c030924523c4fb17b867196eb/data.
> 6/24 15:40:44 JMI: cmd = submit
> 6/24 15:40:44 JMI: returning with success
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: New Perl JobManager created.
> 6/24 15:40:44 JMI: while return_buf = GRAM_SCRIPT_ERROR = 5
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_SUBMIT
> 6/24 15:40:44 JM: in globus_gram_job_manager_reporting_file_create()
> 6/24 15:40:44 JM: not reporting job information
> 6/24 15:40:44 JM: in globus_gram_job_manager_history_file_create()
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED
> 6/24 15:40:44 closing destination
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stdout
> 6/24 15:40:44 JM: exiting
> globus_l_gram_job_manager_output_destination_close()
> 6/24 15:40:44 closing destination
> x-gass-cache://https://lcfgng.cs.tau.ac.il:20001/20367/1119616844/stderr
> 6/24 15:40:44 JM: exiting
> globus_l_gram_job_manager_output_destination_close()
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_CLOSE_OUTPUT
> 6/24 15:40:44 JM: NOT empty client callback list.
> 6/24 15:40:44 JM: sending callback of status 4 (failure code 5) to
> https://marseilleui.mrs.grid.cnrs.fr:21001/.
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_TWO_PHASE
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_TWO_PHASE_COMMITTED
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_FILE_CLEAN_UP
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_SCRATCH_CLEAN_UP
> 6/24 15:40:44 JMI: testing job manager scripts for type fork exist and
> permissions are ok.
> 6/24 15:40:44 JMI: completed script validation: job manager type is fork.
> 6/24 15:40:44 JMI: cmd = cache_cleanup
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: New Perl JobManager created.
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: cache_cleanup(enter)
> Fri Jun 24 15:40:44 2005 JM_SCRIPT: cache_cleanup(exit)
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED_CACHE_CLEAN_UP
> 6/24 15:40:44 JM: in globus_gram_job_manager_reporting_file_remove()
> 6/24 15:40:45 JM: exiting globus_gram_job_manager.
>
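In a log like the one above, the lines that matter are the GRAM_SCRIPT_ERROR value, the "failure code" in the callback, and the last state entered before the first ..._STATE_FAILED state. Here the script returns error 5 right after "cmd = submit", and the same code 5 is sent back to the UI. As far as I recall, GRAM error code 5 means the executable does not exist, which would fit an RSL that names testjob.jdl as the executable, but please check that against the error list of your Globus version.

A minimal Python sketch for pulling those lines out of a job manager log follows. It is not part of Globus: the function name summarize_gram_log and the SAMPLE excerpt are made up for illustration, and the regular expressions assume the log format shown above (including the "> " mail quoting).

```python
import re

# Patterns assume the log layout quoted in this thread.
STATE_RE = re.compile(r"GLOBUS_GRAM_JOB_MANAGER_STATE_\w+")
SCRIPT_ERROR_RE = re.compile(r"GRAM_SCRIPT_ERROR\s*=\s*(\d+)")
CALLBACK_RE = re.compile(r"status\s+(\d+)\s+\(failure code\s+(\d+)\)")

# Hypothetical excerpt, copied from the log above for demonstration.
SAMPLE = """\
> 6/24 15:40:44 JMI: cmd = submit
> 6/24 15:40:44 JMI: while return_buf = GRAM_SCRIPT_ERROR = 5
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_SUBMIT
> 6/24 15:40:44 Job Manager State Machine (entering):
> GLOBUS_GRAM_JOB_MANAGER_STATE_FAILED
> 6/24 15:40:44 JM: sending callback of status 4 (failure code 5) to
> https://marseilleui.mrs.grid.cnrs.fr:21001/.
"""


def summarize_gram_log(text):
    """Collect state names and error codes from a GRAM job manager log."""
    # Drop the mail quoting prefix so patterns see the raw log lines.
    lines = [re.sub(r"^>\s?", "", line) for line in text.splitlines()]
    joined = "\n".join(lines)

    states = STATE_RE.findall(joined)
    script_errors = [int(code) for code in SCRIPT_ERROR_RE.findall(joined)]
    failure_codes = [int(code) for _status, code in CALLBACK_RE.findall(joined)]

    # Find the first FAILED state and the state entered just before it.
    first_failed = None
    state_before_failure = None
    for index, state in enumerate(states):
        if "_STATE_FAILED" in state:
            first_failed = state
            if index > 0:
                state_before_failure = states[index - 1]
            break

    return {
        "states": states,
        "script_errors": script_errors,
        "failure_codes": failure_codes,
        "first_failed": first_failed,
        "state_before_failure": state_before_failure,
    }


if __name__ == "__main__":
    for key, value in summarize_gram_log(SAMPLE).items():
        print(f"{key}: {value}")
```

To run it on a real log, read the file into a string and pass it to summarize_gram_log() instead of SAMPLE. State names that the mailer wrapped in the middle of an identifier would not be matched, so it is safer to run this on the original log file than on a copy pasted from mail.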