Hi,
we have found this kind of error around, with clock not aligned as
possible reason.
Can you check that on the UI you are using the clock is correctly
synchronized with ntp?
Our UI and RB seem to be ok.
The source of this hint:
the 2nd S.Campana talk on:
http://infnforge.cnaf.infn.it/cdsagenda/fullAgenda.php?ida=a0442
cheers
Alessandro C
Alessandro P
Emmanuel Medernach wrote:
> Maarten Litmaath wrote:
>
>> Emmanuel Medernach wrote:
>>
>>> Status Reason: cannot retrieve previous matches for
>>> https://egee-rb-01.cnaf.infn.it:9000/..
>>
>>
>> That problem usually occurs when one or more MySQL tables have
>> reached a hard limit (size of the file, number of rows);
>> the admins of egee-rb-01.cnaf.infn.it should have a look:
>>
>> ls -l /var/lib/mysql/lbserver20/
>> ls -l /var/lib/mysql/*.err
>> tail /var/lib/mysql/*.err
>>
>> It can also happen if the file system has been full recently.
>> Another possibility is that the DB got cleaned up or reset.
>> What does this command report:
>>
>> edg-job-get-logging-info -v 1 https://egee-rb-01.cnaf.infn.it:9000/..
>>
> Hello Maarten,
>
> The VO used is Biomed, and the user name is Cheick Oumar Thiam. Here is
> attached the output if it helps.
>
>
> ------------------------------------------------------------------------
>
>
> **********************************************************************
> LOGGING INFORMATION:
>
> Printing info for the Job : https://egee-rb-01.cnaf.infn.it:9000/4I14SV1bEXuwC2ehgjfLWw
>
> ---
> Event: RegJob
> - host = clrglop208.in2p3.fr
> - ns = egee-rb-01.cnaf.infn.it:7772
> - nsubjobs = 0
> - seed = uLU0BArrdV98O41PLThJ5Q
> - source = UserInterface
> - timestamp = Fri Dec 2 11:31:27 2005
> - user = /O=GRID-FR/C=FR/O=CNRS/OU=LPC/CN=Cheick Oumar [log in to unmask]
> ---
> Event: Transfer
> - dest_host = egee-rb-01.cnaf.infn.it
> - dest_instance = egee-rb-01.cnaf.infn.it:7772
> - destination = NetworkServer
> - host = clrglop208.in2p3.fr
> - result = START
> - source = UserInterface
> - timestamp = Fri Dec 2 11:31:29 2005
> - user = /O=GRID-FR/C=FR/O=CNRS/OU=LPC/CN=Cheick Oumar [log in to unmask]
> ---
> Event: Transfer
> - dest_host = egee-rb-01.cnaf.infn.it
> - dest_instance = egee-rb-01.cnaf.infn.it:7772
> - destination = NetworkServer
> - host = clrglop208.in2p3.fr
> - result = OK
> - source = UserInterface
> - timestamp = Fri Dec 2 11:31:38 2005
> - user = /O=GRID-FR/C=FR/O=CNRS/OU=LPC/CN=Cheick Oumar [log in to unmask]
> ---
> Event: Accepted
> - from = UserInterface
> - from_host = egee-rb-01.cnaf.infn.it
> - host = egee-rb-01.cnaf.infn.it
> - source = NetworkServer
> - src_instance = 7772
> - timestamp = Fri Dec 2 10:37:05 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: DeQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/workload_manager/input.fl
> - source = WorkloadManager
> - src_instance = WM
> - timestamp = Fri Dec 2 10:37:06 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: EnQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/jobcontrol/queue.fl
> - reason = unavailable
> - result = START
> - source = WorkloadManager
> - src_instance = WM
> - timestamp = Fri Dec 2 10:37:08 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: EnQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/jobcontrol/queue.fl
> - reason = unavailable
> - result = OK
> - source = WorkloadManager
> - src_instance = WM
> - timestamp = Fri Dec 2 10:37:09 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: DeQueued
> - host = egee-rb-01.cnaf.infn.it
> - local_jobid = unavailable
> - queue = /var/edgwl/jobcontrol/queue.fl
> - source = JobController
> - src_instance = unique
> - timestamp = Fri Dec 2 10:37:10 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Transfer
> - dest_host = localhost
> - dest_instance = /var/edgwl/logmonitor/CondorG.log/CondorG.1133519375.log
> - dest_jobid = unavailable
> - destination = LogMonitor
> - host = egee-rb-01.cnaf.infn.it
> - reason = unavailable
> - result = START
> - source = JobController
> - src_instance = unique
> - timestamp = Fri Dec 2 10:37:10 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Transfer
> - dest_host = localhost
> - dest_instance = /var/edgwl/logmonitor/CondorG.log/CondorG.1133519375.log
> - dest_jobid = 114697
> - destination = LogMonitor
> - host = egee-rb-01.cnaf.infn.it
> - reason = unavailable
> - result = OK
> - source = JobController
> - src_instance = unique
> - timestamp = Fri Dec 2 10:37:10 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Accepted
> - from = JobController
> - from_host = localhost
> - from_instance = unavailable
> - host = egee-rb-01.cnaf.infn.it
> - local_jobid = 114697
> - source = LogMonitor
> - src_instance = unique
> - timestamp = Fri Dec 2 10:37:18 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Transfer
> - dest_host = unavailable
> - dest_instance = /var/edgwl/logmonitor/CondorG.log/CondorG.1133519375.log
> - dest_jobid = unavailable
> - destination = LRMS
> - host = egee-rb-01.cnaf.infn.it
> - reason = 7 authentication failed: GSS Major Status: Authentication Failed GSS Minor Status Error Chain: init.c:499: globus_gss_assist_init_sec_context_async: Error during context initialization init_sec_context
> - result = FAIL
> - source = LogMonitor
> - src_instance = unique
> - timestamp = Fri Dec 2 10:37:18 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Done
> - exit_code = 1
> - host = egee-rb-01.cnaf.infn.it
> - reason = Job got an error while in the CondorG queue.
> - source = LogMonitor
> - src_instance = unique
> - status_code = FAILED
> - timestamp = Fri Dec 2 10:47:46 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Resubmission
> - host = egee-rb-01.cnaf.infn.it
> - reason = unavailable
> - result = WILLRESUB
> - source = LogMonitor
> - src_instance = unique
> - tag = unavailable
> - timestamp = Fri Dec 2 10:47:47 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: EnQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/workload_manager/input.fl
> - reason = unavailable
> - result = START
> - source = LogMonitor
> - src_instance = unique
> - timestamp = Fri Dec 2 10:47:47 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: EnQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/workload_manager/input.fl
> - reason = unavailable
> - result = OK
> - source = LogMonitor
> - src_instance = unique
> - timestamp = Fri Dec 2 10:47:47 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: Abort
> - host = egee-rb-01.cnaf.infn.it
> - reason = cannot retrieve previous matches for https://egee-rb-01.cnaf.infn.it:9000/4I14SV1bEXuwC2ehgjfLWw
> - source = WorkloadManager
> - src_instance = WM
> - timestamp = Fri Dec 2 10:47:47 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
> ---
> Event: EnQueued
> - host = egee-rb-01.cnaf.infn.it
> - queue = /var/edgwl/workload_manager/input.fl
> - result = OK
> - source = NetworkServer
> - timestamp = Fri Dec 2 10:37:06 2005
> - user = /C=IT/O=INFN/OU=Host/L=CNAF/CN=egee-rb-01.cnaf.infn.it
>
> **********************************************************************
>
--
Alessandro Cavalli
INFN - CNAF
Viale Berti Pichat 6/2
40127 Bologna
Italy
tel: +39 051 6092849
fax: +39 051 6092746
ICQ: 12771368
|