Greetings,
I found something interesting.. apparently CREAM didn't get the
correct result from BLAH:
10 Nov 2015 00:19:18,366 INFO
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLParserClient -
initializeConnection: getting info about BLParser (pbs) from BLAH
(retry count=97/100)
10 Nov 2015 00:20:18,368 DEBUG
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLAHExecutor -
BLAH_GET_HOSTPORT 0
10 Nov 2015 00:20:19,370 DEBUG
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLAHExecutor -
getBlahOutput: S
10 Nov 2015 00:20:20,371 DEBUG
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLAHExecutor -
getBlahOutput: S 1
10 Nov 2015 00:20:20,371 DEBUG
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLAHExecutor -
getBlahOutput: 0 0 pbs/Error\ reading\ host:port
However, when I tried to query BLAH manually it seems return correct answer:
[root@khaldun ~]# blahpd
$GahpVersion: 1.8.0 Mar 31 2008 INFN\ blahpd\ (poly,new_esc_format) $
BLAH_GET_HOSTPORT 0
S
RESULTS
S 1
0 0 pbs/khaldun.biruni.upm.my:56554
Perhaps this is the main issue?
Regards
On Tue, Nov 10, 2015 at 7:37 AM, Muhammad Farhan SJAUGI
<[log in to unmask]> wrote:
> Dear Steve,
>
> Thank you for your feedback. I can confirm that the new blparser is
> used instead the old one.
>
> I'm wondering how the cream communicate with blparser? is it via
> socket or merely call the programming api?
>
> Regards
>
> On Mon, Nov 9, 2015 at 10:13 PM, Stephen Jones <[log in to unmask]> wrote:
>> Hi Muhammad,
>>
>> Here's something to check.
>>
>> http://grid.pd.infn.it/cream/field.php?n=Main.CREAMAndBlparserConfiguration
>>
>> If the "blparser" service is used by the "Old Blah Parser", perhaps you are
>> accidentally starting the wrong parser?
>>
>> Note: I think the "BNotifier" and "BUpdaterPBS" processes belong to the "New
>> Blah Parser". Maybe...
>>
>> So check which parser you are using.
>>
>> Cheers,
>>
>> Steve
>>
>>
>>
>> On 11/08/2015 10:03 AM, Muhammad Farhan SJAUGI wrote:
>>>
>>> Greetings,
>>>
>>> One of cluster shows strange behavior... CREAM unable to submit the
>>> job to BLAH because the blparser service is not alive:
>>>
>>> 08 Nov 2015 09:54:06,375 WARN
>>> org.glite.ce.creamapi.jobmanagement.cmdexecutor.AbstractJobExecutor -
>>> submission to BLAH failed [jobId=CREAM524062606; reason=The job cannot
>>> be submitted because the blparser service is not alive; retry
>>> count=3/3]
>>>
>>> I can confirm that the blparser service is up:
>>>
>>> [root@khaldun etc]# ps ax | grep BNotifier
>>> 3155 ? Sl 0:00 /usr/libexec/BNotifier
>>>
>>> [root@khaldun etc]# ps ax | grep BUpdaterPBS
>>> 3167 ? S 0:00 /usr/libexec/BUpdaterPBS
>>>
>>> Also I found from the cream log another info as below (but im not sure
>>> whether it is related or not)
>>>
>>> 08 Nov 2015 09:55:21,782 INFO
>>> org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLParserClient -
>>> initializeConnection: getting info about BLParser (pbs) from BLAH
>>> (retry count=95/100)
>>>
>>> I have tried restarting the service and even re-run yaim, both were
>>> not able to solve the problem...
>>>
>>> Is there anyone can help me to fix this problem?
>>>
>>> Regards
>>>
>>
>>
>> --
>> Steve Jones [log in to unmask]
>> Grid System Administrator office: 220
>> High Energy Physics Division tel (int): 43396
>> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 3396
>> University of Liverpool http://www.liv.ac.uk/physics/hep/
>
>
>
> --
> Muhammad Farhan Sjaugi, S.Kom. M.Sc
>
> Technical Coordinator
> Academic Grid Malaysia
> c/o UNITEN
> email: [log in to unmask]
>
> Lecturer/Programmer
> Perdana University Centre for Bioinformatics
> email: [log in to unmask]
--
Muhammad Farhan Sjaugi, S.Kom. M.Sc
Technical Coordinator
Academic Grid Malaysia
c/o UNITEN
email: [log in to unmask]
Lecturer/Programmer
Perdana University Centre for Bioinformatics
email: [log in to unmask]
|