On Fri, 19 Aug 2005, Jiri Kosina wrote:
> On Fri, 19 Aug 2005, Jiri Kosina wrote:
>
> > When I see that the published information is incorrect and run the
> > /opt/lcg/libexec/lcg-info-dynamic-ce (or lcg-info-dynamic-wrapper) script
> > manually on the CE, the values it returns are correct. So it seems that the
> > BDII/MDS is not able to get the correct values, even though the scripts are
> > returning them properly.
>
> A little correction to this: I have replaced the qmgr and pbsnodes binaries
> with my wrappers to catch the command line and log the output that is
> passed to the lcg-info-* script by these PBS commands. The output seemed
> perfect even at the times when incorrect values were published.
You are looking at cached information.
Let the wrappers also log the time taken by the real commands.
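
A rough sketch of such a timing wrapper in Python follows. The path of the
renamed real binary and the log file location are hypothetical, and the
function names are made up for illustration; the existing wrappers may well
be shell scripts, in which case the same idea applies.

```python
import subprocess
import sys
import time

# Hypothetical paths: adjust to wherever the real binary was moved
# and wherever the wrapper log should go.
REAL_PBSNODES = "/usr/bin/pbsnodes.real"
LOG_PATH = "/tmp/pbsnodes-wrapper.log"


def run_timed(argv, log):
    """Run argv, capture its output, and log the wall-clock time it took."""
    start = time.monotonic()
    proc = subprocess.run(argv, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
    elapsed = time.monotonic() - start
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    log.write("%s cmd=%r rc=%d elapsed=%.3fs\n"
              % (stamp, argv, proc.returncode, elapsed))
    return proc.returncode, proc.stdout, proc.stderr, elapsed


def main(args):
    """Wrapper entry point: time the real command, then pass its output on."""
    with open(LOG_PATH, "a") as log:
        rc, out, err, _ = run_timed([REAL_PBSNODES] + list(args), log)
    sys.stdout.buffer.write(out)
    sys.stderr.buffer.write(err)
    return rc
```

Installed in place of pbsnodes, the script would end with
`sys.exit(main(sys.argv[1:]))`. Log entries with an elapsed time near or
above 5 seconds are the ones to look for.
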
The algorithm of the generic information provider is as follows.
If the cached information is older than 20 seconds, the dynamic plug-in
is run, and its output is used if it arrives within 5 seconds.
Otherwise the cached values are used, unless the cache file is more than
10 minutes old, in which case the static defaults are used.
The dynamic plug-in may continue in the background to refresh the cached
information for up to 10 minutes, after which it is killed.
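
The decision described above can be sketched in Python as below. This is
only a model of the description, not the provider's actual code: the
function and constant names are invented, whether the thresholds are
inclusive is an assumption, and the background refresh of the cache is not
modelled.

```python
FRESH_SECS = 20        # cache no older than this is served as-is
PLUGIN_WAIT_SECS = 5   # how long the provider waits for the dynamic plug-in
STALE_SECS = 600       # cache older than this (10 minutes) is not used


def choose_source(cache_age, plugin_time):
    """Return which data would be published: 'cache', 'dynamic' or 'static'.

    cache_age   -- seconds since the cache file was written, None if absent
    plugin_time -- seconds the dynamic plug-in needs to answer, None if it
                   never answers
    """
    # Fresh cache: the dynamic plug-in is not run at all.
    if cache_age is not None and cache_age <= FRESH_SECS:
        return "cache"
    # Cache is old: run the plug-in and use its output if it is quick enough.
    if plugin_time is not None and plugin_time <= PLUGIN_WAIT_SECS:
        return "dynamic"
    # Plug-in too slow: fall back to the cache unless it is too old.
    if cache_age is not None and cache_age <= STALE_SECS:
        return "cache"
    # No usable cache and no timely plug-in output: static defaults.
    return "static"
```

In this model, a plug-in that needs 8 seconds gives `choose_source(30, 8)`
equal to "cache" and `choose_source(700, 8)` equal to "static", which is why
the time taken by the real commands matters.
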
> Also, I have seen that sometimes even when manually invoking the
> lcg-info-dynamic-ce or lcg-info-dynamic-wrapper scripts, I get wrong
> results (even though the data given by qstat and pbsnodes are correct).
> Does anyone happen to have an idea what could be the problem?
>
> The intervals oscillate around 10 minutes: for approximately 10
> minutes everything is published correctly, and then for some time (30-60
> seconds) incorrect information is published. Similar intervals apply
> to manual execution of the scripts.
>
> Thanks in advance,
>
>