Hi,
We have the same problem here.
Problem:
After update to u26 job-list-match stopped working
Symptoms:
-----------> glite-job-list-match hostname.jdl
Selected Virtual Organisation name (from proxy certificate extension): dteam
Connecting to host wms1.cyf-kr.edu.pl, port 7772
===================== glite-job-list-match failure ======================
No Computing Element matching your job requirements has been found!
======================================================================
----------->
Facts:
-----------> cat hostname.jdl
VirtualOrganisation="dteam";
Executable = "/bin/hostname";
Arguments = "-f";
StdOutput = "hello-mes.out";
StdError = "hstderr.log";
OutputSandbox = {"hello-mes.out","hstderr.log"};
----------->
-----------> cat /var/glite/workload_manager/ismdump.fl
...
NetworkServer = [
II_Port = 2170;
Gris_Port = 2170;
II_Timeout = 30;
Gris_Timeout = 20;
II_DN = "mds-vo-name=local, o=grid";
Gris_DN = "mds-vo-name=local, o=grid";
II_Contact = "wms1.cyf-kr.edu.pl";
BacklogSize = 64;
ListeningPort = 7772;
MasterThreads = 8;
DispatcherThreads = 10;
SandboxStagingPath = "${GLITE_LOCATION_VAR}/SandboxDir";
LogFile = "${GLITE_LOCATION_LOG}/networkserver_events.log";
LogLevel = 6;
EnableQuotaManagement = false;
MaxInputSandboxSize = 10000000;
EnableDynamicQuotaAdjustment = false;
QuotaAdjustmentAmount = 10000;
QuotaInsensibleDiskPortion = 2.0;
ConnectionTimeout = 300;
ListMatchParadise = "${GLITE_LOCATION_TMP}/MatchArea";
DLI_SI_CatalogTimeout = 60;
];
...
--------------->
BDII is located on the same machine. Accessible, not firewalled. Working
properly.
/var/glite/workload_manager/ismdump.fl is empty.
As i have observed, no communication to bdii (2170) occurs when list-match
command is issued. (verified with tcpdump). No such communication found in
bdii.log.
Only sign of data exchange can be found after /etc/init.d/gLite restart and it
looks like this:
-->
Jun 26 14:43:45 wms1 slapd[14895]: daemon: conn=2 fd=7 connection from
IP=127.0.0.1:50466 (IP=127.0.0.1:2173) accepted.
Jun 26 14:43:45 wms1 slapd[14895]: conn=2 op=0 BIND dn="" method=128
Jun 26 14:43:45 wms1 slapd[14895]: conn=2 op=0 RESULT tag=97 err=0 text=
Jun 26 14:43:45 wms1 slapd[14895]: conn=2 op=1 SRCH base="cn=subschema"
scope=0 filter="(objectClass=*)"
Jun 26 14:43:45 wms1 slapd[14895]: conn=2 op=1 RESULT tag=101 err=0 text=
Jun 26 14:43:45 wms1 slapd[14895]: conn=2 op=2 UNBIND
Jun 26 14:43:45 wms1 slapd[14895]: conn=-1 fd=7 closed
-->
but /var/glite/workload_manager/ismdump.fl is still empty.
Logs:
---------> workload_manager_events.log
26 Jun, 14:49:26 -I: [Info]
get_new_requests(/home/glbuild/GLITE_3_0_3_RC1/org.glite.wms.manager/src/server/DispatcherFromFileList.cpp:1054):
considering match of https://localhost:6000/sIHJcl78oqEsZ3blZrto9A
26 Jun, 14:49:26 -D: [Debug]
process_match(/home/glbuild/GLITE_3_0_3_RC1/org.glite.wms.manager/src/server/RequestHandler.cpp:407):
considering match
https://localhost:6000/sIHJcl78oqEsZ3blZrto9A /tmp/0xb74f00e0.20070626144926671646 -1
0
---------->
Any help will be much appreciated. I gave up
> > -----Original Message-----
> > From: LHC Computer Grid - Rollout
> > [mailto:[log in to unmask]] On Behalf Of Burke,
> > S (Stephen)
> > Sent: Wednesday 6 June 2007 16:30
> > To: [log in to unmask]
> > Subject: Re: [LCG-ROLLOUT] WMSLB can't match any job to any CE
> >
> > LHC Computer Grid - Rollout
> >
> > > [mailto:[log in to unmask]] On Behalf Of
> > > Vrijaldenhoven, Serge saidL
> > > on WMSLB (which is also the BDII):
> > > LCG_GFAL_INFOSYS=moon.ehv.campus.philips.com:2170
> >
> > I can't connect to that, is it behind a firewall?
>
> Yes, it's behind a firewall
>
> > > Somehow /var/glite/workload_manager/ismdump.fl is very empty
> > > on our system...
>
> ...and the 1*10^6 dollar question is: how to get it filled?
>
> > Well, that would explain why the list-match doesn't work ...
> >
> > Stephen
>
Best Regards
--
Lukasz Flis
|