Hi Dimitri,
your WNs are on sl6, aren't they?
in past weeks some problems were noticed between Nagios and sl6 WNs on
some sitezs, look for example this ticket:
https://ggus.eu/ws/ticket_info.php?ticket=86451
cheers,
Alessandro
Il 12/10/2012 16:21, Dimitri Nilsen ha scritto:
> Hi,
>
> we have a problem with a CREAMCE, EMI2, that tests submitting by
> nagios failing with an error (Exit Code !=0)
> Looking into logs we see:
> Message Broker destination:
> /queue/grid.probe.metricOutput.EGEE.ngi-de-nagios_gridka_de
> 2012-10-12 16:04:52,643 ConnStomp INFO slept in message dispatch loop.
> ./nagrun.sh: line 423: 2234 Segmentation fault $NAGROOT/bin/nagios
> $NAGCONF > $FNAGPID 2>&1
>
> Any ideas what could be the problem? The submission of simple
> "/bin/hostname" jobs works fine.
>
> Regards
> Dimitri
>
>
> ************************
>
> Complete job output here:
>
> ----------------
>
>
> WARNING: job submission OK - problem on WN [Done (Exit Code !=0)]
> WARNING: job submission OK - problem on WN [Done (Exit Code !=0)]
>
> Testing from: ngi-de-nagios.gridka.de
> DN: /C=DE/O=GermanGrid/OU=KIT/CN=Robot: grid client - Dimitri
> Nilsen/CN=proxy
> VOMS FQANs: /ops/NGI/Germany/Role=NULL/Capability=NULL,
> /ops/NGI/Role=NULL/Capability=NULL, /ops/Role=NULL/Capability=NULL,
> /ops/NGI/Switzerland/Role=NULL/Capability=NULL
> glite-wms-job-status
> https://ngi-de-lbmon-1.scc.kit.edu:9000/QDRXbxwleZroxrGOgu4Ptg
>
>
> ======================= glite-wms-job-status Success
> =====================
> BOOKKEEPING INFORMATION:
>
> Status info for the Job :
> https://ngi-de-lbmon-1.scc.kit.edu:9000/QDRXbxwleZroxrGOgu4Ptg
> Current Status: Done (Exit Code !=0)
> Exit code: 1
> Status Reason: Job Terminated Successfully
> Destination: vgn003.hep.physik.uni-siegen.de:8443/cream-pbs-ops
> Submitted: Fri Oct 12 16:04:37 2012 CEST
> ==========================================================================
>
> Getting job output: OK.
>
>
> Job output:
>
> Launched with parameters: -v ops -f /ops/NGI/Germany -d
> /queue/grid.probe.metricOutput.EGEE.ngi-de-nagios_gridka_de -n PROD -t
> 600 -w 1 -l prod-lfc-shared-central.cern.ch -s
> dcache.fz-juelich.de,ophelia.zih.tu-dresden.de,se.bfg.uni-freiburg.de
> === [Fri Oct 12 16:04:48 CEST 2012] ===
> === Running on ===
> === Site: UNI-SIEGEN-HEP
> === CE: vgn003.hep.physik.uni-siegen.de:8443/cream-pbs-ops
> === WN: vgn020.hep.physik.uni-siegen.de
> === WN arch: x86_64.
> === [Fri Oct 12 16:04:48 CEST 2012] ===
> === Running on ===
> === Site: UNI-SIEGEN-HEP
> === CE: vgn003.hep.physik.uni-siegen.de:8443/cream-pbs-ops
> === WN: vgn020.hep.physik.uni-siegen.de
> === WN arch: x86_64
> Check Python version:
> /usr/bin/python
> Python 2.6.6
> Can we import Python LDAP ...
> YES.
> Launching MTA.
> /home/ops03/home_cream_296803620/CREAM296803620/nagios/bin/mta-simple
> --dirq /tmp/sam.2198.687/msg-outgoing --destination
> /queue/grid.probe.metricOutput.EGEE.ngi-de-nagios_gridka_de
> --broker-network PROD --pidfiledir
> /home/ops03/home_cream_296803620/CREAM296803620/nagios/var/ -v info
> --bdii-uri bdii-fzk.gridka.de:2170
> Setting Nagios configuration.
>
> Nagios Core 3.3.1
> Copyright (c) 2009-2011 Nagios Core Development Team and Community
> Contributors
> Copyright (c) 1999-2009 Ethan Galstad
> Last Modified: 07-25-2011
> License: GPL
>
> Website: http://www.nagios.org
> Reading configuration data...
> Read main config file okay...
> Processing object config directory
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/common.d'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/common.d/commands.cfg'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/common.d/host.cfg'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/common.d/timeperiods.cfg'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/common.d/contacts.cfg'...
> Processing object config directory
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d'...
> Processing object config directory
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/cadist'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/cadist/commands.cfg'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/cadist/services.cfg'...
> Processing object config directory
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/org.sam'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/org.sam/commands.cfg'...
> Processing object config file
> '/home/ops03/home_cream_296803620/CREAM296803620/nagios/etc/wn.d/org.sam/services.cfg'...
> Read object config files okay...
>
> Running pre-flight check on configuration data...
>
> Checking services...
> Checked 11 services.
> Checking hosts...
> Checked 1 hosts.
> Checking host groups...
> Checked 0 host groups.
> Checking service groups...
> Checked 0 service groups.
> Checking contacts...
> Checked 1 contacts.
> Checking contact groups...
> Checked 1 contact groups.
> Checking service escalations...
> Checked 0 service escalations.
> Checking service dependencies...
> Checked 0 service dependencies.
> Checking host escalations...
> Checked 0 host escalations.
> Checking host dependencies...
> Checked 0 host dependencies.
> Checking commands...
> Checked 9 commands.
> Checking time periods...
> Checked 1 time periods.
> Checking for circular paths between hosts...
> Checking for circular host and service dependencies...
> Checking global event handlers...
> Checking obsessive compulsive processor commands...
> Checking misc settings...
>
> Total Warnings: 0
> Total Errors: 0
>
> Things look okay - No serious problems were detected during the
> pre-flight check
> Launching Nagios: Fri Oct 12 16:04:51 CEST 2012
> Launch Nagios as background process.
> Message Broker URI: stomp://msg.cro-ngi.hr:6163/
> Message Broker destination:
> /queue/grid.probe.metricOutput.EGEE.ngi-de-nagios_gridka_de
> 2012-10-12 16:04:52,643 ConnStomp INFO slept in message dispatch loop.
> ./nagrun.sh: line 423: 2234 Segmentation fault $NAGROOT/bin/nagios
> $NAGCONF > $FNAGPID 2>&1
>
> Nagios Core 3.3.1
> Copyright (c) 2009-2011 Nagios Core Development Team and Community
> Contributors
> Copyright (c) 1999-2009 Ethan Galstad
> Last Modified: 07-25-2011
> License: GPL
>
> Website: http://www.nagios.org
> Nagios 3.3.1 starting... (PID=2234)
> Local time is Fri Oct 12 16:04:51 CEST 2012
> 2012-10-12 16:04:53,645 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:54,647 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:55,649 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:56,650 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:57,664 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:58,666 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:04:59,667 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:05:00,669 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:05:01,670 ConnStomp INFO slept in message dispatch loop.
> 2012-10-12 16:05:02,672 ConnStomp INFO slept in message dispatch loop.
> WARNING: No Nagios status.dat file after 10 sec.
> WARNING:
> /home/ops03/home_cream_296803620/CREAM296803620/nagios/var/status.dat
> WARNING: Cannot proceed. Check WN. Bailing out.
> 2012-10-12 16:05:02,692 ConnStomp INFO Statistics:
> Connections: 0
> Messages sent: 0
> Receipts received: 0
> Messages received: 0
> Errors: 0
--
Dr. Alessandro Paolini
INFN - CNAF
Viale Berti Pichat 6/2
40127 Bologna
Italy
tel: +39 051 6092723
fax: +39 051 6092916
ICQ: 192172027
skype: alex.paolini
**********************
"credo nel potere del riso e delle lacrime"
"come antidoto all'odio ed al terrore"
"un giorno senza un sorriso"
"è un giorno perso" >>> Charlie Chaplin
|