Hello,
The CREAM-CE had a problem : Tomcat daemon stopped responding (Nagios
test on port 8443). In the /opt/glite/var/log/glite-ce-cream.log, I
found messages like those reported below.
Restarting tomcat cured the problem and I have found references to
similar bug in Savannah. Our CREAM-CE is not up-to-date and maybe
recent versions do not have this problem but I wanted to report
anyway.
Version we have :
rpm -qa | grep -i cream
glite-CREAM-3.1.9-0
glite-ce-cream-1.10.3-0
glite-yaim-cream-ce-4.0.7-2
Thanks
JM
Messages in /opt/glite/var/log/glite-ce-cream.log :
--------------------------------------------------------------------------------------------------------------
** BEGIN NESTED EXCEPTION **
java.net.SocketException
MESSAGE: Too many open files
STACKTRACE:
java.net.SocketException: Too many open files
at java.net.Socket.createImpl(Socket.java:388)
at java.net.Socket.<init>(Socket.java:362)
at java.net.Socket.<init>(Socket.java:209)
at
com.mysql.jdbc.StandardSocketFactory.connect(StandardSocketFactory.java:256)
at com.mysql.jdbc.MysqlIO.<init>(MysqlIO.java:271)
at com.mysql.jdbc.Connection.createNewIO(Connection.java:2921)
at com.mysql.jdbc.Connection.<init>(Connection.java:1555)
at
com.mysql.jdbc.NonRegisteringDriver.connect(NonRegisteringDriver.java:285)
at
org.apache.tomcat.dbcp.dbcp.DriverConnectionFactory.createConnection(DriverConnectionFactory.java:38)
at
org.apache.tomcat.dbcp.dbcp.PoolableConnectionFactory.makeObject(PoolableConnectionFactory.java:294)
at
org.apache.tomcat.dbcp.pool.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:840)
at
org.apache.tomcat.dbcp.dbcp.PoolingDataSource.getConnection(PoolingDataSource.java:96)
at
org.apache.tomcat.dbcp.dbcp.BasicDataSource.getConnection(BasicDataSource.java:880)
at
org.glite.ce.common.db.DatasourceManager.getConnection(DatasourceManager.java:93)
at
org.glite.ce.cream.jobmanagement.db.JobDBManager.retrieveJob(JobDBManager.java:532)
at
org.glite.ce.cream.jobmanagement.db.JobDBImplementation.retrieveJob(JobDBImplementation.java:162)
at
org.glite.ce.creamapi.jobmanagement.cmdexecutor.AbstractJobExecutor.retrieveJob(AbstractJobExecutor.java:125)
at
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.LRMSEventsProcessor.processEvent(LRMSEventsProcessor.java:88)
at
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLParserClient.readDataFromSocket(BLParserClient.java:265)
at
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.BLParserClient.run(BLParserClient.java:303)
** END NESTED EXCEPTION **
Attempted reconnect 3 times. Giving up.
24 May 2009 17:02:06,428
org.glite.ce.cream.jobmanagement.cmdexecutor.blah.LRMSEventsProcessor -
job CREAM615917738 not found!
24 May 2009 17:02:10,433 org.glite.ce.common.db.DatasourceManager -
Server connection failure during transaction. Due to underlying
exception: 'java.net.SocketException: Too many open files'.
--------------------------------------------------------------------------------------------------------------
--
------------------------------------------------------------------------
Jean-michel BARBET | Tel: +33 (0)2 51 85 84 86
Laboratoire SUBATECH Nantes France | Fax: +33 (0)2 51 85 84 79
CNRS-IN2P3/Ecole des Mines/Universite | E-Mail: [log in to unmask]
------------------------------------------------------------------------
|