Hi all,
sorry for the multiple, to avoid the abnormal size growing of mom
logfile and also affect stability of front end server, have forced
delete the job. the msg no longer repeatedly append to the log. before
that, it have increase to 300M in 10min, after purging existing log file
and restart mom daemon as well. still have no idea why the log msg will
keep growing up to 11G?
did any other sites observe the same situation with atlas athena jobs?
thanks
BR,
J
Jason Shih wrote:
> Hi all,
>
>
> did anyone encounter the same situation before? the following msg keep
> appending to the mom log:
>
> 09/06/2009 19:31:33;0080; pbs_mom;Svr;preobit_reply;in while loop, no
> error from job stat
> 09/06/2009 19:31:33;0001;
> pbs_mom;Job;1841624.batch02.grid.sinica.edu.tw;Type CopyFiles request
> received from [log in to unmask], sock=13
> 09/06/2009 19:31:33;0080; pbs_mom;Svr;preobit_reply;top of preobit_reply
> 09/06/2009 19:31:33;0080;
> pbs_mom;Svr;preobit_reply;DIS_reply_read/decode_DIS_replySvr worked, top
> of while loop
>
> and result in around 11G log msg today? also the frequent query have
> cause front end server log file increase over 2G as well. wondering at
> which step the post processing of the job fail? but mom is actually
> functional normally, while zero server load observed for the atlas
> athena processes on the WN. execute since 'Sep05 0:00', any idea?
>
>
> BR,
> J
>
--
Jason Shih
ASGC/OPS
Tel: +886-2-2789-8374
Fax: +886-2-2783-5434
|