Dear Maarten/All,
We deployed gLExec on EMI-2 (SL6) at our site last month, and it worked fine for a week or two.
To get gLExec operational, we added the following line to our users.conf:
================================
55311:pilops01:46003,45000:opspil,ops:ops:pilot
================================
Also, in group.conf, the pilot role has been enabled for the OPS VO as follows:
================================
"/ops/ROLE=pilot":::pilot:
================================
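As a sanity check on the two entries above, here is a minimal Python sketch that splits them into fields and confirms that the flag in both files agrees. It assumes the usual YAIM layouts as we understand them (UID:LOGIN:GIDs:GROUPs:VO:FLAG for users.conf and "FQAN":group:gid:flag for group.conf); the function names parse_users_line and parse_groups_line are hypothetical helpers written for illustration and are not part of YAIM.

```python
# Sketch only: field layouts are assumed from the YAIM conventions,
# and both helper functions are hypothetical (not shipped with YAIM).

def parse_users_line(line):
    """Split a users.conf entry: UID:LOGIN:GID1,GID2:GROUP1,GROUP2:VO:FLAG[:]"""
    fields = line.strip().rstrip(":").split(":")
    uid, login, gids, groups, vo, flag = fields
    return {
        "uid": int(uid),
        "login": login,
        # Pair each group name with the GID in the same position.
        "groups": dict(zip(groups.split(","), (int(g) for g in gids.split(",")))),
        "vo": vo,
        "flag": flag,
    }

def parse_groups_line(line):
    """Split a group.conf entry: "FQAN":group:gid:flag:"""
    fqan, rest = line.strip().rsplit('"', 1)
    group, gid, flag = rest.split(":")[1:4]
    return {"fqan": fqan.lstrip('"'), "group": group, "gid": gid, "flag": flag}

user = parse_users_line("55311:pilops01:46003,45000:opspil,ops:ops:pilot")
group = parse_groups_line('"/ops/ROLE=pilot":::pilot:')

# The pilot FQAN is only mapped to the pilot account if the flags match.
print(user["login"], user["groups"], user["flag"] == group["flag"])
```

With the two lines quoted above, the primary group comes out as opspil (GID 46003) and the flag is "pilot" in both files, so at least the mapping itself looks consistent on paper.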
However, for over a week now, the corresponding ROC Nagios tests, i.e. "org.sam.glexec.CE-JobState-/ops/Role=pilot" and "org.sam.glexec.CE-JobSubmit-/ops/Role=pilot", have been failing consistently, and we cannot tell why.
The full error logs can be found at:
https://rocnagios.grid.sinica.edu.tw/nagios/cgi-bin/extinfo.cgi?type=2&host=pcncp04.ncp.edu.pk&service=org.sam.glexec.CE-JobState-%2Fops%2FRole%3Dpilot
and
https://rocnagios.grid.sinica.edu.tw/nagios/cgi-bin/extinfo.cgi?type=2&host=pcncp04.ncp.edu.pk&service=org.sam.glexec.CE-JobSubmit-%2Fops%2FRole%3Dpilot
respectively.
Those pages report several types of problems, such as:
1) "/bin/mkdir: cannot create directory `/var/cream_sandbox/opspil': Permission denied chmod: cannot access `/var/cream_sandbox/opspil"
2) "Couldn't find a valid proxy. globus_sysconfig"
3) "Unable to find user certificate or key: /var/cream_sandbox/opspil/CN_Liaw_SyueYi_182693_OU_GRID_O_AS_C_TW_ops_Role_pilot_Capability_NULL_ops038/proxy//13811441852E819983rocwms012Egrid2Esinica2Eedu2Etw_18273784823970.16631"
etc.
Interestingly, only the OPS VO seems to be affected, while the CMS VO is running fine (according to ROC SAM Nagios).
Are such failures problematic for our site's availability and reliability?
We are not clear what the root cause could be.
Can someone help us in this matter?
Thanks,
-- Best Regards --
Adeel-ur-Rehman
NCP-LCG2 Site Admin