Hello Yuri
The user in question is known to torque but local users should have nothing to do with CE, IMO. Otherwise it brakes security rules. We keep local and grid user directories in separate top directories on the wn's.
Thanks
Elena
On 12 Nov 2013, at 14:20, Yuri P. Ivanov wrote:
> Dear Elena,
>
>> On Tue, 12 Nov 2013, Elena Korolkova wrote:
>> ...
>>
>> [LRMS]
>> lrms_backend_cmd: /usr/libexec/lrmsinfo-pbs
>> [Scheduler]
>> cycle_time : 0
>> vo_max_jobs_cmd: /usr/libexec/vomaxjobs-maui -h torque.hep
>>
>>
>> [root@lcgce2 ~]# /usr/libexec/lcg-info-dynamic-scheduler -c /etc/lrms/scheduler.conf
>> ERROR:Analyzer.DataHandler:Cannot analyze: {'startAnchor': 'start_time', 'name': 'job_118523', 'qtime': 1383602689, 'jobid': '928260.torque.shef.ac.uk', 'queue': 'long', 'start': 1383602881, 'state': 'running', 'cpucount': 1, 'user': 'ksuruliz', 'maxwalltime': 691200, 'walltime': 660946} (Missing group in {'startAnchor': 'start_time', 'name': 'job_118523', 'qtime': 1383602689, 'jobid': '928260.torque.shef.ac.uk', 'queue': 'long', 'start': 1383602881, 'state': 'running', 'cpucount': 1, 'user': 'ksuruliz', 'maxwalltime': 691200, 'walltime': 660946})
>> ERROR:Analyzer.DataHandler:Cannot analyze: {'startAnchor': 'start_time', 'name': 'job_118524', 'qtime': 1383602689, 'jobid': '928261.torque.shef.ac.uk', 'queue': 'long', 'start': 1383606429, 'state': 'running', 'cpucount': 1, 'user': 'ksuruliz', 'maxwalltime': 691200, 'walltime': 657373} (Missing group in {'startAnchor': 'start_time', 'name': 'job_118524', 'qtime': 1383602689, 'jobid': '928261.torque.shef.ac.uk', 'queue': 'long', 'start': 1383606429, 'state': 'running', 'cpucount': 1, 'user': 'ksuruliz', 'maxwalltime': 691200, 'walltime': 657373})
>> ......
>> ERROR:Analyzer.DataHandler:Cannot analyze: {'name': 'a3c3b0b68546e2', 'qtime': 1384263805, 'jobid': '961475.torque.shef.ac.uk', 'queue': 'short', 'state': 'queued', 'user': 'mcfayden', 'maxwalltime': 28800} (Missing group in {'name': 'a3c3b0b68546e2', 'qtime': 1384263805, 'jobid': '961475.torque.shef.ac.uk', 'queue': 'short', 'state': 'queued', 'user': 'ksuruliz', 'maxwalltime': 28800})
>> ERROR:lcg-info-dynamic-scheduler:Execution error: Traceback (most recent call last):
>> File "/usr/libexec/lrmsinfo-pbs", line 60, in main
>> QStatHandler.parse(sContainer, infile)
>> File "/var/tmp/lcg-info-dynamic-scheduler-pbs-2.4.2-2.el5-root-andreett/usr/lib/python2.4/site-packages/TorqueInfoUtils/QStatHandler.py", line 211, in parse
>> Exception: exceptions.Exception: (Cannot find user for 928260.torque.shef.ac.uk)
>
> Usually it means that this user (i.e. local one) is not known for the
> system where this command executed. Please check it with "id ksuruliz"
> or "finger ksuruliz". We also had similar errors when PBS server was
> separate from creamCE machine. CE also needs to know local users.
>
> With the best wishes,
>
> Yuri Ivanov
__________________________________________________
Dr Elena Korolkova
Email: [log in to unmask]
Tel.: +44 (0)114 2223553
Fax: +44 (0)114 2223555
Department of Physics and Astronomy
University of Sheffield
Sheffield, S3 7RH, United Kingdom
|