On 08/15/2014 02:43 PM, Andrew Lahiff wrote:
> Hi Steve,
>
> We haven't ever seen that. Currently:
>
> [root@condor01 ~]# condor_status -af HasFileTransfer | sort | uniq -c
> 10349 true
>
> Do all nodes have this problem or just some?
All of 7 of them. I was making two sets of changes. Once involved
accounting/fairshare, but I got
bogged down and tried to get scaling working. This involved a script
like this that gives
the scaling factors:
# cat modules/emi/files/condor/get_scale_factor.sh
#!/bin/bash
/usr/bin/perl -ne 'if (/model name/) {print "RalScaling = " ;
s/.*CPU\s*//;s/\s.*//;(/E5620/)?print 1.205:(/L5420/)?print
0.896:(/X5650/)?print 1.229:(/E5-2630/)?print 1.386:print 1.0; print
"\n";exit}' /proc/cpuinfo
Yes, I know it's a lash up. And I added some config like this:
LOCAL_CONFIG_FILE = /root/scripts/get_scale_factor.sh|
STARTD_ATTRS = $(STARTD_ATTRS) RalScaling
I was also bringing on some new (and different) nodes (that's why I need
this feature).
Anyway, I was in the midst of all that when this TARGET.HasFileTransfer
error came up, out of the blue!
There's practically no documentation on it. I've reverted a lot of edits
now, and it's scheduling again.
I'll do things one-at-a-time for a bit from now on.
I'll let you know when I find out what it was all about.
Cheers,
Steve
> Regards,
> Andrew.
>
> ________________________________________
> From: Stephen Jones [[log in to unmask]]
> Sent: Friday, August 15, 2014 2:28 PM
> To: [log in to unmask]
> Subject: TARGET.HasFileTransfer
>
> My condor jobs have suddenly stopped going on nodes due to HasFileTransfer.
>
> Has anybody seen this?
>
> Cheers
>
> Steve
>
> ---------
> # condor_q -analyze 26108.0
> ...
> The Requirements expression for your job is:
>
> ( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) &&
> ( TARGET.Disk >= RequestDisk ) && ( TARGET.Memory >= RequestMemory ) &&
> ( TARGET.Cpus >= RequestCpus ) && ( TARGET.HasFileTransfer )
>
>
> Suggestions:
>
> Condition Machines Matched Suggestion
> --------- ---------------- ----------
> 1 ( TARGET.HasFileTransfer ) 0 REMOVE
> 2 ( TARGET.Arch == "X86_64" ) 7
> 3 ( TARGET.OpSys == "LINUX" ) 7
> 4 ( TARGET.Disk >= 22 ) 7
> 5 ( TARGET.Memory >= 4000 ) 7
> 6 ( TARGET.Cpus >= 8 ) 7
>
> ---------
>
>
>
> --
> Steve Jones [log in to unmask]
> System Administrator office: 220
> High Energy Physics Division tel (int): 42334
> Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
> University of Liverpool http://www.liv.ac.uk/physics/hep/
--
Steve Jones [log in to unmask]
System Administrator office: 220
High Energy Physics Division tel (int): 42334
Oliver Lodge Laboratory tel (ext): +44 (0)151 794 2334
University of Liverpool http://www.liv.ac.uk/physics/hep/
|