
On Tue, Feb 22, 2005 at 05:54:13PM +0100 or thereabouts, Ricardo Graciani wrote:
> Hi Wei,

I see you have maui installed and

# showq -i

shows that all the dteam jobs will run next. If you also
want to reserve a CPU for a particular group, put

SRCFG[sl3]          STARTTIME=0:00:00 ENDTIME=24:00:00
SRCFG[sl3]          PERIOD=INFINITY
SRCFG[sl3]          TASKCOUNT=2 RESOURCES=PROCS:1,MEM:400
SRCFG[sl3]          GROUPLIST=dteam
SRCFG[sl3]          NODEFEATURES=mon

in maui.cfg and restart maui.

# showres -n

will show you what node the reservation has been made on.
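
To cross-check how many job slots each node has and how many are
in use, the `pbsnodes -a` listing can be summarised. This is only
a minimal sketch in Python, assuming the record layout shown in the
quoted output further down (unindented node name, indented
"key = value" lines). The function name slots_per_node is
illustrative and not part of any tool.

```python
def slots_per_node(pbsnodes_text):
    """Return {node: (np, jobs_in_use)} from `pbsnodes -a` style text."""
    nodes = {}
    current = None
    for line in pbsnodes_text.splitlines():
        if not line.strip():
            continue
        if not line[0].isspace():
            # An unindented line starts a new node record.
            current = line.strip()
            nodes[current] = [0, 0]
            continue
        if current is None:
            continue
        key, _, value = line.strip().partition(" = ")
        if key == "np":
            nodes[current][0] = int(value)
        elif key == "jobs":
            # Jobs are listed as "slot/jobid" entries separated by commas.
            nodes[current][1] = len([j for j in value.split(",") if j.strip()])
    return {name: tuple(counts) for name, counts in nodes.items()}
```

Feed it the text captured from `pbsnodes -a` (with any mail quote
markers removed); a node where the second number equals the first
has no free slot.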

Read the maui manual, though, for the details of how to tweak
this.
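
When tuning this, it helps to know how the jobs are split between
queues. A rough sketch in Python that tallies default `qstat`
output per queue and state, assuming the six-column layout of the
listing quoted below; tally_qstat is an illustrative name only.

```python
from collections import Counter

def tally_qstat(qstat_text):
    """Count jobs per (queue, state) from default `qstat` output."""
    counts = Counter()
    for line in qstat_text.splitlines():
        fields = line.split()
        # Job rows have 6 columns and a job id starting with a digit;
        # this skips the header line and the dashed separator.
        if len(fields) == 6 and fields[0][0].isdigit():
            state, queue = fields[4], fields[5]
            counts[(queue, state)] += 1
    return counts
```

Applied to the quoted listing (with the mail quote markers
stripped), this counts 14 running and 31 queued lhcb jobs and
4 queued jobs in the short queue.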

 Steve




>
>         It depends on your CPU, but about 1 day for a 2GHz Pentium 4.
>
>         I would suggest that you set up your site so that not all WNs can
> be occupied by "long" jobs. It is not a problem "per se" but dteam test
> jobs will have to wait.
>
>         Regards
>
>                 Ricardo
>
>
> ========================================================================
>
> Ricardo Graciani Diaz
>
> Dept. Estructura i Constituents de la Materia
> Facultat de Fisica                              Tel: +34 93 403 9183
> Universitat de Barcelona                        Fax: +34 93 402 1198
>
> Diagonal, 647
> E-08028 Barcelona
>
> ========================================================================
>
>
>
> > -----Original Message-----
> > From: LHC Computer Grid - Rollout [mailto:[log in to unmask]]
> > On Behalf Of Wei Xing
> > Sent: Tuesday, 22 February 2005 17:44
> > To: [log in to unmask]
> > Subject: [LCG-ROLLOUT] LHCB jobs.
> >
> > Hi,
> >
> > Normally, how long do the lhcb jobs take?
> >
> > As you can see, my CE queue is full of lhcb jobs, and all worker nodes
> > are busy for more than 8 hours.
> >
> >
> > ========================================================================
> > qstat
> > Job id           Name             User             Time Use S Queue
> > ---------------- ---------------- ---------------- -------- - -----
> > 8058.ce101         STDIN            lhcb001          08:08:14 R lhcb
> > 8059.ce101         STDIN            lhcb001          08:04:31 R lhcb
> > 8060.ce101         STDIN            lhcb001          07:59:55 R lhcb
> > 8061.ce101         STDIN            lhcb001          07:59:55 R lhcb
> > 8062.ce101         STDIN            lhcb001          07:55:36 R lhcb
> > 8063.ce101         STDIN            lhcb001          07:55:15 R lhcb
> > 8064.ce101         STDIN            lhcb001          07:55:40 R lhcb
> > 8065.ce101         STDIN            lhcb001          07:54:31 R lhcb
> > 8066.ce101         STDIN            lhcb001          07:52:02 R lhcb
> > 8067.ce101         STDIN            lhcb001          07:51:11 R lhcb
> > 8068.ce101         STDIN            lhcb001          07:44:34 R lhcb
> > 8069.ce101         STDIN            lhcb001          07:43:40 R lhcb
> > 8070.ce101         STDIN            lhcb001          07:44:12 R lhcb
> > 8071.ce101         STDIN            lhcb001          07:43:19 R lhcb
> > 8072.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8073.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8074.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8075.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8076.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8077.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8078.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8079.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8080.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8081.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8082.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8083.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8084.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8085.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8086.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8087.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8088.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8089.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8090.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8091.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8092.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8093.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8094.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8095.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8096.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8097.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8098.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8099.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8100.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8101.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8102.ce101         STDIN            lhcb001                 0 Q lhcb
> > 8103.ce101         STDIN            dteam002                0 Q short
> > 8106.ce101         STDIN            dteam002                0 Q short
> > 8119.ce101         STDIN            dteam002                0 Q short
> > 8144.ce101         STDIN            dteam001                0 Q short
> >
> >
> >
> > ========================================================================
> >
> > pbsnodes -a
> > wn101.grid.ucy.ac.cy
> >      state = job-exclusive
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8070.ce101.grid.ucy.ac.cy, 1/8071.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn101.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=18355 4154 9906 10647,nsessions=4,nusers=3,idletime=13914,totmem=2077604kb,availmem=1005892kb,physmem=1025356kb,ncpus=4,loadave=1.99,rectime=1109090294
> >
> > wn102.grid.ucy.ac.cy
> >      state = job-exclusive,busy
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8068.ce101.grid.ucy.ac.cy, 1/8069.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn102.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=18264 18100 19278 26079 26820,nsessions=5,nusers=4,idletime=14008,totmem=2077604kb,availmem=997384kb,physmem=1025356kb,ncpus=4,loadave=2.00,rectime=1109090304
> >
> > wn103.grid.ucy.ac.cy
> >      state = job-exclusive,busy
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8066.ce101.grid.ucy.ac.cy, 1/8067.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn103.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=3342 10709 3055 3814,nsessions=4,nusers=3,idletime=58030,totmem=2077120kb,availmem=1001196kb,physmem=1024872kb,ncpus=4,loadave=2.00,rectime=1109090304
> >
> > wn104.grid.ucy.ac.cy
> >      state = job-exclusive
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8064.ce101.grid.ucy.ac.cy, 1/8065.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn104.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=3340 11436 6145 6892,nsessions=4,nusers=3,idletime=59423,totmem=3121344kb,availmem=2054832kb,physmem=1024872kb,ncpus=4,loadave=2.02,rectime=1109090313
> >
> > wn105.grid.ucy.ac.cy
> >      state = job-exclusive,busy
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8062.ce101.grid.ucy.ac.cy, 1/8063.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn105.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=3359 10679 27524 28180,nsessions=4,nusers=3,idletime=615473,totmem=3121344kb,availmem=2034540kb,physmem=1024872kb,ncpus=4,loadave=2.00,rectime=1109090320
> >
> > wn106.grid.ucy.ac.cy
> >      state = job-exclusive
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8060.ce101.grid.ucy.ac.cy, 1/8061.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn106.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=3359 10673 1194 1850,nsessions=4,nusers=3,idletime=59152,totmem=3065116kb,availmem=1990088kb,physmem=1024872kb,ncpus=4,loadave=2.02,rectime=1109090313
> >
> > wn107.grid.ucy.ac.cy
> >      state = job-exclusive
> >      np = 2
> >      properties = lcgpro
> >      ntype = cluster
> >      jobs = 0/8058.ce101.grid.ucy.ac.cy, 1/8059.ce101.grid.ucy.ac.cy
> >      status = arch=linux,uname=Linux wn107.grid.ucy.ac.cy 2.4.21-20.ELsmp #1 SMP Thu Sep 2 16:47:25 CDT 2004 i686,sessions=3339 11581 21919 22725,nsessions=4,nusers=3,idletime=20071,totmem=3121344kb,availmem=2059296kb,physmem=1024872kb,ncpus=4,loadave=2.00,rectime=1109090313
> >
> >
> > ============================================================
> >
> > top
> >
> > 22720 lhcb001   25   0  171M 160M 47060 R    100.0 16.0 486:22   0 ld-linux.so.2
> > 23528 lhcb001   25   0  178M 170M 43744 R    100.0 17.0 482:31   3 ld-linux.so.2
> >
> >
> > Regards
> >
> > Wei
> >
> > --
> > ============================================================
> > Wei Xing, M.Sc.
> > Research Associate                    Tel: 00357-22892663
> > Dept. of Computer Science             Fax: 00357-22892701
> > University of Cyprus                  email: [log in to unmask]
> > PO Box 20537
> > CY1678, Nicosia, CYPRUS

--
Steve Traylen
[log in to unmask]
http://www.gridpp.ac.uk/