On Dec 15, 2003, at 6:33 PM, Aleksandar Donev wrote: > Hello, > > My department is getting a new cluster and they are thinking about > what kinds > of processors to use. Among the options is G5. > > I would appreciate any info about Fortran 95 on G5 platforms. As far > as I > know, Absoft, NAG and now IBM (still beta) support this chip on OS X. > Experiences as well as any information on web-pages to look at for info > (pricing, requirements, etc.) would be helpful. > "We write quantum chemistry software (ADF) for several platforms, including OS X. We recently had access to a Dual 2Ghz G5, and did some testing. Here is what I posted to the scitech list at Apple: G4 466Mhz, 512Mb RAM, 1 processor Absoft Compiler (-O) CPU= 9394.72 Elapsed= 11076.76 XLF Compiler (-O3 -qhot -qtune=g5 -qarch=g5) CPU= 6003.03 Elapsed= 6284.52 G4 1Ghz, 2 processor Absoft compiler, 2 processor with PVM CPU= 2570.18 Elapsed= 2605.85 G5 2GHz, 512Mb RAM, 2 processor Absoft Compiler (-O) CPU= 1963.01 Elapsed= 1968.87 XLF Compiler (-O3 -qhot -qtune=g5 -qarch=g5) CPU= 1291.18 Elapsed= 1294.25 XLF Compiler (same flags), 2 processors, PVM CPU= 739.54 Elapsed= 800.63 Dual Xeon PIV 2.4 GHz, 1 CPU PGF90 Compiler CPU= 2091.60 Elapsed= 2193.39 IFC Intel compiler (-O3) CPU= 1143.88 Elapsed= 1272.71 Dual-CPU Pentium IV cluster, 2.4 GHz IFC compiler, 1 processor CPU= 1273.81 Elapsed= ??? IFC compiler, 2 processor CPU= 784.35 Elapsed= ??? 1.5 GHz Madison itanium2 IFC compiler, 1 processor CPU= 840.20 Elapsed= 886.14 IFC compiler, 2 processor CPU= 432.58 Elapsed= 460.48 1.5 GHz Itanium2 HP Compiler, 1 processor CPU= 586.78 Elapsed= 604.61 My conclusions: The XLF compiler produces code that is around 30-40% faster than Absoft, at least for ADF. I am a bit disappointed by the G5's performance with ADF. The times for the G5 are approximately in proportion to those for my old G4, once clock speed has been accounted for. I was hoping that architecture improvements in the G5, such as two FPUs, and that phenomenal frontside bus, would be seen in the benchmarks, but ADF doesn't seem to gain much from them. The times for a 2GHz G5 are very similar to those of a 2.4GHz Pentium, when using the Intel compiler on the Pentium (the portland group compiler produces considerably slower code). The G5 does seem to scale a bit better when run on two processors than the Pentium. I have no benchmarks for a 3.2GHz Pentium, but assuming that performance scales with clock speed, Intel still has the upper hand for running ADF. The difference for other programs seems to be less than for ADF though. Lastly, the new Itanium2 is an efficient and fast chip, but also more expensive. The HP compiler seems to be uncharacteristically good ;-)" Since writing these comments, I now think I know why the G5 is not great for ADF. ADF is old code, and is basically written as a bit dot product. The G5 has 2 FPUs, and two load/store units. A dot product requires two load/stores per multiply, and so basically the G5 is only using 1 FPU. This explains why it is not really any faster than the G4, when for many codes it is. So, if you code is well optimized, I think you will find the G5 to be very competitive with the fastest pentiums of the day, even with half the clock speed. Drew McCormack Theoretical Chemistry Free University, Amsterdam