JiscMail Logo
Email discussion lists for the UK Education and Research communities

Help for COMP-FORTRAN-90 Archives


COMP-FORTRAN-90 Archives

COMP-FORTRAN-90 Archives


COMP-FORTRAN-90@JISCMAIL.AC.UK


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

COMP-FORTRAN-90 Home

COMP-FORTRAN-90 Home

COMP-FORTRAN-90  January 2011

COMP-FORTRAN-90 January 2011

Options

Subscribe or Unsubscribe

Subscribe or Unsubscribe

Log In

Log In

Get Password

Get Password

Subject:

Re: Finding vector size

From:

Bill Long <[log in to unmask]>

Reply-To:

Fortran 90 List <[log in to unmask]>

Date:

Tue, 4 Jan 2011 16:18:00 -0600

Content-Type:

text/plain

Parts/Attachments:

Parts/Attachments

text/plain (101 lines)

On 1/4/11 1:28 PM, Greenberg, Naomi wrote:
> Bill,
>    I inherited some code that was optimized for a Cray vector machine.  There
> originally were very large computational loops that worked over large
> arrays.  The person who vectorized it (very successfully) broke up the large

It sounds like the code was already vectorized.   "optimized for a Cray 
vector machine" did allow vector loads from memory that were strided or 
gathers (scatters for stores), and hardware vectorization of conditional 
expressions, so the range of code that vectorized was a lot wider than 
with an x86-64-style processor.  However, if the original array 
references were stride-1 (contiguous in memory), then the original code 
might not be that bad for the x86-64.  It is much easier to have the 
compiler internally break up the loop than recoding it by hand. Even if 
you have to play around with compiler flags.  x86-64 compilers have 
gotten a lot better at vectorization in recent years.

> loop into a series of small simple clear vectorizable loops that computed
> partials and then combined them later.  He sized the partial arrays based on
> a vector size (R(nvec,2,3), with loops going in nvec chunks).  My questions

Actually, I suspect he sized the temp arrays based mainly on the cache 
size.  If you fail to keep these temps in cache, then most of the 
benefit of doing this sort of hand optimization is lost.  The value of 
nvec should be a multiple of the hardware vector length, but what 
multiple depends on the cache sizes and the number of temps.  If you 
pick nvec = n*16, the value of nvec should be a multiple of the vector 
length for all of the "small vector" architectures.  Play around with 
different values of n to get optimal performance.

> now are 1) what should nvec be set for an Itanium-based linux system or
> other 32 bit system (Intel compiler)? Is there a way to compute this
> automatically? 2) Is this the best way to do this still? Can I be hurt by it
> on other machines? Again, it's not just the loop counters that are sized,

If you try to migrate this code to a GPU accelerated system, you will 
really want the original long vectors back again.   The games involving 
R(nvec,2,3) style optimizations are particular to the particular 
architecture - it is not a general scheme where nvec is a parameter that 
can be varied to encompass all architectures.

Cheers,
Bill



> it's the actual data structure sizes also.
>
> Thanks,
> Naomi
>
> -----Original Message-----
> From: Fortran 90 List [mailto:[log in to unmask]] On Behalf Of
> Bill Long
> Sent: Tuesday, January 04, 2011 11:48 AM
> To: [log in to unmask]
> Subject: Re: Finding vector size
>
> The optimal vector size for each machine should be known internally by
> the compiler.  Write the loop as the algorithm dictates.  The compiler
> will divide it up into vector "chunks" automatically if the body of the
> loop can be executed by vector hardware.  User attempts to manually
> reform loops for presumed vector lengths,  pipelining,  or cache
> blocking are generally counter-productive.  The result is code that is
> unclear to read,  difficult to maintain,  and confusing to the compiler.
>    Compiler optimizers work best on simple, clean loops.
>
> Cheers,
> Bill
>
>
> On 1/4/11 9:37 AM, Greenberg, Naomi wrote:
>> I am trying to find a way to configure code before compile time to set
>> the optimal loop vectorization size for the user's machine and then
>> (using the Fortran preprocessor) get that value and set the loop size to
>> this value. For example, on Machine1, nvec might be 64, on machine2, it
>> might be 1024, and the code would "do i=1,nvec" (obviously not quite
>> that way). The question is whether there's a way to automatically get
>> the optimal vector size from each machine (using Linux) or whether
>> there's a better way to get the same result? Any suggestions are welcome!
>>
>> Naomi Greenberg
>>
>> /Member of the Research Staff/
>>
>> Riverside Research Institute
>>
>> (212) 502-1718 (ph)
>>
>> (212) 502-1729 (fax)
>>
>> [log in to unmask]
>>
>

-- 
Bill Long                                           [log in to unmask]
Fortran Technical Support    &                 voice: 651-605-9024
Bioinformatics Software Development            fax:   651-605-9142
Cray Inc./Cray Plaza, Suite 210/380 Jackson St./St. Paul, MN 55101

Top of Message | Previous Page | Permalink

JiscMail Tools


RSS Feeds and Sharing


Advanced Options


Archives

December 2023
February 2023
November 2022
September 2022
February 2022
January 2022
June 2021
November 2020
September 2020
June 2020
May 2020
April 2020
December 2019
October 2019
September 2019
March 2019
February 2019
January 2019
November 2018
October 2018
September 2018
August 2018
July 2018
May 2018
April 2018
March 2018
February 2018
January 2018
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
January 2017
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
December 2015
November 2015
October 2015
September 2015
August 2015
June 2015
April 2015
March 2015
January 2015
December 2014
November 2014
October 2014
August 2014
July 2014
May 2014
April 2014
March 2014
February 2014
January 2014
December 2013
November 2013
July 2013
June 2013
May 2013
April 2013
February 2013
January 2013
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
April 2012
March 2012
February 2012
January 2012
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
December 2010
November 2010
October 2010
August 2010
July 2010
June 2010
March 2010
February 2010
January 2010
December 2009
October 2009
August 2009
July 2009
June 2009
March 2009
February 2009
January 2009
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
January 2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997


JiscMail is a Jisc service.

View our service policies at https://www.jiscmail.ac.uk/policyandsecurity/ and Jisc's privacy policy at https://www.jisc.ac.uk/website/privacy-notice

For help and support help@jisc.ac.uk

Secured by F-Secure Anti-Virus CataList Email List Search Powered by the LISTSERV Email List Manager