|
Date: Wed, 6 Jun 2012 20:07:18 +0530
From: SAYANTAN DATTA <std2048@...il.com>
To: john-dev@...ts.openwall.com
Subject: Re: HD 7970 LDS Bank Conflicts
Hi,
On Wed, Jun 6, 2012 at 7:06 PM, Alain Espinosa <alainesp@...il.com> wrote:
> Division and modulus are very expensive. I change to use AND and shift
> and i get 25% increase only by that (changing the algorithm).
>
Yes it's true. But I've used modulus and division at only twice or thrice
per kernel which corresponds to 3/(1024*32) th of total ALU ops per kernel.
So its not a major issue.
Why, 4KB is still under the limit for local memory if enough big
> worksize is used.
>
Using LDS I'm getting 60% better performance than using only global memory
even though the ALU is underutilized. Also I haven't done anything to
minimize LDS bank conflicts. Hoping to get even more performance after this
optimization.
Regards,
Sayantan
Content of type "text/html" skipped
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.