Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Sun, 01 Apr 2012 20:17:55 +0200
From: magnum <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Re: fast hashes on GPU

On 04/01/2012 07:46 PM, Solar Designer wrote:
> On Sun, Apr 01, 2012 at 06:21:14PM +0200, magnum wrote:
>> On 03/31/2012 02:19 PM, Solar Designer wrote:
>>> ...Oh, I just got it to:
>>>
>>> Many salts:     38062K c/s real, 38062K c/s virtual
>>> Only one salt:  26270K c/s real, 26270K c/s virtual
>>>
>>> by simply adding "#pragma unroll 64" before the last loop in
>>> sha512_block().
>>
>> Did you add just that very pragma line,
> 
> Yes.  There were similar lines for nearby loops, but somehow not for
> that one yet.
> 
>> or did you also add something
>> like "#pragma OPENCL EXTENSION cl_nv_pragma_unroll : enable" somewhere
>> as well?
> 
> No.  This was CUDA code, not OpenCL.

Ah, yes.

Lukas, or anyone, could you explain how to use pragma unroll in OpenCL?
I don't seem to get any impact from it (whereas manual unrolling
provides a significant speedup).

magnum

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.