Date: Sat, 22 Aug 2015 04:48:56 +0300
From: Solar Designer <solar@...nwall.com>
To: john-dev@...ts.openwall.com
Subject: Re: PHC: Argon2 on GPU

On Fri, Aug 21, 2015 at 05:40:42PM +0200, Agnieszka Bielec wrote:
> 2015-08-20 22:34 GMT+02:00 Solar Designer <solar@...nwall.com>:
> > You could start by experimenting with a much simpler than Argon2 yet in
> > some ways similar kernel: implement some trivial operation like XOR on
> > different vector widths and see whether/how this changes the assembly.
> > Then make it slightly less trivial (just enough to prevent the compiler
> > from optimizing things out) and add uses of private or local memory,
> > and see if you can make it run faster by using wider vectors per the
> > same private or local memory usage.
>
> I tested (only 960m)
> -copying memory from __private to __private
> - from __global to __private
> -xoring private tables with __private tables
>
> using ulong, ulong2, ulong4, ulong8 (I was getting empty kernel using ulong16)

Great.  Where is the OpenCL code for these?  I'd like to take a look at
what exactly you were testing.

Alexander