Date: Wed, 30 Jan 2013 18:05:17 +0100 From: magnum <john.magnum@...hmail.com> To: john-dev@...ts.openwall.com Subject: Re: Proposed optimizations to pwsafe On 30 Jan, 2013, at 11:46 , Brian Wallace <nightstrike9809@...il.com> wrote: > I cleaned up the OpenCL and CUDA code, with some additional optimizations. The optimizations were not getting the performance I want, so I started on trying to convert to SIMD code. So far I have it working at about a 1/6 of the speed of the latest optimized, and I'm not really sure why. If anyone can give me a heads up on any obvious bottle necks, please let me know. Vectorizing is usually detrimental on nvidia, and on 7970 as well. For older AMD cards and CPU it can be rewarding. So if you do this please do with -DVECTORIZE or something (See Office formats for examples). magnum
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.