Date: Wed, 30 Jan 2013 18:05:17 +0100
From: magnum <>
Subject: Re: Proposed optimizations to pwsafe

On 30 Jan, 2013, at 11:46 , Brian Wallace <> wrote:

> I cleaned up the OpenCL and CUDA code and added some optimizations.  The optimizations were not giving me the performance I wanted, so I started converting the code to SIMD.  So far I have it working at about 1/6 of the speed of the latest optimized version, and I'm not really sure why.  If anyone can give me a heads up on any obvious bottlenecks, please let me know.

Vectorizing is usually detrimental on NVIDIA, and on the 7970 as well. For older AMD cards and CPUs it can be rewarding. So if you do this, please make it conditional on -DVECTORIZE or something similar (see the Office formats for examples).
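
To illustrate the kind of compile-time switch meant here, a minimal sketch in plain C follows. It is not taken from the pwsafe or Office kernels: the names LANES, word_t and ch are hypothetical, and the struct merely stands in for what would be uint versus uint4 in an actual OpenCL kernel. The point is only that the same source builds scalar by default and 4-wide when -DVECTORIZE is passed.

```c
#include <stdint.h>

/* Hypothetical sketch: build with -DVECTORIZE to handle four candidates
 * per work item; without it, each work item handles exactly one. */
#ifdef VECTORIZE
#define LANES 4
#else
#define LANES 1
#endif

/* Stand-in for OpenCL's uint (scalar) or uint4 (vector) type. */
typedef struct {
    uint32_t s[LANES];
} word_t;

/* Lane-wise "choose" function of the kind used in SHA-2 style hashes:
 * for each bit, pick y where x is 1 and z where x is 0. */
static word_t ch(word_t x, word_t y, word_t z)
{
    word_t r;
    for (int i = 0; i < LANES; i++)
        r.s[i] = z.s[i] ^ (x.s[i] & (y.s[i] ^ z.s[i]));
    return r;
}
```

With this layout the default build stays scalar for NVIDIA and the 7970, and only devices that benefit from wider operations get the vector path.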

