john-dev - Vectorizing OpenCL

Message-ID: <55ad18876ddcead0b360eef7f9c2f469@smtp.hushmail.com>
Date: Tue, 24 Apr 2012 21:49:14 +0200
From: magnum <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Vectorizing OpenCL

I had a go at vectorizing raw-sha1-opencl just to see how it's done. It
was *extremely* easy (I had a sneak at Lukas' phpass for hints).
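
For anyone wondering what "vectorizing" means here: in OpenCL it mostly
amounts to changing the working variables from uint to a vector type such
as uint4, so that each work-item hashes several candidates at once and the
arithmetic operators apply per component. Below is a minimal sketch of that
idea in plain C, not the attached patch. The u32x4 struct is a hypothetical
stand-in for OpenCL's uint4, and the names sha1_step0 and sha1_step0_x4 are
made up for illustration; only one SHA-1 step from rounds 0..19 is shown.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for OpenCL's built-in uint4 vector type. */
typedef struct { uint32_t s[4]; } u32x4;

/* Rotate left by n bits; n must be in 1..31 to avoid an undefined shift. */
static uint32_t rotl32(uint32_t x, unsigned n)
{
    assert(n > 0 && n < 32);
    return (x << n) | (x >> (32 - n));
}

/* SHA-1 "choose" function used in rounds 0..19. */
static uint32_t sha1_f0(uint32_t b, uint32_t c, uint32_t d)
{
    return (b & c) | (~b & d);
}

/* Scalar version: the new value of 'a' after one step of rounds 0..19,
 * for a single candidate. 0x5A827999 is the SHA-1 constant for these rounds. */
static uint32_t sha1_step0(uint32_t a, uint32_t b, uint32_t c,
                           uint32_t d, uint32_t e, uint32_t w)
{
    return rotl32(a, 5) + sha1_f0(b, c, d) + e + 0x5A827999u + w;
}

/* "Vectorized" version: the same step for four independent candidates.
 * In OpenCL C this loop would not be written out; with uint4 operands the
 * operators +, &, |, ~ and the shifts already work component-wise. */
static u32x4 sha1_step0_x4(u32x4 a, u32x4 b, u32x4 c,
                           u32x4 d, u32x4 e, u32x4 w)
{
    u32x4 r;
    for (int i = 0; i < 4; i++)
        r.s[i] = sha1_step0(a.s[i], b.s[i], c.s[i], d.s[i], e.s[i], w.s[i]);
    return r;
}
```

Each lane is an independent candidate, so the result in lane i is exactly
what the scalar code would give for that candidate. Whether the wider type
is a win depends on the device and the compiler.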

I expected it to boost AMD GPUs but it doesn't (not on Cedar at least,
12% drop). It had a devastating effect on the 9600GT, about half the speed,
and the GTX580 had a 20% drop.

For some reason, Intel's OpenCL CPU compiler fails to vectorize it (i.e.
make SSE2 out of it). That's strange. But the AMD CPU compiler seemed to
get the hint: it doesn't say anything in the build log, but the code got
about 3x faster (though it was much slower than Intel before the change).

All in all, there seems to be no point in using this code. I'm posting
it here as an example. Nothing is optimised.

magnum

View attachment "0001-Raw-sha1-openssl-vectorized-just-for-getting-the-han.patch" of type "text/x-patch" (6566 bytes)
