Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Thu, 20 May 2010 14:22:43 +0200
Subject: Re: C compiler generated SSE2 code

I tested with a handful of compilers on my core 2 computer. With all 
compilers the fastest code was produced with DES_BS_VECTOR = 2, no 
VECTOR34 and no MIX.

The following combinations did not work at all :

DES_BS_VECTOR = 4, VECTOR34 and MIX, with gcc 4.5.0, 4.3.2 and icc 10.1. 
Clang did work fine however.

The best speeds were :
Reference : Vanilla, gcc-4.5.0 : 2706K/2329K

icc 10.1 : 2662K/2259K
gcc 4.3.2  : 2692K/2314K
gcc 4.5.0 : 2713K/2332K (with your inline flags)
clang : 2763K/2363K (also with your inline flags)

What I find interesting is that clang performs so well and icc so badly. 
I always thought it was a better compiler performance-wise for 
intrinsics, as it performs much better than the others with md5.

I'll try icc 11 ASAP.

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.