Date: Tue, 22 May 2012 02:04:32 +0200
From: "magnum" <>
Subject: Re: Nvidia compiler bug

On Mon, 21 May 2012 23:56:15 +0200, Claudio André wrote:
>Hi, looking at the "verbosity" of the Nvidia compiler and comparing it
>with the compiler output for Lukas' CUDA code (thanks for your good
>code), I realized the compiler was doing something silly.
>So I used another valid path to achieve what I want, checked that it
>was doing what I expected, and:
>- result 3.2x faster.

Interesting. What went wrong and how did you mitigate it?

Btw I'm curious why your attempt at avoiding byte addressable store 
failed. When/where was it misaligned?
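
For readers who have not seen the technique: avoiding byte-addressable stores usually means keeping the buffer as 32-bit words and merging each byte in with shifts and masks. Below is a minimal sketch in plain C, not the code discussed in this thread. The helper names put_byte and get_byte are hypothetical, and the sketch assumes byte 0 is the least significant byte of each word.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: store one byte at byte offset `index` of a buffer
 * that is only ever accessed as aligned 32-bit words.
 * Assumption: byte 0 is the least significant byte of each word. */
static void put_byte(uint32_t *buf, size_t index, uint8_t value)
{
    size_t word = index / 4;                    /* which 32-bit word */
    unsigned shift = (unsigned)(index % 4) * 8; /* bit offset inside it */

    /* Clear the target byte, then OR in the new value. */
    buf[word] = (buf[word] & ~((uint32_t)0xFF << shift))
              | ((uint32_t)value << shift);
}

/* Hypothetical helper: read back the byte at byte offset `index`. */
static uint8_t get_byte(const uint32_t *buf, size_t index)
{
    return (uint8_t)(buf[index / 4] >> ((index % 4) * 8));
}
```

A scheme like this only works when the base pointer is itself 4-byte aligned. If the word pointer is derived from a byte offset that is not a multiple of 4, every word access is misaligned. Whether that is what happened here is not stated in the thread.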

