Date: Tue, 22 May 2012 02:04:32 +0200
From: "magnum" <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Re: Nvidia compiler bug



On Mon, 21 May 2012 23:56:15 +0200 Claudio André 
<claudioandre.br@...il.com> wrote:
>Hi, looking at the "verbosity" of Nvidia compiler and comparing against
>Lukas CUDA code compiler output (thanks for your good code), i realized
>the compiler was doing something silly.
>So, i used another valid path to achieve what i want, checked if it was
>doing what i was expecting and:
>- result 3,2x faster.

Interesting. What went wrong and how did you mitigate it?

Btw, I'm curious why your attempt at avoiding byte-addressable stores
failed. When/where was it misaligned?

magnum
