Date: Tue, 22 May 2012 02:04:32 +0200
From: "magnum" <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Re: Nvidia compiler bug

On Mon, 21 May 2012 23:56:15 +0200 Claudio André <claudioandre.br@...il.com> wrote:
>Hi, looking at the "verbosity" of Nvidia compiler and comparing against
>Lukas CUDA code compiler output (thanks for your good code), i realized
>the compiler was doing something silly.
>So, i used another valid path to achieve what i want, checked if it was
>doing what i was expecting and:
>- result 3,2x faster.

Interesting. What went wrong and how did you mitigate it?

Btw I'm curious why your attempt at avoiding byte addressable store failed. When/where was it misaligned?

magnum