john-dev - Re: descrypt speed

Follow @Openwall on Twitter for new release announcements and other news

[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]

Message-ID: <7174a14ccd6fe3099723d792e418c22a@smtp.hushmail.com>
Date: Sun, 02 Nov 2014 22:19:03 +0100
From: magnum <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Re: descrypt speed

On 2014-11-02 18:59, Royce Williams wrote:
> On Thu, Oct 30, 2014 at 9:33 PM, magnum <john.magnum@...hmail.com> wrote:
>> On 2014-10-31 06:02, Royce Williams wrote:
>>> On a GTX970, shouldn't this be sm_52?
>>
>> You can force this by editing NVCC_FLAGS in Makefile. Add something like
>> "-arch sm_50" (or 52). But I doubt it will make much difference and it will
>> only affect CUDA formats.
>
> In my system with both an sm_20 and an sm_50 card, when running solely
> descrypt-opencl (not CUDA), the ptxas info shows that sm_50 is involved in
> some way.  Is this cosmetic?

OpenCL compiles a suitable (different) kernel for each and you do not 
have to configure anything.

You can configure CUDA for compiling several archs at once, see "nvcc 
--help". It something like "-gencode arch=compute_20,code=sm_20 -gencode 
arch=compute_50,code=sm_50" (added to NVCC_FLAGS instead of just -arch 
sm_xx). The one most suitable of them will be picked at runtime.

magnum

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.