Openwall GNU/*/Linux - a small security-enhanced Linux distro for servers
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Sun, 02 Nov 2014 22:19:03 +0100
From: magnum <>
Subject: Re: descrypt speed

On 2014-11-02 18:59, Royce Williams wrote:
> On Thu, Oct 30, 2014 at 9:33 PM, magnum <> wrote:
>> On 2014-10-31 06:02, Royce Williams wrote:
>>> On a GTX970, shouldn't this be sm_52?
>> You can force this by editing NVCC_FLAGS in Makefile. Add something like
>> "-arch sm_50" (or 52). But I doubt it will make much difference and it will
>> only affect CUDA formats.
> In my system with both an sm_20 and an sm_50 card, when running solely
> descrypt-opencl (not CUDA), the ptxas info shows that sm_50 is involved in
> some way.  Is this cosmetic?

OpenCL compiles a suitable (different) kernel for each and you do not 
have to configure anything.

You can configure CUDA for compiling several archs at once, see "nvcc 
--help". It something like "-gencode arch=compute_20,code=sm_20 -gencode 
arch=compute_50,code=sm_50" (added to NVCC_FLAGS instead of just -arch 
sm_xx). The one most suitable of them will be picked at runtime.


Powered by blists - more mailing lists

Your e-mail address:

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.