john-dev - Re: PHC: Lyra2 on GPU

Follow @Openwall on Twitter for new release announcements and other news

[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]

Message-ID: <20150713173144.GB12357@openwall.com>
Date: Mon, 13 Jul 2015 20:31:44 +0300
From: Solar Designer <solar@...nwall.com>
To: john-dev@...ts.openwall.com
Subject: Re: PHC: Lyra2 on GPU

Agnieszka,

On Mon, Jul 13, 2015 at 04:33:24PM +0200, Agnieszka Bielec wrote:
> 2015-07-12 16:55 GMT+02:00 Solar Designer <solar@...nwall.com>:
> > Can you also try:
> >
> > t = 1
> > m = 80
> > c = 256
> > p = 1
> >
> > This should be almost 2 MB.
> 
> these tests are for 960m this time

Of course, I meant this for a bigger card.

> and I discovered now that the best number of lws also differ for
> various costs but it isn't autotuned

Why isn't it auto-tuned?

> >> I said that I'm using local memory but I wanted to say __private ,
> >> sorry if caused confusion
> >
> > OK.  I guess you're putting the current row (24 KB) in there?  And when
> > you were using global memory before, you had the current row fetched
> > from and sent to global memory each time?
> 
> it's not 24KB. I wrote that there are very small chunks an when I
> tried 2x, 3x, 5x bigger - speed decreased.
> but I'm sceptic about so huge cache in local memory because we have
> e.g. 32KB for all lws number of threads and speed will decrease after
> only if I change lws from 64 to 1

You're right indeed.

So how much private memory are you using per Lyra2 instance?

Alexander

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.