Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Date: Mon, 13 Jul 2015 20:31:44 +0300
From: Solar Designer <solar@...nwall.com>
To: john-dev@...ts.openwall.com
Subject: Re: PHC: Lyra2 on GPU

Agnieszka,

On Mon, Jul 13, 2015 at 04:33:24PM +0200, Agnieszka Bielec wrote:
> 2015-07-12 16:55 GMT+02:00 Solar Designer <solar@...nwall.com>:
> > Can you also try:
> >
> > t = 1
> > m = 80
> > c = 256
> > p = 1
> >
> > This should be almost 2 MB.
> 
> these tests are for 960m this time

Of course, I meant this for a bigger card.

> and I discovered now that the best number of lws also differ for
> various costs but it isn't autotuned

Why isn't it auto-tuned?

> >> I said that I'm using local memory but I wanted to say __private ,
> >> sorry if caused confusion
> >
> > OK.  I guess you're putting the current row (24 KB) in there?  And when
> > you were using global memory before, you had the current row fetched
> > from and sent to global memory each time?
> 
> it's not 24KB. I wrote that there are very small chunks an when I
> tried 2x, 3x, 5x bigger - speed decreased.
> but I'm sceptic about so huge cache in local memory because we have
> e.g. 32KB for all lws number of threads and speed will decrease after
> only if I change lws from 64 to 1

You're right indeed.

So how much private memory are you using per Lyra2 instance?

Alexander

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.