Date: Thu, 20 Aug 2015 04:53:55 +0300 From: Solar Designer <solar@...nwall.com> To: john-dev@...ts.openwall.com Subject: Re: PHC: Argon2 on GPU On Wed, Aug 19, 2015 at 07:41:02PM +0200, Agnieszka Bielec wrote: > ptxas info : Function properties for FillSegment > ptxas . 0 bytes stack frame, 17400 bytes spill stores, > 19352 bytes spill loads > ptxas info : Function properties for GenerateAddresses > ptxas . 0 bytes stack frame, 7780 bytes spill stores, > 11648 bytes spill loads The spills in FillSegment and GenerateAddresses are pretty bad. Where do they come from, and why so much? In FillSegment you use 1 KB per work-item for addresses, in GenerateAddresses you use 2 KB for two blocks. GenerateAddresses is called from FillSegment, so adds its private memory needs on top of FillSegment's. Alexander
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.