Date: Sat, 21 Apr 2012 23:17:12 -0300
From: Claudio André <claudioandre.br@...il.com>
To: john-dev@...ts.openwall.com
Subject: Re: New RAR OpenCL kernel


> This is similar to what I get on a mobile E2 "Loveland" GPU. Are the
> GPU's this slow, or do I have a major problem? This is about half the
> speed of one CPU core. I do get warnings about register spill on that
> one though.
>

I suppose you don't have a profiler for AMD, so I have attached a profiler output.

You can do the "WTF" thing (install Visual Studio, install the plugin,
etc.), or you can open the attached CSV (the table starts at line 17; it is
comma and semicolon separated, see the image).

This is not science, and I'm a confused user myself, but two counters stand out:
- ALUBusy is very low.
- ALUPacking is low.

It seems difficult to get my GPU running at full speed.

Cheers.

[ CONTENT OF TYPE image/png SKIPPED ]

# ProfilerVersion=2.4.1314
# Application=/home/claudio/bin/john/to_commit/run/john
# ApplicationArgs=--format=rar -t
# Device AMD Phenom(tm) II X6 1075T Processor PlatformVendor=Advanced Micro Devices, Inc.
# Device AMD Phenom(tm) II X6 1075T Processor PlatformName=AMD Accelerated Parallel Processing
# Device AMD Phenom(tm) II X6 1075T Processor PlatformVersion=OpenCL 1.1 AMD-APP (898.1)
# Device AMD Phenom(tm) II X6 1075T Processor CLDriverVersion=2.0
# Device AMD Phenom(tm) II X6 1075T Processor CLRuntimeVersion=OpenCL 1.1 AMD-APP (898.1)
# Device AMD Phenom(tm) II X6 1075T Processor NumberAppAddressBits=64
# Device Juniper PlatformVendor=Advanced Micro Devices, Inc.
# Device Juniper PlatformName=AMD Accelerated Parallel Processing
# Device Juniper PlatformVersion=OpenCL 1.1 AMD-APP (898.1)
# Device Juniper CLDriverVersion=CAL 1.4.1703
# Device Juniper CLRuntimeVersion=OpenCL 1.1 AMD-APP (898.1)
# Device Juniper NumberAppAddressBits=32
# OS=Ubuntu 11.10 \n \l
Method , ExecutionOrder , ThreadID , CallIndex , GlobalWorkSize , WorkGroupSize , Time , LocalMemSize , VGPRs , SGPRs , ScratchRegs , FCStacks , Wavefronts , ALUInsts , FetchInsts , WriteInsts , LDSFetchInsts , LDSWriteInsts , ALUBusy , ALUFetchRatio , ALUPacking , FetchSize , CacheHit , FetchUnitBusy , FetchUnitStalled , WriteUnitStalled , FastPath , CompletePath , PathUtilization , LDSBankConflict
SetCryptKeys__k1_Juniper1 ,     1 , 6873 , 37 , {    128       1       1} , {  128     1     1} ,      6014.58922 ,       20480 ,    57 , NA ,     0 ,     5 ,         8.00 ,  30624163.50 ,         5.88 ,         5.12 ,    360452.50 ,         5.50 ,         1.92 ,   5212623.57 ,        35.09 ,        39.12 ,         3.99 ,         0.00 ,         0.00 ,         0.00 ,        34.00 ,         0.00 ,       100.00 ,         0.08
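
For anyone who would rather not open the CSV in a spreadsheet, the two counters mentioned in the message can be pulled out with a few lines of Python. This is only a rough sketch: it assumes the layout shown above ('#' metadata lines, then one comma-separated header line, then one row per kernel call). The function name `read_profile` and the `SAMPLE` string are made up for illustration; `SAMPLE` is a trimmed copy of the output above with only a few of the columns kept.

```python
import csv
import io

# Trimmed copy of the profiler output above (assumption: same layout,
# only a handful of the original columns are kept for brevity).
SAMPLE = """\
# ProfilerVersion=2.4.1314
# ApplicationArgs=--format=rar -t
Method , GlobalWorkSize , WorkGroupSize , Time , ALUBusy , ALUPacking
SetCryptKeys__k1_Juniper1 , {    128       1       1} , {  128     1     1} ,      6014.58922 ,         1.92 ,        35.09
"""


def read_profile(text):
    """Return one dict per kernel row, skipping '#' metadata and blank lines."""
    lines = [ln for ln in text.splitlines()
             if ln.strip() and not ln.startswith("#")]
    reader = csv.reader(io.StringIO("\n".join(lines)))
    # The first remaining line is the header; fields are padded with spaces.
    header = [name.strip() for name in next(reader)]
    return [dict(zip(header, (value.strip() for value in row)))
            for row in reader]


rows = read_profile(SAMPLE)
for row in rows:
    # Counters are kept as strings in the dict; convert the two of interest.
    busy = float(row["ALUBusy"])
    packing = float(row["ALUPacking"])
    print(f"{row['Method']}: ALUBusy={busy} ALUPacking={packing}")
```

To run it on the real attachment, pass the file contents instead of `SAMPLE`, for example `read_profile(open(path).read())`, where `path` is wherever the CSV was saved.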
