Date: Thu, 29 Mar 2018 20:30:52 +0200
From: "timajus ." <timajus@...il.com>
To: john-users@...ts.openwall.com
Subject: OpenCL on Windows

Hello all,
this is Tim from Italy. I've cloned the GitHub repository in my Cygwin64
environment and made symbolic links to the OpenCL libs and includes in
/usr/local/lib and /usr/local/include respectively. Then I issued the
following configure command:

./configure LDFLAGS=-L/usr/local/lib CFLAGS="-O2 -I/usr/local/include"

Configured for building John the Ripper jumbo:

Target CPU ................................. x86_64 AVX2, 64-bit LE
AES-NI support ............................. run-time detection
Target OS .................................. cygwin
Cross compiling ............................ no
Legacy arch header ......................... x86-64.h

Optional libraries/features found:
Memory map (share/page large files) ........ yes
Fork support ............................... yes
OpenMP support ............................. yes (not for fast formats)
OpenCL support ............................. yes
Generic crypt(3) format .................... yes
libgmp (PRINCE mode and faster SRP formats)  yes
128-bit integer (faster PRINCE mode) ....... yes
libz (pkzip format, gpg2john) .............. yes
libbz2 (gpg2john extra decompression logic)  yes
libpcap (vncpcap2john and SIPdump) ......... no
librexgen (regex cracking mode) ............ no
OpenMPI support (default disabled) ......... no
ZTEX USB-FPGA module 1.15y support ......... no

Install missing libraries to get any needed features that were omitted.

Configure finished.  Now "make -s clean && make -sj4" to compile.

Then I started the build process:

- make -s clean && make -sj4

The build completed without any issues.

$ make -s clean && make -sj4
ar: creating aes.a
ar: creating secp256k1.a

Make process completed.



$ ../run/john  --list=build-info
Version: 1.8.0.13-jumbo-1-bleeding-e4e157178 2018-03-28 15:23:20 +0200
Build: cygwin 64-bit x86_64 AVX2 AC OMP
SIMD: AVX2, interleaving: MD4:3 MD5:3 SHA1:1 SHA256:1 SHA512:1
CPU tests: AVX2
$JOHN is ../run/
Format interface version: 14
Max. number of reported tunable costs: 4
Rec file version: REC4
Charset file version: CHR3
CHARSET_MIN: 1 (0x01)
CHARSET_MAX: 255 (0xff)
CHARSET_LENGTH: 24
SALT_HASH_SIZE: 1048576
Max. Markov mode level: 400
Max. Markov mode password length: 30
gcc version: 6.4.0
OpenCL headers version: 2.1
Crypto library: OpenSSL
OpenSSL library version: 0100020bf      (loaded: 0100020ef)
OpenSSL 1.0.2k  26 Jan 2017     (loaded: OpenSSL 1.0.2n  7 Dec 2017)
GMP library version: 6.1.2
File locking: fcntl()
fseek(): fseek
ftell(): ftell
fopen(): fopen
memmem(): System's


But when I try to run john with an OpenCL format, I get an error.

I issued the following command (from the src directory):

$ ../run/john --wordlist=../run/wl.txt --format=descrypt-opencl
../run/hashes.txt --verbosity=5 --devices=0

This is the result (my device is an NVIDIA GeForce GTX 1050 Ti with
CUDA):

initUnicode(UNICODE, UTF-8/ISO-8859-1)
UTF-8 -> UTF-8 -> UTF-8
Device 0: GeForce GTX 1050 Ti
Using default input encoding: UTF-8
Loaded 20 password hashes with 19 different salts (descrypt-opencl,
traditional crypt(3) [DES OpenCL])
Remaining 19 password hashes with 18 different salts
Loaded 6 hashes with 4 different salts to test db from test vectors
Options used: -I ../run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER
$JOHN/kernels/DES_bs_hash_checking_kernel.cl
Options used: -I ../run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D ITER_COUNT=1
-D MASK_ENABLED=0 -D LOC_0=-1 -D LOC_1=-1 -D LOC_2=-1 -D LOC_3=-1 -D
IS_STATIC_GPU_MASK=0 -D CONST_CACHE_SIZE=65536
$JOHN/kernels/DES_bs_finalize_keys_kernel.cl
Options used: -I ../run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D
WORK_GROUP_SIZE=0 -D index0=31 -D index1=16 -D index2=17 -D index3=18 -D
index4=3 -D index5=4 -D index6=3 -D index7=20 -D index8=21 -D index9=22 -D
index10=7 -D index11=8 -D index24=15 -D index25=0 -D index26=1 -D index27=2
-D index28=19 -D index29=20 -D index30=19 -D index31=4 -D index32=5 -D
index33=6 -D index34=23 -D index35=24 -D index48=63 -D index49=48 -D
index50=49 -D index51=50 -D index52=35 -D index53=36 -D index54=35 -D
index55=52 -D index56=53 -D index57=54 -D index58=39 -D index59=40 -D
index72=47 -D index73=32 -D index74=33 -D index75=34 -D index76=51 -D
index77=52 -D index78=51 -D index79=36 -D index80=37 -D index81=38 -D
index82=55 -D index83=56  $JOHN/kernels/DES_bs_kernel_h.cl
Salt compiled from Source:910
GWS: 1048576, LWS: 128
Salt compiled from Binary:910
Salt compiled from Binary:2275
Salt compiled from Binary:990
Salt compiled from Binary:0
Updated internal tables and buffers for salt 910.
Updated internal tables and buffers for salt 2275.
Updated internal tables and buffers for salt 990.
Updated internal tables and buffers for salt 0.
Note: This format may be a lot faster with --mask acceleration (see
doc/MASK).
Options used: -I ../run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER
$JOHN/kernels/DES_bs_hash_checking_kernel.cl
Options used: -I ../run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D ITER_COUNT=1
-D MASK_ENABLED=0 -D LOC_0=-1 -D LOC_1=-1 -D LOC_2=-1 -D LOC_3=-1 -D
IS_STATIC_GPU_MASK=0 -D CONST_CACHE_SIZE=65536
$JOHN/kernels/DES_bs_finalize_keys_kernel.cl
Note: building per-salt kernels. This takes e.g. 2 hours for 4096 salts.
Salt compiled from Binary:258
Salt compiled from Binary:517
Alarm clock

$  echo $?
142


The messages above are from Cygwin64.
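
For reference, the exit status 142 from the Cygwin run can be decoded by
hand. POSIX shells report a process killed by a signal as 128 plus the
signal number, so 142 would correspond to signal 14, which is SIGALRM on
Linux and Cygwin and matches the "Alarm clock" message. A minimal sketch
(the helper name decode_status is hypothetical, and the 128 + N convention
is assumed to apply to the shell in use):

```shell
# Hypothetical helper: decode a shell exit status.
# Values above 128 conventionally mean "terminated by signal (status - 128)".
decode_status() {
    status=$1
    if [ "$status" -gt 128 ]; then
        echo "signal $((status - 128))"
    else
        echo "exit $status"
    fi
}

# Status taken from the "echo $?" output above.
decode_status 142
```

Running "kill -l 14" in the same shell should then print the name of that
signal on the local platform.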



The output from Windows PowerShell for the same test:

PS D:\CUDA\jtr\src> ..\run\john.exe --wordlist=..\run\wl.txt
--format=descrypt-opencl ..\run\hashes.txt --verbosity=5
initUnicode(UNICODE, UTF-8/ISO-8859-1)
UTF-8 -> UTF-8 -> UTF-8
Device 0: GeForce GTX 1050 Ti
Using default input encoding: UTF-8
Loaded 20 password hashes with 19 different salts (descrypt-opencl,
traditional crypt(3) [DES OpenCL])
Remaining 19 password hashes with 18 different salts
Loaded 6 hashes with 4 different salts to test db from test vectors
Options used: -I /run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER
$JOHN/kernels/DES_bs_hash_checking_kernel.cl
Build log:
ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'DES_bs_cmp' for 'sm_61'
ptxas info    : Function properties for DES_bs_cmp
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes
spill loads
ptxas info    : Used 141 registers, 392 bytes cmem[0], 48 bytes cmem[2]
ptxas info    : Compiling entry function 'DES_bs_cmp_high' for 'sm_61'
ptxas info    : Function properties for DES_bs_cmp_high
ptxas         .     256 bytes stack frame, 0 bytes spill stores, 0 bytes
spill loads
ptxas info    : Used 32 registers, 392 bytes cmem[0], 48 bytes cmem[2]
Options used: -I /run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D ITER_COUNT=1
-D MASK_ENABLED=0 -D LOC_0=-1 -D LOC_1=-1 -D LOC_2=-1 -D LOC_3=-1 -D
IS_STATIC_GPU_MASK=0 -D CONST_CACHE_SIZE=65536
$JOHN/kernels/DES_bs_finalize_keys_kernel.cl
Build log:
ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'DES_bs_finalize_keys' for 'sm_61'
ptxas info    : Function properties for DES_bs_finalize_keys
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes
spill loads
ptxas info    : Used 29 registers, 352 bytes cmem[0], 32 bytes cmem[2]
Options used: -I /run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D
WORK_GROUP_SIZE=0 -D index0=31 -D index1=16 -D index2=17 -D index3=18 -D
index4=3 -D index5=4 -D index6=3 -D index7=20 -D index8=21 -D index9=22 -D
index10=7 -D index11=8 -D index24=15 -D index25=0 -D index26=1 -D index27=2
-D index28=19 -D index29=20 -D index30=19 -D index31=4 -D index32=5 -D
index33=6 -D index34=23 -D index35=24 -D index48=63 -D index49=48 -D
index50=49 -D index51=50 -D index52=35 -D index53=36 -D index54=35 -D
index55=52 -D index56=53 -D index57=54 -D index58=39 -D index59=40 -D
index72=47 -D index73=32 -D index74=33 -D index75=34 -D index76=51 -D
index77=52 -D index78=51 -D index79=36 -D index80=37 -D index81=38 -D
index82=55 -D index83=56  $JOHN/kernels/DES_bs_kernel_h.cl
Build log:
ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'DES_bs_25' for 'sm_61'
ptxas info    : Function properties for DES_bs_25
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes
spill loads
ptxas info    : Used 235 registers, 344 bytes cmem[0]
Salt compiled from Source:910
GWS: 1048576, LWS: 128
Salt compiled from Binary:910
Salt compiled from Binary:2275
Salt compiled from Binary:990
Salt compiled from Binary:0
Updated internal tables and buffers for salt 910.
Updated internal tables and buffers for salt 2275.
Updated internal tables and buffers for salt 990.
Updated internal tables and buffers for salt 0.
Note: This format may be a lot faster with --mask acceleration (see
doc/MASK).
Options used: -I /run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER
$JOHN/kernels/DES_bs_hash_checking_kernel.cl
Options used: -I /run/kernels -cl-mad-enable -DSM_MAJOR=6 -DSM_MINOR=1
-cl-nv-verbose -D__GPU__ -DDEVICE_INFO=524306 -DSIZEOF_SIZE_T=8
-DDEV_VER_MAJOR=388 -DDEV_VER_MINOR=19 -D_OPENCL_COMPILER -D ITER_COUNT=1
-D MASK_ENABLED=0 -D LOC_0=-1 -D LOC_1=-1 -D LOC_2=-1 -D LOC_3=-1 -D
IS_STATIC_GPU_MASK=0 -D CONST_CACHE_SIZE=65536
$JOHN/kernels/DES_bs_finalize_keys_kernel.cl
Note: building per-salt kernels. This takes e.g. 2 hours for 4096 salts.
Salt compiled from Binary:258
Salt compiled from Binary:517

Here you can find both files I used for the test:
https://anonym.to/?http://www37.zippyshare.com/v/VjI6zmRA/file.html

Could you please help me out?

Thank you very much in advance.
