|
|
Message-ID: <CABJfu+Yeuga44jN9MMbXot_JTTab=J_bg5bS81nwEmBa_O1MRg@mail.gmail.com>
Date: Thu, 30 Jul 2015 23:28:33 +0200
From: Viktor Gazdag <woodspeed@...il.com>
To: john-users@...ts.openwall.com
Subject: John the ripper with Tesla GPUs on Debian
Hi all,
I've installed the latest Tesla driver from nvidia site, not the
nvidia vga driver and not from repo.
We downloaded and installed the ubuntu 14.04 deb package and the
dependences with aptitude and some dependencies with ubuntu packages,
because some version weren't high enough or weren't available. Maybe
the cuda*.run installer would be better, but I haven't had that much
free space left to try to install it.
We've installed librexgen from github, but john didn't find one lib,
because it was in an another directory (I think it was in the
librexgen/c/ directory).
Before run configure, I've added the following env (these are needed
for running john, too):
export CC=gcc-4.9
export PATH=$PATH:/usr/local/cuda-7.0/bin
export LD_LIBRARY_PATH=:/usr/local/cuda-7.0/lib64
export PATH=$PATH:/usr/local/cuda-7.0/targets/x86_64-linux/include
export LD_LIBRARY_PATH=:/usr/local/cuda-7.0/targets/x86_64-linux/lib
export PATH=$PATH:/usr/lib/gcc/x86_64-linux-gnu/4.9
The last export is needed, because system didn't find cc1plus to compile things.
./configure --enable-experimental-code
make clean && make -s
After it, I could list cuda devices:
./john --list=cuda-devices
CUDA runtime version 7.0
CUDA driver version 7.0
8 CUDA devices found:
CUDA Device #0
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 0d:00.0
NVML id: 2
Fan speed: n/a
GPU temp: 24°C
Utilization: 0%
CUDA Device #1
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 0e:00.0
NVML id: 3
Fan speed: n/a
GPU temp: 24°C
Utilization: 0%
CUDA Device #2
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 09:00.0
NVML id: 0
Fan speed: n/a
GPU temp: 23°C
Utilization: 0%
CUDA Device #3
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 0a:00.0
NVML id: 1
Fan speed: n/a
GPU temp: 22°C
Utilization: 0%
CUDA Device #4
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 28:00.0
NVML id: 4
Fan speed: n/a
GPU temp: 27°C
Utilization: 0%
CUDA Device #5
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 2b:00.0
NVML id: 5
Fan speed: n/a
GPU temp: 27°C
Utilization: 0%
CUDA Device #6
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 30:00.0
NVML id: 6
Fan speed: n/a
GPU temp: 27°C
Utilization: 0%
CUDA Device #7
Name: Tesla K20Xm
Type: discrete
Compute capability: 3.5 (sm_35)
Number of stream processors: 2688 (14 x 192)
Clock rate: 732 Mhz
Memory clock rate (peak) 2600 Mhz
Memory bus width 384 bits
Peak memory bandwidth: 249 GB/s
Total global memory: 5.0 GB (ECC)
Total shared memory per block: 48.0 KB
Total constant memory: 64.0 KB
L2 cache size 1.0 MB
Kernel execution timeout: No
Concurrent copy and execution: Bi-directional
Concurrent kernels support: Yes
Warp size: 32
Max. GPRs/thread block 65536
Max. threads per block 1024
Max. resident threads per MP 2048
PCI device topology: 33:00.0
NVML id: 7
Fan speed: n/a
GPU temp: 26°C
Utilization: 4%
Best regards
woodspeed
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.