Date: Wed, 17 Dec 2008 04:12:52 +0900
From: "Dumplinger Boy" <nasay.ognad@...il.com>
To: john-users@...ts.openwall.com
Subject: Bitslice DES fast implementation for AltiVec(PPC)

Hello, all.

I tried a way to make bitslice DES run faster on PowerPC.

The VSEL instruction can replace a combination of three instructions (VAND, VANDC, and VOR):

    d = vec_or(vec_andc(a, c), vec_and(b, c));
==>
    d = vec_sel(a, b, c);

I have confirmed this on a PowerPC G4 (Mac OS X 10.4) and on the Cell PPU of a PLAYSTATION 3 (Linux).

On the G4, the build of JtR 126.96.36.199 compiled with gcc-3.3 (Apple's) was the fastest, and my code gained about 20% more performance on top of it.

before:
> Benchmarking: Traditional DES [128/128 BS AltiVec]... DONE
> Many salts: 725427 c/s real, 728340 c/s virtual
> Only one salt: 649676 c/s real, 652285 c/s virtual

after:
> Benchmarking: Traditional DES [128/128 BS AltiVec]... DONE
> Many salts: 867635 c/s real, 871119 c/s virtual
> Only one salt: 762291 c/s real, 763818 c/s virtual

P.S.
As far as I know, scalar integer and AltiVec instructions can execute in parallel on most PowerPC implementations. Therefore, further performance gains may be possible.

[ CONTENT OF TYPE application/x-gzip SKIPPED ]
[ CONTENT OF TYPE application/octet-stream SKIPPED ]
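
The replacement above relies on vec_sel(a, b, c) taking each result bit from b where the matching bit of c is 1 and from a where it is 0. A minimal portable sketch of that identity follows. It models the 128-bit AltiVec registers with 64-bit scalar words, which is an assumption made only so the check can be built without altivec.h or PowerPC hardware. The names sel_three_ops, sel_reference, next_word, and verify_select_identity are hypothetical and are not taken from the attached patch.

```c
#include <assert.h>
#include <stdint.h>

/* Three-operation form from the post: (a AND NOT c) OR (b AND c). */
static uint64_t sel_three_ops(uint64_t a, uint64_t b, uint64_t c)
{
    return (a & ~c) | (b & c);
}

/* Bit-by-bit model of select: take b's bit where c is 1, else a's bit. */
static uint64_t sel_reference(uint64_t a, uint64_t b, uint64_t c)
{
    uint64_t d = 0;
    for (int i = 0; i < 64; i++) {
        uint64_t bit = (uint64_t)1 << i;
        d |= (c & bit) ? (b & bit) : (a & bit);
    }
    return d;
}

/* xorshift64 step, used only to produce varied test words. */
static uint64_t next_word(uint64_t *state)
{
    uint64_t x = *state;
    x ^= x << 13;
    x ^= x >> 7;
    x ^= x << 17;
    *state = x;
    return x;
}

/* Check that both forms agree on a deterministic set of inputs. */
static void verify_select_identity(void)
{
    uint64_t state = 0x0123456789ABCDEFull; /* arbitrary nonzero seed */
    for (int i = 0; i < 1000; i++) {
        uint64_t a = next_word(&state);
        uint64_t b = next_word(&state);
        uint64_t c = next_word(&state);
        assert(sel_three_ops(a, b, c) == sel_reference(a, b, c));
    }
}
```

Calling verify_select_identity() from a test program should finish without tripping an assertion. This only checks that the two forms compute the same bits. It says nothing about speed, which depends on the real VSEL instruction doing the work in one operation instead of three.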