Date: Mon, 24 Aug 2015 20:27:23 +0200
From: magnum <john.magnum@...hmail.com>
To: john-users@...ts.openwall.com
Subject: Re: Using Internal-encoding=utf8

On 2015-08-24 20:13, Luis Rocha wrote:
> Maybe this has been explained already, but I thought
> --internal-encoding=utf-8 option, among other things, will cover the
> ability to use --rules with special chars. But for the below exercise it
> has the opposite effect.
>
>
> $ file male-names-replace
> male-names-replace: UTF-8 Unicode text
>
>
> $ ./john  /home/data/hashes/2015cmiyc/uncracked/1.nt.pw
>   --wordlist=/home/data/wordlists/Outpost/male-names-replace --rules:Single
> Using default input encoding: UTF-8
> Rules/masks using ISO-8859-1
> Loaded 6728 password hashes with no different salts (NT [MD4 256/256 AVX2
> 8x3])
> Press 'q' or Ctrl-C to abort, almost any other key for status
> Àntònîò          (?)
> Ántónïó          (?)
> (..)
>
> 41g 0:00:00:09 DONE (2015-08-24 20:04) 4.495g/s 15575Kp/s 15575Kc/s
> 105765MC/s gäûltërö1900..gältërö1900
>
>
> $ ./john  /home/data/hashes/2015cmiyc/uncracked/1.nt.pw
>   --wordlist=/home/data/wordlists/Outpost/male-names-replace --rules:Single
> --internal-encoding=utf-8
> Using default input encoding: UTF-8
> Loaded 6728 password hashes with no different salts (NT [MD4 256/256 AVX2
> 8x3])
> Remaining 6687 password hashes with no different salts
> Press 'q' or Ctrl-C to abort, almost any other key for status
> 0g 0:00:00:11 DONE (2015-08-24 20:06) 0g/s 11645Kp/s 11645Kc/s 78224MC/s
> välëntĩnö1900..gũältërĩö1900
> Session completed
>
>
> As you can see, when using --internal-encoding=utf8 it did not find
> any cracks... Is this as expected?

Sure. The --internal-encoding option needs a legacy codepage (as in your 
first example, where it defaults to ISO-8859-1 and does crack stuff). 
Setting it to UTF-8 is the same as disabling it: it reverts to legacy 
(or john proper) behavior, which doesn't cope with Unicode characters 
very well.
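
To illustrate why a single-byte codepage helps here, below is a small Python sketch. It is not John the Ripper's code; it only assumes that a rule engine edits words one byte at a time, and reverse_rule() and apply_via_codepage() are hypothetical helpers made up for the illustration. Applied directly to UTF-8 bytes, the toy rule splits multi-byte characters, while a round trip through ISO-8859-1 keeps every character intact.

```python
# Illustration only: a toy model of a byte-oriented rule engine,
# NOT John the Ripper's actual implementation.

def reverse_rule(data: bytes) -> bytes:
    """Hypothetical 'reverse' rule that works byte by byte."""
    return data[::-1]


def apply_via_codepage(word: str, codepage: str) -> str:
    """Convert to a single-byte codepage, apply the rule, convert back."""
    internal = word.encode(codepage)  # one byte per character
    return reverse_rule(internal).decode(codepage)


# "gältërö" written with escapes so the source file encoding cannot matter.
word = "g\u00e4lt\u00ebr\u00f6"

# Rule applied straight to UTF-8 bytes: the two-byte sequences for
# ä, ë and ö are reversed as well, so the result is no longer valid UTF-8.
broken = reverse_rule(word.encode("utf-8"))
try:
    broken.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False

# Same rule applied through a single-byte internal encoding.
via_latin1 = apply_via_codepage(word, "iso-8859-1")

print("valid UTF-8 after byte-wise rule:", utf8_ok)
print("via ISO-8859-1:", via_latin1)
```

The candidate produced through the codepage can then be re-encoded to whatever the target format needs, which is roughly the role the "Rules/masks using ISO-8859-1" line in the first run points at.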

Depending on what names you have in that wordlist, you might want to try 
another run with --internal-encoding=cp737 (Greek) and yet another with 
--internal-encoding=cp1251 (Russian), for instance.
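
One way to decide which codepages are worth a run is to check how many words of the list each codepage can represent at all. The sketch below does that with Python's own codecs; fits_codepage() and coverage() are hypothetical helper names, the sample words are made up, and the codec names are Python's, which are only assumed to correspond to the john names used above.

```python
# Sketch: estimate which legacy codepages can represent the words in a list.
# Uses Python codec names; the sample data below is invented for illustration.

def fits_codepage(word: str, codepage: str) -> bool:
    """True if every character of the word exists in the codepage."""
    try:
        word.encode(codepage)
        return True
    except UnicodeEncodeError:
        return False


def coverage(words, codepages):
    """Return, per codepage, the fraction of words it can represent."""
    words = list(words)
    return {
        cp: sum(fits_codepage(w, cp) for w in words) / len(words)
        for cp in codepages
    }


sample = [
    "Antonio",                         # plain ASCII, fits everywhere
    "g\u00e4lt\u00ebr\u00f6",          # gältërö, Western European
    "\u039d\u0399\u039a\u039f\u03a3",  # ΝΙΚΟΣ, Greek
    "\u0410\u043d\u0442\u043e\u043d",  # Антон, Cyrillic
]

result = coverage(sample, ["iso-8859-1", "cp737", "cp1251"])
for codepage, fraction in result.items():
    print(f"{codepage}: {fraction:.0%} of words representable")
```

For a real wordlist, the words could be read with open(path, encoding="utf-8") and passed to coverage() line by line after stripping the newline; a codepage with low coverage is unlikely to be worth its own run.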

magnum
