Date: Sat, 27 Apr 2013 13:42:17 +0200 From: magnum <john.magnum@...hmail.com> To: john-dev@...ts.openwall.com Subject: Re: revised incremental mode and charset files (was: Bleeding-jumbo branch updated from core) On 26 Apr, 2013, at 22:16 , Solar Designer <solar@...nwall.com> wrote: > I intend to use only RockYou for the new .chr files, perhaps with some > simple and documented filtering applied to it (perhaps in the form of > revised external mode filters). $ ../run/john -pot=/Users/magnum/john/rockyou.dupes.utf8.pot --make-charset=../run/all.chr Loaded 32603388 plaintexts Remaining 32603048 plaintexts What's with the 340 dropped? I did not use any filter and only null is excluded from the charset. So what does it drop? It does not seem to be lines longer than PLAINTEXT_BUFFER_SIZE or similar. In the Rockyou case, simply filtering lines longer than 24 is probably Good Enough[tm] to drop most garbage. OTOH that should probably not happen in charset.c because it would be bad for shorter lengths and other uses than "rockyou.pot". We also might want to filter out some rockyou-based strings to reduce rockyou-specific bias but that's another issue. magnum
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.