Creating the masks

Data sources

50k first names from 54 countries, collected by a German computer magazine under GNU Free Documentation License in 2002, are mirrored onto Dropbox (link).
US first names (1880-2018, link) that occurred at least 20 times are added to the previous source.
4k pet names (link).

In total, 66k names of length 4 or more are compiled into one file after you run ./bash/load_names.sh.

Limitations:

all original words containing the symbol | are excluded from the wordlist;
only names at least 4 characters long are considered when collecting the statistics. You can change this option in load_names.sh. Later on, in the password generation phase, names of all lengths will be generated.
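The two limitations above can be sketched in a few lines. This is only an illustration, not the contents of load_names.sh: the function names, the `MIN_NAME_LEN` constant, and the lowercasing and de-duplication steps are assumptions.

```python
MASK_CHAR = "|"   # placeholder symbol reserved for masking
MIN_NAME_LEN = 4  # hypothetical constant mirroring the default described above


def filter_names(names, min_len=MIN_NAME_LEN):
    """Keep lowercased, de-duplicated names that are at least min_len long."""
    seen = set()
    kept = []
    for name in names:
        name = name.strip().lower()
        if len(name) >= min_len and name not in seen:
            seen.add(name)
            kept.append(name)
    return kept


def filter_wordlist(words):
    """Drop every password candidate that contains the placeholder symbol."""
    return [w for w in words if MASK_CHAR not in w]


names = filter_names(["Vika", "jan", "Gunter", "vika"])
words = filter_wordlist(["12vika1992", "pass|word", "Gunter@!"])
print(names)  # ['vika', 'gunter']
print(words)  # ['12vika1992', 'Gunter@!']
```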

Most used first names

Most used first names of length 5 or more:

$ awk '{if (length($2) >= 5) print}' names/names.count | head
139989 chris
128067 angel
104553 ester
100876 cally
 99607 inger
 94240 andre
 91455 ander
 86502 erman
 86332 erden
 77870 christ

Alphabet

OMEN was used to create the alphabet of Top2B passwords:

eainr7s0o1lt3542986ducmhgbkpyvfwAzEjIRSNCDxOBTLGHqMFKPUYWJVZXQ-'.!$@+_?#/=:)("~&,%{*`\^}>;[<]|

The last symbol in the alphabet is |, meaning it is the least used in password candidates. The probability of encountering | in a wordlist is 0.008 %. Therefore, we can use this symbol as a mask placeholder after stripping all lines that contain | from the wordlist, which preserves approx. 99.9 % of words.

The implementation employs a simple and efficient data structure, a trie (prefix tree), which is well suited to searching a text for substrings when the look-up names are drawn from a small alphabet (in our case, the 26 lowercase English letters). The search is case-insensitive.
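A minimal sketch of such a trie is shown below. It is not the project's actual code: the class name `NameTrie`, its methods, and the greedy "longest name, non-overlapping, left to right" matching policy are assumptions made for illustration.

```python
class NameTrie:
    """Prefix tree over lowercase names; nodes are plain dicts."""

    _END = None  # marker key: a complete name ends at this node

    def __init__(self, names=()):
        self.root = {}
        for name in names:
            self.add(name)

    def add(self, name):
        node = self.root
        for ch in name.lower():
            node = node.setdefault(ch, {})
        node[self._END] = True

    def longest_match(self, text, start):
        """Length of the longest name that begins at text[start], or 0."""
        node = self.root
        best = 0
        for i in range(start, len(text)):
            node = node.get(text[i].lower())  # case-insensitive step
            if node is None:
                break
            if self._END in node:
                best = i - start + 1
        return best

    def find_all(self, text):
        """Yield (start, end) spans of non-overlapping names, left to right."""
        i = 0
        while i < len(text):
            length = self.longest_match(text, i)
            if length:
                yield i, i + length
                i += length
            else:
                i += 1


trie = NameTrie(["vika", "gunter", "gun"])
print(list(trie.find_all("12vika1992")))  # [(2, 6)]
print(list(trie.find_all("Gunter@!")))    # [(0, 6)]
```

Each lookup walks at most as many nodes as the longest stored name, regardless of how many names the trie holds.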

Size: 'wpa' and 'all'

wpa: passwords at least 8 characters long
all: all passwords, no filtering by length

Mode: 'length' and 'single'

Length Mode

Each name in a password is replaced by |, keeping the length:

12vika1992 --> 12||||1992
Gunter@! --> ||||||@!
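The replacement above can be sketched as follows. To keep the example short it uses a regular-expression alternation rather than a trie; the function names and the "longer names win" ordering are assumptions, not the project's code.

```python
import re

MASK_CHAR = "|"


def build_name_pattern(names):
    """Case-insensitive alternation; longer names first so they win over their prefixes."""
    ordered = sorted(set(names), key=len, reverse=True)
    return re.compile("|".join(re.escape(n) for n in ordered), re.IGNORECASE)


def mask_length_mode(password, pattern):
    """Replace every matched name with as many '|' as the name has characters."""
    return pattern.sub(lambda m: MASK_CHAR * len(m.group()), password)


pattern = build_name_pattern(["vika", "gunter", "gun"])
print(mask_length_mode("12vika1992", pattern))  # 12||||1992
print(mask_length_mode("Gunter@!", pattern))    # ||||||@!
```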

This is where hashcat masks come into play. To keep things simple, only three masked rules are created for each match:

lowercase all
uppercase all
capitalize

That is, 12vika1992 creates 3 masks, valid for a hashcat WPA attack:

12?l?l?l?l1992
12?u?u?u?u1992
12?u?l?l?l1992
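The conversion from a length-mode mask to hashcat masks can be sketched like this. hashcat writes its built-in charsets as ?l (lowercase) and ?u (uppercase), and a literal question mark as ??. The function names are hypothetical, and applying one case style to every name group in the mask is an assumption.

```python
import re

# Case styles described above, as (first-character class, remaining-characters class).
STYLES = {
    "lowercase": ("?l", "?l"),
    "uppercase": ("?u", "?u"),
    "capitalize": ("?u", "?l"),
}


def escape_literal(text):
    """hashcat treats '?' as the charset prefix, so a literal '?' is written '??'."""
    return text.replace("?", "??")


def to_hashcat_masks(length_mask):
    """Turn a length-mode mask such as '12||||1992' into one hashcat mask per style."""
    parts = re.split(r"(\|+)", length_mask)
    masks = []
    for first, rest in STYLES.values():
        out = []
        for part in parts:
            if part.startswith("|"):
                out.append(first + rest * (len(part) - 1))
            else:
                out.append(escape_literal(part))
        masks.append("".join(out))
    return masks


for mask in to_hashcat_masks("12||||1992"):
    print(mask)
# 12?l?l?l?l1992
# 12?u?u?u?u1992
# 12?u?l?l?l1992
```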

Masks in masks.hashcat files are sorted by the number of occurrences, taken from masks.stats.

Single Char Mode

In the single mode, each match group is replaced with a single symbol |:

12vika1992 --> 12|1992
Gunter@! --> |@!
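One way to picture single mode is as a length-mode mask with every run of | collapsed into one symbol. The helper below is a hypothetical sketch; in particular, it treats two names written back to back as a single group, which may differ from how the project handles that case.

```python
import re


def to_single_mode(length_mask):
    """Collapse each run of '|' in a length-mode mask into a single '|'."""
    return re.sub(r"\|+", "|", length_mask)


print(to_single_mode("12||||1992"))  # 12|1992
print(to_single_mode("||||||@!"))    # |@!
```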

While not directly suited for hashcat masks, the placeholder | allows direct substitution of names of arbitrary length (see the next section). Examples from the Top29M wordlist:

$ head -n10 masks/all/single/masks.stats
 242364 |
  44311 |1
  33654 |123
  33307 |s
  31914 |a
  29447 |e
  26605 |2
  25756 |12
  23953 |n
  23168 |3

Here is how to read the single char mode output:

plain names of any length were used 242364 times as passwords;
digit 1 was appended to names of any length 44311 times ...
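A statistics listing of this shape can be produced by counting identical masks and sorting by frequency. The sketch below uses made-up sample masks and hypothetical function names; it is not how masks.stats is actually generated.

```python
from collections import Counter


def mask_stats(masks):
    """Count identical masks and return (count, mask) pairs, most frequent first."""
    counts = Counter(masks)
    return [(n, mask) for mask, n in counts.most_common()]


def format_stats(stats, width=7):
    """Right-align counts in a fixed-width column, followed by the mask."""
    return [f"{n:>{width}} {mask}" for n, mask in stats]


sample = ["|", "|1", "|", "|123", "|1", "|"]  # made-up masks, not real data
for line in format_stats(mask_stats(sample)):
    print(line)
```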

Generate probable passwords

Brute-forcing all lowercase letters in ?l?l?l?l?l?l?l1 x 1000 to look for name patterns may not be the best idea. A smarter way is to substitute each | in single mode, and each group of consecutive | characters in length mode, with a name from a chosen country. Doing so dramatically decreases the search space.

Examples:

single mode: 12|1992 --> 12jan1992 12alex1992 12simon1992 12martina1992 12michael1992 ...
length mode: 12||||1992 --> 12alex1992 12lena1992 12lora1992 12vika1992 12sofi1992 ...
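The substitution could look roughly like the sketch below. The function `expand_mask` and its `mode` parameter are hypothetical; names are inserted exactly as given, so case variants would have to be added to the name list separately.

```python
import re
from itertools import product


def expand_mask(mask, names, mode="single"):
    """Yield candidates by replacing every '|' group in mask with names.

    single: each '|' stands for a name of any length.
    length: each run of n '|' stands for a name of exactly n characters.
    """
    if mode not in ("single", "length"):
        raise ValueError(f"unknown mode: {mode}")
    splitter = r"(\|)" if mode == "single" else r"(\|+)"
    slots = []
    for part in re.split(splitter, mask):
        if not part.startswith("|"):
            slots.append([part])  # literal text stays as-is
        elif mode == "single":
            slots.append(list(names))
        else:
            slots.append([n for n in names if len(n) == len(part)])
    for combo in product(*slots):
        yield "".join(combo)


names = ["jan", "alex", "lena", "simon"]
print(list(expand_mask("12|1992", names)))
# ['12jan1992', '12alex1992', '12lena1992', '12simon1992']
print(list(expand_mask("12||||1992", names, mode="length")))
# ['12alex1992', '12lena1992']
```

With a name list of size N, a mask with one placeholder yields at most N candidates, compared with 26^7 (about 8 billion) combinations for seven lowercase brute-force positions.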

(to be continued)
