Wordlistgen is a tool to pass a list of URLs and get back a list of relevant words for your wordlists. Wordlists are much more effective when you take the application’s context into consideration. wordlistgen pulls out URL components, such as subdomain names, paths, query strings, etc. and spits them back to stdout so you can easily add them to your wordlists.

Installation

If you don’t have Go installed, “go” do that!

go get -u github.com/ameenmaali/wordlistgen

Usage

wordlistgen takes URLs and paths from stdin, of which you will most likely want in a file such as:

$ cat file.txt
https://google.com/home/?q=2&d=asd
http://my.site
/api/v2/auth/me?id=123

Get unique URL components from a file of URLs and/or paths:

cat hosts.txt | wordlistgen

Get unique URL components from a file of URLs and/or paths, including query string values, and save to a file:

cat hosts.txt | wordlistgen -qv > urlComponents.txt

wordlistgen works at it’s best when chained with other tools, such as @tonnomnom’s waybackurls :

cat hosts.txt | waybackurls | wordlistgen