UDdup takes a list of URLs and removes “duplicate” pages: URLs whose patterns are repetitive and probably point to the same web template.
For example:
https://www.example.com/product/123
https://www.example.com/product/456
https://www.example.com/product/123?is_prod=false
https://www.example.com/product/222?is_debug=true
All of the above probably point to the same product “template”, so it should be enough to run our various scanners on only some of these URLs.
The result of the above after UDdup should be:
https://www.example.com/product/123?is_prod=false
https://www.example.com/product/222?is_debug=true
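The idea above can be sketched roughly in Python. This is not UDdup's actual implementation. The helper names `path_pattern` and `dedupe`, the `{id}` placeholder, and the rule that only all-digit path segments count as interchangeable are assumptions chosen to reproduce the example result. URLs are grouped by host and path pattern, one URL is kept per distinct set of query parameter names, and a URL is dropped when another URL in its group carries a strict superset of its parameters.

```python
from urllib.parse import urlsplit, parse_qs


def path_pattern(url):
    """Return (host, path) with all-digit segments replaced by a placeholder."""
    parts = urlsplit(url)
    segments = ["{id}" if seg.isdigit() else seg for seg in parts.path.split("/")]
    return parts.netloc, "/".join(segments)


def dedupe(urls):
    """Keep one URL per (pattern, parameter-name set), preferring richer URLs."""
    groups = {}
    for url in urls:
        names = frozenset(parse_qs(urlsplit(url).query, keep_blank_values=True))
        # The first URL seen for each parameter-name set represents that set.
        groups.setdefault(path_pattern(url), {}).setdefault(names, url)

    result = []
    for by_names in groups.values():
        for names, url in by_names.items():
            # Skip a URL if another one in the group has strictly more parameters.
            if any(names < other for other in by_names):
                continue
            result.append(url)
    return result


urls = [
    "https://www.example.com/product/123",
    "https://www.example.com/product/456",
    "https://www.example.com/product/123?is_prod=false",
    "https://www.example.com/product/222?is_debug=true",
]
for kept in dedupe(urls):
    print(kept)
```

With the four example URLs, this sketch keeps only the two URLs that carry query parameters, matching the result listed above.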
Why do I need it?
Mostly for a better (automated) reconnaissance process, with less noise for both the tester and the target.
Examples
Take a look at demo.txt, the raw URLs file, which results in demo-results.txt.
Installation
pip install uddup
Alternatively, to install from source, clone the repository.
git clone https://github.com/rotemreiss/uddup.git
Install the Python requirements.
cd uddup
pip install -r requirements.txt
Usage
uddup -u demo.txt -o ./demo-result.txt
uddup -h
Short Form | Long Form | Description |
---|---|---|
-h | --help | Show this help message and exit |
-u | --urls | File with a list of URLs |
-o | --output | Save results to a file |
-s | --silent | Print only the result URLs |
-fp | --filter-path | Filter paths by a given regex |
Filters out paths that match a custom pattern. For example, to filter all paths that start with /product, run:
Single Regex
uddup -u demo.txt -fp "^product"
Input:
https://www.example.com/
https://www.example.com/privacy-policy
https://www.example.com/product/1
https://www.example2.com/product/2
https://www.example3.com/product/4
Output:
https://www.example.com/
https://www.example.com/privacy-policy
Multiple Regex
uddup -u demo.txt -fp "(^product)|(^category)"
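To make the filtering behaviour concrete, here is a minimal Python sketch of what a path filter of this kind could look like. It is not taken from UDdup's source. The function name `filter_paths` is hypothetical, and it assumes the regex is matched against the URL path with its leading slash removed, which is inferred from the `^product` example rather than confirmed.

```python
import re
from urllib.parse import urlsplit


def filter_paths(urls, pattern):
    """Drop every URL whose path (without the leading slash) matches pattern."""
    regex = re.compile(pattern)
    kept = []
    for url in urls:
        # Assumption: the leading "/" is stripped so "^product" can match.
        path = urlsplit(url).path.lstrip("/")
        if not regex.search(path):
            kept.append(url)
    return kept


sample = [
    "https://www.example.com/",
    "https://www.example.com/privacy-policy",
    "https://www.example.com/product/1",
    "https://www.example2.com/product/2",
    "https://www.example3.com/product/4",
]
for url in filter_paths(sample, "^product"):
    print(url)
```

Under these assumptions, a combined pattern such as `(^product)|(^category)` is handled the same way, since alternation is part of ordinary regex syntax.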