Goop : Google Search Scraper

Goop can perform google searches without being blocked by the CAPTCHA or hitting any rate limits.

How it works?

Facebook provides a debugger tool for its scraper. Interestingly, Google doesn’t limit the requests made by this debugger (whitelisted?) and hence it can be used to scrap the google search results without being blocked by the CAPTCHA.

Since facebook is involved, a facebook session Cookie must be supplied to the library with each request.

Also Read – Osmedeus : Security Framework For Reconnaissance & Vulnerability Scanning

Usage

Installation

pip install goop

Example

from goop import goop

page_1 = goop.search(‘red shoes’, ”)
page_2 = goop.search(‘red_shoes’, ”, page=’1′)
include_omitted_results = goop.search(‘red_shoes’, ”, page=’8′, full=True)

The returned is a dict of following structure

{
“0”: {
“url”: “https://example.com”,
“text”: “Example webpage”,
“summary”: “This is an example webpage whose aim is to demonstrate the usage of …”
},
“1”: {

cli.py demonstrates the usage by performing google searches from the terminal with the following command

python cli.py <query> <number_of_pages>

Disclaimer

Scraping google search results is illegal. This library is merely a proof of concept of the bypass. The author isn’t responsible for the actions of the end users.

R K

Recent Posts

Install MySQL on Ubuntu 20.04: Setup, Security, and Root Access

MySQL is the most popular open-source relational database management system. It is fast, reliable, and a…

15 hours ago

Install Git on Ubuntu 20.04: Apt, Source, and Configuration

Git is the most widely used version control system in the world. It was created by…

15 hours ago

Install Go on Ubuntu 20.04: Download, Setup, and First Program

Go (also called Golang) is an open-source programming language built by Google. It is designed to…

15 hours ago

Install VS Code on Ubuntu 20.04: Snap Package and Apt Guide

Visual Studio Code (VS Code) is an open-source code editor developed by Microsoft. It is one…

15 hours ago

Install Nginx on Ubuntu 20.04: Setup, Firewall, and Config Guide

Nginx (pronounced "engine x") is an open-source, high-performance web server and reverse proxy. It is used…

15 hours ago

Install Apache on Ubuntu 20.04: Setup and Virtual Host Guide

Apache is one of the most widely used open-source web servers in the world. It is…

2 days ago