LK_Scraper : An Fully Configurable LinkedIn Scrape

Lk_scraper is an fully configurable LinkedIn scrape : scrape anything within LinkedIn

Installation

$ pip install git+git://github.com/jqueguiner/lk_scraper

Setup

  • Using Docker compose

$ docker-compose up -d
$ docker-compose run lk_scraper python3

  • Using Docker only for selenium server

First, you need to run a selenium server

$ docker run -d -p 4444:4444 –shm-size 2g selenium/standalone-firefox:3.141.59-20200326

After running this command, from the browser navigate to your IP address followed by the port number and /grid/console. So the command will be http://localhost:4444/grid/console.

Also Read – Lollipopz : Data Exfiltration Utility For Testing Detection Capabilities

Retrieving Cookie

  • Browser-Independent:
  1. Navigate to Linkedin.com and log in
  2. Open up the browser developer tools (Ctrl-Shift-I or right click -> inspect element)
  • Chrome:
    • Select the Application tab
    • Under the Storage header on the left-hand menu, click the Cookies dropdown and select www.linkedin.com
    • Find the li_at cookie, and double click the value to select it before copying
  • Firefox:
    • Select Storage tab
    • Click the Cookies dropdown and select www.linkedin.com
    • Find and copy the li_at value

Setting Up The Cookie

  • Method 1 : Setting the cookie in the config file

You can add your LinkedIn li_at cookie in the config file that is located in your home (~/.lk_scraper/config.yml) see

  • Method 2 : Setting the cookie at the Scraper level

from lk_scraper import Scraper
li_at = “My_super_linkedin_cookie”
scraper = Scraper(li_at=li_at)

  • Method 3 : Using Variable Environment

(Not implemented Yet)

$ export LI_AT=”My_super_linkedin_cookie”

Example

run the jupyter notebook linkedin-example.ipynb

  • Usage

>>from lk_scraper import Scraper
>>scraper = Scraper()

  • Company Scraping

>>from lk_scraper import Scraper
>>scraper = Scraper()
>>company = scraper.get_object(object_name=’company’, object_id=’apple’)

  • Profile Scraping

>>from lk_scraper import Scraper
>>scraper = Scraper()
>>profil = scraper.get_object(object_name=’profil’, object_id=’jlqueguiner’)

R K

Recent Posts

Understanding the Model Context Protocol (MCP) and How It Works

Introduction to the Model Context Protocol (MCP) The Model Context Protocol (MCP) is an open…

5 days ago

The file Command – Quickly Identify File Contents in Linux

While file extensions in Linux are optional and often misleading, the file command helps decode what a…

6 days ago

How to Use the touch Command in Linux

The touch command is one of the quickest ways to create new empty files or update timestamps…

6 days ago

How to Search Files and Folders in Linux Using the find Command

Handling large numbers of files is routine for Linux users, and that’s where the find command shines.…

6 days ago

How to Move and Rename Files in Linux with the mv Command

Managing files and directories is foundational for Linux workflows, and the mv (“move”) command makes it easy…

6 days ago

How to Create Directories in Linux with the mkdir Command

Creating directories is one of the earliest skills you'll use on a Linux system. The mkdir (make…

6 days ago