LK_Scraper : An Fully Configurable LinkedIn Scrape

Lk_scraper is an fully configurable LinkedIn scrape : scrape anything within LinkedIn

Installation

$ pip install git+git://github.com/jqueguiner/lk_scraper

Setup

  • Using Docker compose

$ docker-compose up -d
$ docker-compose run lk_scraper python3

  • Using Docker only for selenium server

First, you need to run a selenium server

$ docker run -d -p 4444:4444 –shm-size 2g selenium/standalone-firefox:3.141.59-20200326

After running this command, from the browser navigate to your IP address followed by the port number and /grid/console. So the command will be http://localhost:4444/grid/console.

Also Read – Lollipopz : Data Exfiltration Utility For Testing Detection Capabilities

Retrieving Cookie

  • Browser-Independent:
  1. Navigate to Linkedin.com and log in
  2. Open up the browser developer tools (Ctrl-Shift-I or right click -> inspect element)
  • Chrome:
    • Select the Application tab
    • Under the Storage header on the left-hand menu, click the Cookies dropdown and select www.linkedin.com
    • Find the li_at cookie, and double click the value to select it before copying
  • Firefox:
    • Select Storage tab
    • Click the Cookies dropdown and select www.linkedin.com
    • Find and copy the li_at value

Setting Up The Cookie

  • Method 1 : Setting the cookie in the config file

You can add your LinkedIn li_at cookie in the config file that is located in your home (~/.lk_scraper/config.yml) see

  • Method 2 : Setting the cookie at the Scraper level

from lk_scraper import Scraper
li_at = “My_super_linkedin_cookie”
scraper = Scraper(li_at=li_at)

  • Method 3 : Using Variable Environment

(Not implemented Yet)

$ export LI_AT=”My_super_linkedin_cookie”

Example

run the jupyter notebook linkedin-example.ipynb

  • Usage

>>from lk_scraper import Scraper
>>scraper = Scraper()

  • Company Scraping

>>from lk_scraper import Scraper
>>scraper = Scraper()
>>company = scraper.get_object(object_name=’company’, object_id=’apple’)

  • Profile Scraping

>>from lk_scraper import Scraper
>>scraper = Scraper()
>>profil = scraper.get_object(object_name=’profil’, object_id=’jlqueguiner’)

R K

Recent Posts

Install Gitea Ubuntu: Complete Setup Guide for Developers

Managing source code efficiently is essential for modern software development, and Install Gitea Ubuntu is…

11 hours ago

Install Ruby Ubuntu – 3 Easy Ways to Set Up Ruby on Ubuntu 20.04

Ruby remains one of the most popular programming languages for web development, automation, and software…

12 hours ago

Plex Media Server Setup: Install and Configure on Ubuntu 20.04

A Plex Media Server Setup on Ubuntu 20.04 is one of the easiest ways to…

13 hours ago

Why Deploying AI Is Just the Beginning: The Case for Ongoing AI Operations Monitoring

Most enterprise AI programs treat deployment as the destination. The business case is built around…

1 day ago

Bash Scripting Best Practices Every Beginner Should Know

Introduction Bash scripting is a powerful way to automate Linux tasks, but writing a script…

6 days ago

How To Create A Self-Signed SSL Certificate Using Bash And OpenSSL

Introduction A self-signed SSL certificate is a certificate that is created and signed by the…

6 days ago