Profile Scrapper

Introduction:

LinkedIn is a social networking site with over 750 million users. Its user data is valuable for many kinds of research. LinkedIn has deployed a strict anti-scraping system that makes direct data scraping almost impossible.

The main goal of this project is to develop an algorithm that can bypass anti-scraping measures and extract data on the alumni of a particular organization who work in a particular domain. The Selenium and BeautifulSoup libraries for Python are used to automate the complete process.

The program scrapes user data in a specified domain for the specified institute and stores the scraped data as a CSV file in your current working directory.
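The CSV step described above could look roughly like the following. This is a minimal sketch using only the standard library, not the code in Scrapper.py: the column names in `FIELDS`, the `save_profiles` helper, and the default file name `profiles.csv` are all hypothetical, since the real output columns are not listed here.

```python
import csv
from pathlib import Path

# Hypothetical column names; the real script's output fields may differ.
FIELDS = ["name", "headline", "profile_url"]


def save_profiles(profiles, filename="profiles.csv", directory=None):
    """Write a list of profile dicts to a CSV file and return its path.

    When no directory is given, the file goes to the current working
    directory, matching the behaviour described in this README.
    """
    target = Path(directory) if directory is not None else Path.cwd()
    path = target / filename
    with path.open("w", newline="", encoding="utf-8") as handle:
        # extrasaction="ignore" drops keys that are not listed in FIELDS.
        writer = csv.DictWriter(handle, fieldnames=FIELDS, extrasaction="ignore")
        writer.writeheader()
        writer.writerows(profiles)
    return path


if __name__ == "__main__":
    # Made-up sample record, for illustration only.
    sample = [
        {
            "name": "Jane Doe",
            "headline": "Data Engineer",
            "profile_url": "https://example.com/jane",
        }
    ]
    print(save_profiles(sample))
```

Passing `newline=""` when opening the file follows the `csv` module's recommendation and avoids blank rows on Windows.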

Libraries:

# Install the scraping tools.
pip install selenium
pip install bs4

# Install the LinkedIn API client.
pip install linkedin_api

⚠️ Run the code only once in a while: the code drives an automated browser, so Google and LinkedIn will detect the bot if it sends requests frequently. Please do not run it often, or your LinkedIn account might get banned. If possible, use a demo LinkedIn account.
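One way to enforce the "once in a while" advice is a small cooldown guard around the scraper. The sketch below is not part of Scrapper.py: the helper names `can_run` and `record_run`, the stamp file `last_run.txt`, and the one-day interval are assumptions chosen for illustration, and no particular interval is known to be safe.

```python
import time
from pathlib import Path

# Assumed cooldown of one day; pick whatever interval you consider safe.
MIN_INTERVAL_SECONDS = 24 * 60 * 60


def can_run(stamp_file, now=None, min_interval=MIN_INTERVAL_SECONDS):
    """Return True if no run is recorded or the last run is old enough."""
    now = time.time() if now is None else now
    path = Path(stamp_file)
    if not path.exists():
        return True
    try:
        last_run = float(path.read_text().strip())
    except ValueError:
        # Unreadable stamp file: treat it as if the scraper never ran.
        return True
    return now - last_run >= min_interval


def record_run(stamp_file, now=None):
    """Store the time of the current run in the stamp file."""
    now = time.time() if now is None else now
    Path(stamp_file).write_text(str(now))


if __name__ == "__main__":
    stamp = Path.cwd() / "last_run.txt"  # hypothetical stamp file name
    if can_run(stamp):
        record_run(stamp)
        print("OK to start the scraper.")
    else:
        print("Last run was too recent; try again later.")
```

The guard only limits how often you start the script yourself. It does not make the bot any harder to detect.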

Aravindhan-G/Profile_Scrapper
