Indeed Job Scrapper

This Python-based web scraping project utilizes Selenium and BeautifulSoup to extract job listings for Python Developer positions in Gurgaon, Haryana, from Indeed.

Technologies Used

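Make sure you have the following installed before running the application:

- Python
- Selenium
- BeautifulSoup
- Django
- MongoDB (the project uses MongoDB Compass and Atlas)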

Features

- Extracting: Selenium and BeautifulSoup extract job listings for Python Developer positions in Gurgaon, Haryana.
- Cleaning: The scraped data is cleaned by converting job salaries to per annum values and handling missing values.
- Storage: The cleaned data is stored in a MongoDB database using MongoDB Compass and Atlas.
- Admin Panel: An admin panel built with Django performs CRUD operations on the job data.
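
The cleaning step above converts salaries to per annum values. The following is a minimal sketch of how such a conversion might look, not the project's actual code. The function name `salary_to_per_annum`, the multipliers in `PERIODS_PER_YEAR`, the midpoint rule for ranges, and the example salary strings are all assumptions made for illustration.

```python
import re
from typing import Optional

# Assumed multipliers for converting a pay period to a yearly figure
# (12 months, 52 weeks, 260 working days, 2080 working hours).
PERIODS_PER_YEAR = {
    "year": 1,
    "month": 12,
    "week": 52,
    "day": 260,
    "hour": 2080,
}


def salary_to_per_annum(raw: Optional[str]) -> Optional[int]:
    """Convert a salary string to an annual integer amount.

    Returns None when the text is missing or contains no number.
    A range such as "25,000 - 40,000" is reduced to its midpoint.
    """
    if not raw:
        return None
    text = raw.lower()

    # Collect every number in the text, dropping thousands separators.
    amounts = [
        float(match.replace(",", ""))
        for match in re.findall(r"\d[\d,]*(?:\.\d+)?", text)
    ]
    if not amounts:
        return None

    # Detect the pay period; treat the amount as yearly if none is stated.
    multiplier = 1
    for period, factor in PERIODS_PER_YEAR.items():
        if period in text:
            multiplier = factor
            break

    # Use the midpoint of the first two numbers (or the single number).
    bounds = amounts[:2]
    midpoint = sum(bounds) / len(bounds)
    return int(round(midpoint * multiplier))


monthly = salary_to_per_annum("₹25,000 - ₹40,000 a month")
yearly = salary_to_per_annum("₹6,00,000 a year")
missing = salary_to_per_annum(None)
```

Returning `None` for missing or unparseable salaries keeps the missing-value handling explicit, so later steps can decide whether to drop or fill those rows.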

Database Structure

The application uses a database table with the following structure:

| Field | Type | Description |
| --- | --- | --- |
| job_title | VARCHAR(255) | Title of the job position |
| company_name | VARCHAR(255) | Name of the hiring company |
| company_location | VARCHAR(255) | Location of the company |
| total_salary | INT | Annual salary for the job |
| City | VARCHAR(255) | City where the job is located |
| State | VARCHAR(255) | State where the job is located |
| link | VARCHAR(255) | Link to the job listing |
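
As an illustration of the structure above, one job record could be modelled in Python as shown below. This is a sketch under assumptions: the class name `JobRecord` and the method `to_document` are hypothetical, the sample values are placeholders, and `total_salary` is made optional here to reflect listings without a stated salary.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class JobRecord:
    """One job listing, with field names matching the structure above."""

    job_title: str
    company_name: str
    company_location: str
    total_salary: Optional[int]  # annual salary; None when not listed
    City: str
    State: str
    link: str

    def to_document(self) -> dict:
        """Return a plain dict that could be stored as one database record."""
        return asdict(self)


# Placeholder values for illustration only, not real scraped data.
job = JobRecord(
    job_title="Python Developer",
    company_name="Example Corp",
    company_location="Gurgaon, Haryana",
    total_salary=600000,
    City="Gurgaon",
    State="Haryana",
    link="https://example.com/jobs/123",
)
document = job.to_document()
```

A plain dict like `document` is the shape that MongoDB drivers generally accept for inserting a record, which is why the sketch converts the dataclass rather than storing the object itself.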

Getting Started

Clone the Repository:

git clone https://github.com/hardik2207/AST-Consulting.git
cd AST-Consulting

Run the Scraper
```
python "Job scrapper.py"
```
Run Admin Panel
```
python manage.py runserver
```

Overview

Scraped Jobs
Cleaned Jobs
Data stored in MongoDB
Admin Panel using Django
1. Admin Login
2. Add Job
3. Search Job
4. Edit Job
5. Delete Job
Mean salary of Python developers in Delhi
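
A mean salary like the one listed above can be computed from the cleaned data with the standard library alone. The sketch below is an assumption about how that might be done, not the notebook's actual code. The helper name `mean_salary` is hypothetical, the column names follow the Database Structure section, and the sample rows are made up for illustration.

```python
import csv
import io
from statistics import mean

# Made-up sample rows standing in for the cleaned CSV; not real data.
SAMPLE_CSV = """job_title,total_salary,City
Python Developer,600000,Delhi
Python Developer,,Delhi
Backend Developer,900000,Delhi
Python Developer,750000,Gurgaon
"""


def mean_salary(csv_text, city):
    """Return the mean annual salary for a city, skipping rows with no salary."""
    reader = csv.DictReader(io.StringIO(csv_text))
    salaries = [
        int(row["total_salary"])
        for row in reader
        if row["City"] == city and row["total_salary"]
    ]
    # Return None rather than raising when no row has a usable salary.
    return mean(salaries) if salaries else None


delhi_mean = mean_salary(SAMPLE_CSV, "Delhi")
```

Rows with an empty `total_salary` are skipped rather than counted as zero, so missing values do not pull the average down.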

Contributing

Contributions are welcome! Feel free to open issues or submit pull requests to improve the project.

