GitHub - Bacdong/web-crawler: Crawler website with requests library in python

Web Crawler with requests and beautifulsoup4(bs4) library in python

Installing for Development:

IDE:

- Download python (https://www.python.org/downloads/)
- Install IDE support compile Python: VSCode, Pycharm, Sublime Text, ...

Extension for IDE:

+ For VSCode, you need install some extension to support code python:
    . HTML CSS Support
    . Python
    . Remote Development
  + For IDE different: Search more information on google

To run:

- OPEN TERMINAL:
  + cd crawler
  + pip install requests // (pip3 install requests)
  + pip install beautifulsoup4 // (pip3 install beautifullsoup4) 
- OPEN getData.py file:
  ( * If you no need save data into database:
      . Comment some function  use to connect database: Eg. insertData..(),...
      . No need to worry about connecting and dealing with databases.
  )
  - Replace current available "url" variable in file with the one url address you want.

  - Reopen terminal and run with: python getData.py

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
SonNhaDep		SonNhaDep
crawler		crawler
first_project		first_project
README.md		README.md
bash.exe.stackdump		bash.exe.stackdump

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Crawler with requests and beautifulsoup4(bs4) library in python

Installing for Development:

About

Releases

Packages

Languages

Bacdong/web-crawler

Folders and files

Latest commit

History

Repository files navigation

Web Crawler with requests and beautifulsoup4(bs4) library in python

Installing for Development:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages