A simple crawler that uses the Scrapy library to extract articles from Vietnamese news websites, given their URLs.
The currently supported crawlers are listed below. To run one, navigate to its subfolder on the command line and type the command shown:
- kenh14_crawler: type
  scrapy crawl kenh14_content_crawler
- soha_crawler: type
  scrapy crawl soha_content_crawler
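
The two commands above can also be launched from a small Python script. The following is only a sketch: the helper names `build_crawl_command` and `run_crawler` are hypothetical, and it assumes that each crawler subfolder sits directly under the directory passed as `repo_root` and that the `scrapy` executable is on the PATH.

```python
import subprocess
from pathlib import Path
from typing import List

# Subfolder name -> spider name, copied from the list above.
SPIDERS = {
    "kenh14_crawler": "kenh14_content_crawler",
    "soha_crawler": "soha_content_crawler",
}


def build_crawl_command(project: str) -> List[str]:
    """Return the `scrapy crawl` command line for a crawler subfolder."""
    if project not in SPIDERS:
        raise ValueError(f"unsupported crawler: {project!r}")
    return ["scrapy", "crawl", SPIDERS[project]]


def run_crawler(project: str, repo_root: Path = Path(".")) -> int:
    """Run the spider from inside its subfolder and return the exit code.

    Assumption: the subfolder lives directly under `repo_root`.
    """
    command = build_crawl_command(project)
    # `cwd` plays the role of "navigate to the respective subfolder".
    completed = subprocess.run(command, cwd=repo_root / project, check=False)
    return completed.returncode
```

For example, `run_crawler("soha_crawler")` would be roughly equivalent to changing into `soha_crawler` and typing `scrapy crawl soha_content_crawler`.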