- To scrape images using
FastClass
, runfcd -c BING -s 0 -o raw_images config/fastclass.csv
- To scrape images using
google-images-download
, rungoogleimagesdownload -cf ../config/google_images_download.json
- To rename image files to zero-padded sequential numbers, run
ls | cat -n | while read n f; do mv "$f" `printf "%03d.extension" $n`; done
- Refer to google-images-download and FastClass for documentation of usage.
- Google crawler from
FastClass
won't work. - If you face issues when using
google-images-download
, see #360 and #298 to learn how to download the forked version that has the bug fix. It seems likegoogle-images-download
is not maintained by the current repo owner anymore. - You can use TinyPNG and its VS Code extension to resize and compress your scraped images. Get your API key here.
- By default, this repo comes with a
devcontainer.json
configuration. If you have Docker installed, you can start the container to get all packages and extensions installed at one go.