hashshura/php-nim-scraper

php-nim-scraper

A naive approach to scraping all Institut Teknologi Bandung NIMs (Nomor Induk Mahasiswa, i.e. student ID numbers) from the ITB Network Information Center (nic.itb.ac.id). To see the scraped data in action, see ITB NIM Finder.

How-to

  • curl-crawl.php requires the PHP cURL extension, whereas fgc-crawl.php needs only file_get_contents (but is considerably slower).
  • Add your logged-in cookie from nic.itb.ac.id to the source file (try making an HTTP call here).
  • Run one of the following commands from the CLI:
$ php curl-crawl.php
or
$ php fgc-crawl.php
  • Sample output (emails omitted to avoid spam): crawled.out.
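
The two fetching strategies named above can be sketched roughly as follows. This is a minimal illustration, not the code in curl-crawl.php or fgc-crawl.php: the function names `cookieHeader`, `fetchWithCurl`, and `fetchWithFgc` are hypothetical, the 30-second timeout is an arbitrary choice, and the URL and cookie string you pass in are assumed to be whatever your logged-in session on nic.itb.ac.id actually uses.

```php
<?php
// Sketch only: these helpers are hypothetical and are not taken from the
// repository's scripts. The caller supplies the real URL and cookie string.

/** Build the Cookie request header line shared by both approaches. */
function cookieHeader(string $cookie): string
{
    return 'Cookie: ' . $cookie;
}

/** Fetch a page with the cURL extension. Returns the body, or null on failure. */
function fetchWithCurl(string $url, string $cookie): ?string
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true, // return the body instead of printing it
        CURLOPT_HTTPHEADER     => [cookieHeader($cookie)],
        CURLOPT_TIMEOUT        => 30,   // arbitrary timeout, in seconds
    ]);
    $body = curl_exec($ch);
    return $body === false ? null : $body;
}

/** Fetch a page with file_get_contents and a stream context. Null on failure. */
function fetchWithFgc(string $url, string $cookie): ?string
{
    $context = stream_context_create([
        'http' => [
            'method'  => 'GET',
            'header'  => cookieHeader($cookie) . "\r\n",
            'timeout' => 30,            // arbitrary timeout, in seconds
        ],
    ]);
    // Suppress the warning on failure; the null return signals the error.
    $body = @file_get_contents($url, false, $context);
    return $body === false ? null : $body;
}
```

Either helper takes the page URL and the raw cookie string (for example `name=value`) and returns the response body, so a crawl loop could switch between them depending on whether the cURL extension is installed.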

About

Data crawler for ITB NIM Finder.
