Skip to content

a simple library that gets the http, parse it and saves to mongodb

Notifications You must be signed in to change notification settings

funtusov/Web2Mongo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

A small library that downloads web pages, parse them and saves the needed data in mongodb.

Uses HPricot.

Right now it's parsing IMDB movies, using simple multithreading, it's a quick trial of some functions. The IMDB part was inspired by an article that used the technique. I wondered it threading and mongodb from there.

About

a simple library that gets the http, parse it and saves to mongodb

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published