jmoiron plays the blues

2191ce73 to pdf2jpg er, restore good stuff, remove junk ;)
d11fb2a3 to pdf2jpg remove junk from repos
a6c59672 to pdf2jpg adding git ignore
created pdf2jpg, converts image-based pdfs to a zipfile full of extracted images
created pdf2zip, converts image-based pdfs to a zipfile full of extracted images
c45982ec to pdf2jpg initial commit.. works for me
635720e0 to jmoiron.net switch to new version of flask, switch from modules to blueprints, add a simple flatpages app
f3a869c6 to aranha change the SimpleHandler semantics to issue callbacks with only the job, change the crawler to tack on the current crawler to the job during pre processing (so that job callbacks can access the crawler's job queue and add new ones independently of the handler on that job), change simple.startjobs to be able to create a simple handler from a callback so you can pass a BaseHandler or a callable as the handler kwarg
f3a869c6 to aranha change the SimpleHandler semantics to issue callbacks with only the job, change the crawler to tack on the current crawler to the job during pre processing (so that job callbacks can access the crawler's job queue and add new ones independently of the handler on that job), change simple.startjobs to be able to create a simple handler from a callback so you can pass a BaseHandler or a callable as the handler kwarg
6db3a298 to aranha add gevent and httplib2 to requirements
43e71ad2 to aranha get aranha simple version up to snuff; can now be used pretty easily to fetch many pages in parallel
6db3a298 to aranha add gevent and httplib2 to requirements
43e71ad2 to aranha get aranha simple version up to snuff; can now be used pretty easily to fetch many pages in parallel
46ea0c25 to aranha initial commit of aranha spider code
e04b67e5 to aranha initial commit
created aranha, simple python gevent web spider
created aranha, simple python gevent web spider
created aranha, simple python gevent web spider
created aranha, simple python gevent web spider
created aranha, simple python gevent web spider