Stani,
Stani's Python Editor wrote:
...
I've been doing quite some website scraping recently, and this is my ideal cocktail if gets more complicate than posting a http header:*for browsing
- mechanize: higher level than urllib* library, it makes it very easy to fill in forms, automatically handles cookies, can act like if it was firefox (as a lot of sites block robots), etc...*for analyzing:
- firebug extension for firefox: a must have for scraping to study interactively the html source, both for reading & browsing (this has nothing to do with python)*for reading:
- beautiful soup: python libraryI guess on the comp.lang.python mailing list you'll probably get also good feedback.
Thanks for the tips.
Good luck,
I'll need it
Werner