[Gllug] extracting data from specific sites

Tethys sta296 at astradyne.co.uk
Mon Dec 17 03:28:04 UTC 2007


--------

"Nevzat Hami" writes:

>if there is no worry about copyright, :-) any ideas how to do it?

Write a program that behaves like a web browser: it downloads the
page and extracts the content you're interested in. There are modules
in both Python and Perl that let you do this. Failing that, a simple
wget/curl piped through sed/grep might suffice, depending on the
nature of the information you're trying to get at.
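
As a rough illustration of the Python route, here is a minimal sketch
using only the standard library (urllib.request and html.parser). The
names TagTextExtractor, extract and fetch are made up for this
example, as are the choice of the h2 tag and the User-Agent string.
Real pages will usually need a more specific rule for picking out the
content, so treat it as a starting point rather than a finished tool.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen


class TagTextExtractor(HTMLParser):
    """Collect the text inside every occurrence of one tag (hypothetical helper)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0      # greater than 0 while inside the wanted tag
        self.results = []   # one string per completed element
        self._buf = []      # text pieces of the element being read

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.results.append("".join(self._buf).strip())
                self._buf = []

    def handle_data(self, data):
        # Keep text only while inside the wanted tag.
        if self.depth > 0:
            self._buf.append(data)


def extract(html, tag):
    """Return the text of every <tag> element found in an HTML string."""
    parser = TagTextExtractor(tag)
    parser.feed(html)
    parser.close()
    return parser.results


def fetch(url):
    """Download a page, sending a browser-like User-Agent (assumed value)."""
    req = Request(url, headers={"User-Agent": "Mozilla/5.0 (example scraper)"})
    with urlopen(req, timeout=30) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


if __name__ == "__main__":
    # Offline demonstration on a literal string; no network access needed.
    sample = "<html><body><h2>First</h2><p>x</p><h2>Second <b>item</b></h2></body></html>"
    print(extract(sample, "h2"))
```

For a live page you would call extract(fetch(url), "h2") with the
address you care about. Parsing the HTML tends to survive small
layout changes better than a sed/grep pattern, though the latter is
quicker to throw together for a one-off.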

Tet
-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug



