[Gllug] extracting data from specific sites

Andy Farnsworth farnsaw at stonedoor.com
Mon Dec 17 03:00:24 UTC 2007


Unless the site provides a specific feed (RSS, Atom, or some other 
standard or proprietary format) what you are trying to do is called 
screen scraping and will probably break each and every time the site you 
are scraping changes.  You can use Perl and the LWP perl module to do 
this.  If it is for personal work, then that is ok, but if you are doing 
this professionally, it would be better to contact the site / company 
and arrange a data feed.

What are you trying to accomplish?

Andrew

Nevzat Hami wrote:
> hi,
>
> i want to extract specific content from specific sites.
>
>  Is it possible to do this with any program? or any ideas how to do it?
>
> thanks,
>
> nevz@

-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug




More information about the GLLUG mailing list