[Gllug] extracting data from specific sites

Nevzat Hami nevzathami at gmail.com
Mon Dec 17 03:23:58 UTC 2007


if there is no worry about copyright, :-) any ideas how to do it?

nevz@

On Dec 17, 2007 3:26 AM, Caroline Ford <caroline.ford.work at googlemail.com>
wrote:

> If they want you to use their content they will provide a feed of some
> sort.
>
> Caroline
>
> On Mon, 2007-12-17 at 03:07 +0000, Nevzat Hami wrote:
> > i want to get some contents from specific sites, and use this contents
> > in our site which we are developing.
> >
> > nevz@
> >
> > On Dec 17, 2007 3:00 AM, Andy Farnsworth <farnsaw at stonedoor.com>
> > wrote:
> >         Unless the site provides a specific feed (RSS, Atom, or some
> >         other
> >         standard or proprietary format) what you are trying to do is
> >         called
> >         screen scraping and will probably break each and every time
> >         the site you
> >         are scraping changes.  You can use Perl and the LWP perl
> >         module to do
> >         this.  If it is for personal work, then that is ok, but if you
> >         are doing
> >         this professionally, it would be better to contact the site /
> >         company
> >         and arrange a data feed.
> >
> >         What are you trying to accomplish?
> >
> >         Andrew
> >
> >
> >         Nevzat Hami wrote:
> >         > hi,
> >         >
> >         > i want to extract specific content from specific sites.
> >         >
> >         >  Is it possible to do this with any program? or any ideas
> >         how to do it?
> >         >
> >         > thanks,
> >         >
> >         > nevz@
> >
> >
> >         --
> >         Gllug mailing list  -  Gllug at gllug.org.uk
> >         http://lists.gllug.org.uk/mailman/listinfo/gllug
> >
>
> --
> Gllug mailing list  -  Gllug at gllug.org.uk
> http://lists.gllug.org.uk/mailman/listinfo/gllug
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.lug.org.uk/pipermail/gllug/attachments/20071217/50834cd9/attachment.html>
-------------- next part --------------
-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug


More information about the GLLUG mailing list