Web page capture util

Joshua Zeidner jjzeidner at gmail.com
Mon Aug 28 19:13:52 MST 2006


Shawn,

  It may be the User-Agent HTTP header. Some sites reject wget's default
user agent to keep out bots and scrapers.

Try something like:

  wget --user-agent="monduna.com/2.5" http://news.google.com
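
If the user agent does turn out to be the problem, the same command could be wrapped for a daily run roughly like this. It is only a sketch: the function name fetch_page and the output path are made up for illustration, and the default user-agent string is simply the one from the command above.

```shell
#!/bin/sh
# Sketch only: fetch_page is a hypothetical helper, not part of wget.
# Usage: fetch_page URL OUTPUT_FILE [USER_AGENT]
fetch_page() {
    url=$1
    out=$2
    # Default to the user-agent string from the example above; any string
    # the target site accepts should do.
    ua=${3:-monduna.com/2.5}
    # --user-agent replaces wget's default identification (which starts
    # with "Wget/"); --output-document writes the response to one file.
    wget --user-agent="$ua" --output-document="$out" "$url"
}
```

Something like `fetch_page http://news.google.com /tmp/page.html` could then be called from a cron job. This only changes how wget identifies itself, so it helps only if the site really is filtering on the user agent.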

 -jmz



On 8/28/06, Shawn Badger <sbadger at cskauto.com> wrote:
>
> Does anyone know of a CLI app that can capture a web page to a jpg or,
> better, a pdf? I need to capture a dynamic page on a daily basis and e-mail
> the captured image to various people. I have tried using wget, but it
> saves some weird results. I suspect that is because the page I am
> polling is generated with PHP.
>
> Any ideas would be appreciated.
>
>
>
> ---------------------------------------------------
> PLUG-discuss mailing list - PLUG-discuss at lists.plug.phoenix.az.us
> To subscribe, unsubscribe, or to change your mail settings:
> http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss
>