Power to the People of Iran



Hello...
..and welcome to eeeGadgets! This blog is dedicated to everything associated with the wide (and interesting) field of mobile computing. The main focus is on presenting all the various hardware modifications I made to my eeePC, but I will also give short reviews of other interesting Mobile Internet Devices I come across. Further I want to share some tips and tricks I found out to be helpful in getting the most performance out of Ubuntu linux...
On the right side under "LABELS" you can navigate between various assorted article categories. Also, on the very bottom of the site you will find some useful weblinks, as well as a small broadband connection speed test (which can be very useful sometimes).
If you want to, you can subscribe to my RSS news feed too, to be always up to date about my latest articles - simply click on "subscribe now!" in the little box on the right side. Alternatively, you can subscribe to my simple email newsletter by clicking here.
For questions, feedback, suggestions, criticism and everything else, don't hesitate to contact me:
emailfuerdennis@web.de

....have fun reading!

If you like my site and want to support me , you can always donate to my personal "coffee funds" using the PayPal "donate" button further down on the right! :)
Dennis

Wednesday, March 18, 2009

HowTo: Make navigable website snapshots with linux and 'wget'

Everyone has his own favorite websites - for some folks they are even important enough to store a snapshot on their hard disk for backup and/or archiving purposes. While the google cache surely helps alot when it comes to retrieving lost or otherwise inaccessible information on the web, a real backup copy on your own harddisk is sometimes more appropriate. For these purpose there exist anumber of software solutions, but by far the easiest, most convenient and cheapest way to store a fully clickable snapshot of your favorite website is using the linux command line tool "wget". For windows users there also exists a port of wget written by Christopher Lewis found here.

Now assuming you have a working version of wget installed on your system, let's go through the parameters that are neccessary to have Wget do what we want it to, using the following example for making a snapshot of all my posts on eeeGadgets:

wget -r -l1 -N -k -x http://eeegadgets.blogspot.com/search/label/all posts



-r wget searches the target website recursively for subfolders and external folders that are being linked to

-l1 defines the depth of links to store in our snapshot (in most cases 1 or 2 is sufficient here)

-N wget adds a timestamp to the snapshot for easy later archiving of different snapshots

-k causes wget to change "absolute" (real web) links into "relative" (snapshot file) links, this should always be enabled if you want the links in your snapshot to also work when offline viewing it

-x wget uses the original folder structure as found on the target website (there's also an opposite parameter -nd which causes all files to be written into just one big directory)


When called with these parameters, wget will download all necessary files into the current folder, so you should first make an appropriate one ("mkdir /home/MySnapshotFolderName && cd /home/MySnapshotFolderName") before running the "wget..." command.

And voilà - a complete snapshot of your favorite website, stored on your personal harddisk for eternal backup...

No comments:

Broadband Connection Test:

Just click on "Speedtest starten" to evaluate your actual downlink in MBit/sec (takes 10 seconds approx.)


(c) dsl-speed-messung.de - DSL Speed Test