Download a website

krishnandu

Skilled
Dec 25, 2009
1,840
219
152
34
Kolkata
www.krishnandusarkar.com
Hi friends, I want to download a whole website and browse it from my machine the same way I browse it online. I don't know whether that's legal or not; if it's illegal, mods, please delete the thread. Otherwise, please suggest any software that can do this. I mean, if I specify the URL, the software should automatically download the whole website by crawling through it. Is that possible? BTW, I want to download W3Schools Online Web Tutorials :)
 

Speedz

Adept
Sep 22, 2007
377
3
28
32
Isn't this how Google started?

That freak Sergey decided to download the whole web onto his desktop?? :p

I am pretty sure this is legal.
 

mrintech

Adept
Jun 6, 2009
463
10
81
WinHTTrack is an awesome application, but it will consume a whole lot of bandwidth (at your end) and server resources (at the website's end). The webmaster may even ban your IP range.

Also, there is no guarantee that you will get the website on your PC exactly as it is. There might be some (or many) errors.

You can easily save full or specific webpages as images for later reference using the Screengrab Firefox add-on :)
 

Uriel

Disciple
Feb 22, 2009
70
5
21
37
Also, when configuring HTTrack, make sure you restrict yourself to the same domain in the settings, or you may end up downloading the internet. OK, that's an exaggeration, but it downloads all links on a page.
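
To illustrate what "restrict yourself to the same domain" means, here is a minimal Python sketch of the kind of check a crawler might apply before following a link. It is not how HTTrack does it internally; the function name `same_domain_links` and the ad-server URL are made up for the example.

```python
from urllib.parse import urljoin, urlparse

def same_domain_links(page_url, hrefs):
    """Return absolute URLs from hrefs that stay on page_url's host.

    Hypothetical helper for illustration only.
    """
    base_host = urlparse(page_url).netloc.lower()
    kept = []
    for href in hrefs:
        # Resolve relative links against the page they were found on.
        absolute = urljoin(page_url, href)
        # Keep the link only if its host matches the page's host exactly.
        if urlparse(absolute).netloc.lower() == base_host:
            kept.append(absolute)
    return kept

links = same_domain_links(
    "http://www.w3schools.com/html/default.asp",
    ["html_intro.asp", "http://ads.example.com/banner.js", "/css/default.asp"],
)
print(links)
```

The comparison is an exact host match, so a subdomain counts as a different site here. A real tool would probably let you loosen that.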
 

broadway

Disciple
Jun 3, 2009
139
5
31
39
Internet Download Manager has a thing called "grabber" which will work too. There are many programs that'll do the job. Just remember that the software should modify the links for offline browsing. Meaning, if 123.com/abcd.html has a link inside that points to 123.com/hijk.html, then the modified link should be just hijk.html
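
A rough Python sketch of that link rewriting, for the curious. It assumes full URLs with a scheme (http://...) and ignores query strings and fragments; `to_relative_link` is just a name I picked for the example, not something from IDM or any other tool.

```python
import posixpath
from urllib.parse import urlparse

def to_relative_link(page_url, link_url):
    """Rewrite link_url relative to page_url when both are on the same host.

    Hypothetical helper; query strings and fragments are dropped.
    """
    page, link = urlparse(page_url), urlparse(link_url)
    if link.netloc.lower() != page.netloc.lower():
        # Off-site link: leave it pointing at the live web.
        return link_url
    # Directory that contains the page doing the linking.
    page_dir = posixpath.dirname(page.path) or "/"
    # Path of the target as seen from that directory.
    return posixpath.relpath(link.path or "/", start=page_dir)

print(to_relative_link("http://123.com/abcd.html", "http://123.com/hijk.html"))
print(to_relative_link("http://123.com/docs/a.html", "http://123.com/img/b.png"))
```

The first call gives `hijk.html`, as in the example above, and the second gives `../img/b.png`, because the target sits in a sibling folder.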
 

Crazy_Eddy

Staff member
Super Mod
Feb 7, 2005
8,882
2,286
378
123
Uriel said:
Also, when configuring HTTrack, make sure you restrict yourself to the same domain in the settings, or you may end up downloading the internet. OK, that's an exaggeration, but it downloads all links on a page.

Good advice. Where is this option located?

In any case, I was using it a few days back, and thankfully it automatically popped up an alert about pages/links from another domain - in this case the google ads - and offered me the option to ignore all links under that domain.
 

Uriel

^Once you have started downloading, go to Mirror>Modify Options>Limits>Maximum external depth and change it to 0 (no value by default).
 

swapnil0545

Disciple
Dec 22, 2009
111
23
82
34
Dude, I have the W3Schools offline version on my PC.

Tell me if you want it, but it's not the latest W3Schools site; it's from 2005-06, I think.
 

krishnandu

Oh, thank you. I recently downloaded it. I got it from RapidShare after googling around, and it's working perfectly.

Thanks, guys, for all the help and suggestions. My need is fulfilled, but if you people don't feel disturbed, can you still suggest the configuration? I want to know it just out of curiosity and for later use.
 

Uriel

I'm not sure of this, but I think what you did wrong was use the URL "http://www.w3schools.com/xhtml/default.asp" as the base URL. It is better to use just "http://www.w3schools.com"... correct me if I am wrong on this, and let us know what went wrong.

If you don't feel disturbed :p
The settings depend on the website you are downloading. For example, this particular website has a forums section, but that is on another domain; you would get a heap of useless pages if it were on the same domain. HTTrack generates a "mirror" of the website on your local hard drive, so the folder structure and the pages you need may not be easy to find or use unless you know what you are doing, although HTTrack does create an index page of all downloads.

HTTrack would work well for a site like, say, Wikipedia (don't try unless you have loads of space), but not for a site such as Digg; these are at opposite ends of the spectrum. Another problem is that ads, widgets for third-party sites, even the images used on the widgets, etc., are all copied, which you do not really need. It's not a perfect operation, and even if the download goes properly, there is ALWAYS some pruning involved afterwards to delete data that you do not need. At least in my experience.
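
For the pruning step, a small Python sketch like the one below can show what is taking up space before you delete anything by hand. It assumes the mirror folder holds one sub-folder per host, which is how my downloads ended up but may not match every tool or setting; `foreign_host_dirs` and the paths are made-up names for the example.

```python
from pathlib import Path

def foreign_host_dirs(mirror_root, wanted_host):
    """List (folder name, size in bytes) for every top-level folder
    in mirror_root other than wanted_host.

    Hypothetical helper; it only reports, it never deletes.
    """
    report = []
    for entry in sorted(Path(mirror_root).iterdir()):
        if not entry.is_dir() or entry.name == wanted_host:
            continue
        # Add up the sizes of all regular files under this folder.
        size = sum(f.stat().st_size for f in entry.rglob("*") if f.is_file())
        report.append((entry.name, size))
    return report

if __name__ == "__main__":
    # Example paths only; point these at your own mirror folder.
    for name, size in foreign_host_dirs("my_mirror", "www.w3schools.com"):
        print(f"{name}: {size} bytes")
```

Any other folder the download tool creates for itself will show up in the list too, so look it over before removing anything.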
 

Uriel

Hmm... strange... I'm sorry, I can't figure this out either. I downloaded the website myself and faced the same problem: nothing is showing up.

For some reason, HTTrack is downloading only the headers of the files and stopping at the body. :bewildered:
 

aadish

Disciple
Feb 8, 2010
14
0
0
33
I have come across something called PageNest Free. I haven't tried it out... Google it, try it, and please tell me too (lazy bones I am!!)
 

SaTo

Disciple
May 19, 2008
127
16
32
40
Teleport Pro... awesome software for website downloads. I used it while doing my BE projects.