Use httrack to download just one site, not external sites

Tags: httrack

I tried using HTTrack to download my phpBB forum, but no matter which settings I use, I cannot stop it from also downloading the entire Wikipedia site and many other websites that are linked anywhere in the forum...

The only thing I managed was to make it download just the index page, but that's no good either.

I thought that setting

+forum.mysite.com/*

in Options -> Scan Rules would do the trick, but it went on to download all of Wikipedia again :(

Predrag Stojadinović asked Oct 29 '22 15:10


1 Answer

I found a questionable solution in the thread titled "Re: prevent download of external content".

The downside is that external links now point to a page that looks pretty ugly, although that is fixable.

However, embedded content, such as YouTube videos, is now also replaced by this ugly page :(

At least it is not downloading the entire internet anymore...
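Separately from that fix, here is a rough model of why a single + rule did not help me. This is only a sketch in Python, not HTTrack's actual code. It assumes scan rules are checked in order and that the last matching rule decides, which is how I read the HTTrack filter documentation. The function name is_allowed and the default argument are made up for illustration, and fnmatch wildcards only approximate HTTrack's pattern syntax.

```python
from fnmatch import fnmatchcase


def is_allowed(url, rules, default=False):
    """Return the verdict of the last rule whose pattern matches url.

    Each rule is a string such as "+forum.mysite.com/*" or "-*".
    If no rule matches, the (hypothetical) default verdict is returned.
    """
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            # A later matching rule overrides an earlier one.
            verdict = (sign == "+")
    return verdict


forum_url = "forum.mysite.com/viewtopic.php?t=1"
external_url = "en.wikipedia.org/wiki/HTTrack"

# Exclude everything first, then re-include the forum.
strict_rules = ["-*", "+forum.mysite.com/*"]
print(is_allowed(forum_url, strict_rules))
print(is_allowed(external_url, strict_rules))

# With only the "+" rule, the external URL matches nothing, so whatever
# the crawler would have done anyway (modelled here by default) still applies.
print(is_allowed(external_url, ["+forum.mysite.com/*"], default=True))
```

In this model a + rule only ever adds URLs and never forbids anything, so restricting the crawl would need a -* rule placed before +forum.mysite.com/*. Whether that also avoids the ugly replacement page in real HTTrack is something I have not verified.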

Predrag Stojadinović answered Dec 27 '22 13:12