Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How can I make HTTrack only download files on the current domain?

No matter how hard I try, I can't seem to get httrack to leave links going to other domains intact. I've tried using the --stay-on-same-domain argument, and that doesn't seem to do it. I've also tried adding a filter doesn't do it.

There simply must be some option I'm missing here.

like image 469
alexgolec Avatar asked May 02 '14 05:05

alexgolec


2 Answers

Setting the option "Maximum external depth" to 0 did not work , even though it should be expected.

What works:

Go to > Options > Scan Rules and enter in the text field (extra line): -* +*yourdomain.com/*

Here are more settings to learn about: HTTrack: How to download folders only from a certain subfolder level?

like image 84
Avatar Avatar answered Nov 03 '22 00:11

Avatar


Set maximum external depth to 0. In the GUI that this can be found here:

enter image description here

If you are using the command line version, the option is

%e0

[Note: not an expert on HTTRACK, so please correct if necessary]

like image 32
thomasB Avatar answered Nov 02 '22 22:11

thomasB