I am trying to use HTTrack (http://www.httrack.com/) to download a single page, not the entire site. For example, when using HTTrack to download www.google.com, it should only download the HTML found under www.google.com along with all stylesheets, images and JavaScript, and not follow any links to images.google.com, labs.google.com, www.google.com/subdir/, etc.
I tried the -w option, but that did not make any difference.
What would be the right command?
EDIT
I tried using httrack "http://www.google.com/" -O "./www.google.com" "http://www.google.com/" -v -s0 --depth=1
but then it won't copy any images.
What I basically want is to download just the index file of that domain along with all of its assets, but not the content of any external or internal links.
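To illustrate the behavior I am after: it is roughly what wget does in page-requisites mode, e.g. something like the following (shown only to clarify the goal; I still want to do it with httrack):

    # fetch index.html plus the images/CSS/JS it references, and follow no links
    wget --page-requisites --convert-links "http://www.google.com/"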
HTTrack is a free and open-source web crawler and offline browser, developed by Xavier Roche and licensed under the GNU General Public License version 3. It allows users to download websites from the Internet to a local computer.
httrack "http://www.google.com/" -O "./www.google.com" "http://www.google.com/" -v -s0 --depth=1 -n
The -n option (or --near) will download the images referenced by a web page no matter where they are located.
Say an image is located at google.com/foo/bar/logo.png. Because --depth=1 restricts the mirror to the starting page itself, that image would not be downloaded unless you specify --near, which fetches non-HTML files referenced by a downloaded page regardless of where they live. (Note that -s0 controls robots.txt handling rather than directory scope.)
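Putting it together, a complete invocation would look something like this (just the options above collected in one place; the output directory name is arbitrary):

    # -O "./www.google.com"  : output directory
    # -v                     : verbose
    # -s0                    : never follow robots.txt rules
    # --depth=1              : mirror only the starting page, do not recurse into links
    # -n (--near)            : also fetch non-HTML files (images, CSS, JS) referenced by that page
    httrack "http://www.google.com/" -O "./www.google.com" -v -s0 --depth=1 -n

If certain asset types still get skipped, httrack also accepts scan-rule filters appended after the options (for example "+*.css" "+*.png"), but with --depth=1 and --near that is usually not needed.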