How can wget save only certain file types linked to from pages linked to by the target page, regardless of the domain in which the certain files are? Trying to speed up a task I have to do often. I've been rooting through the wget docs and googling, but nothing seems to work. I keep on either getting just the target page or the subpages without the files (even using -H), so I'm obviously doing badly at this. So, essentially, example.com/index1/ contains links to example.com/subpage1/ and example.com/subpage2/, while the subpages contain links to example2.com/file.ext and example2.com/file2.ext, etc. However, example.com/index1.html may link to example.com/index2/ which has links to more subpages I don't want. Can wget even do this, and if not then what do you suggest I use? Thanks.

Following command worked for me. <pre class="prettyprint"><code>wget -r --accept "*.ext" --level 2 "example.com/index1/" </code></pre> Need to do recursively so <code>-r</code> should be added.

How can wget save only certains file types linked to from pages linked to by the target page?

Tags:

linux

wget

How can wget save only certain file types linked to from pages linked to by the target page, regardless of the domain in which the certain files are?

Trying to speed up a task I have to do often.

I've been rooting through the wget docs and googling, but nothing seems to work. I keep on either getting just the target page or the subpages without the files (even using -H), so I'm obviously doing badly at this.

So, essentially, example.com/index1/ contains links to example.com/subpage1/ and example.com/subpage2/, while the subpages contain links to example2.com/file.ext and example2.com/file2.ext, etc. However, example.com/index1.html may link to example.com/index2/ which has links to more subpages I don't want.

Can wget even do this, and if not then what do you suggest I use? Thanks.

657

asked Jul 10 '11 20:07

Nomen

1 Answers

Following command worked for me.

wget -r --accept "*.ext" --level 2 "example.com/index1/"

Need to do recursively so -r should be added.

114

answered Sep 17 '22 13:09

TheKojuEffect

Related questions
                            
                                Comparison of static code analysis tools in Linux? [closed]
                            
                                How to get errno when epoll_wait returns EPOLLERR?
                            
                                Sorting csv file by 5th column using bash
                            
                                Linux kernel AIO, open system call
                            
                                ASP.NET 5 : Is the "dotnet" command replacing "dnu" and "dnx" commands?
                            
                                Reverse engineering the "Target Display Mode" on an iMac
                            
                                $${HOME} or ${HOME} in Makefile?
                            
                                Reading / writing from using I2C on Linux
                            
                                Running Scheme from the command line
                            
                                What encoding used when invoke fopen or open?
                            
                                Linux kernel interrupt handler mutex protection?
                            
                                git: Is there a command line option for "Sort by date" for gitk?
                            
                                where is hardware timer interrupt?
                            
                                Setting variable in bash -c
                            
                                Reading the contents of an ELF section(programmatically)
                            
                                How to run a .sh-script from any path in a terminal?
                            
                                spell check in Rstudio
                            
                                How to run SWF without a browser (on a linux server)?
                            
                                Can I replace a Linux kernel function with a module?
                            
                                Running python script as another user

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With