Alternative to WebClient

Question

I've just seen a web crawler in action on my computer, and it downloads metatag info from thousands of pages in only a few minutes.

But when I use WebClient to download pages and then parse them locally, it takes about 40 seconds just to download a single webpage. Why is that? Is there an alternative way to download webpages?

Thanks :)

Jon Skeet · Accepted Answer

A few things to consider:

- How many pages are you downloading at once? Web crawlers tend to work in a highly parallel way.
- By default the .NET Framework restricts the number of parallel requests to a single site (two concurrent connections per host in client applications). That's generally a nice thing to do; you may want to raise the limit a bit, but ideally target different sites in parallel. The <connectionManagement> element is the one you need to look at.
- Have you used Wireshark to see what's going on at the network level? If the web site is taking 40 seconds to serve the page, it's hard to see how switching away from WebClient would help.
- Could you post some code to show exactly what you're doing?
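To make the first two points concrete, here is a minimal sketch of downloading several pages in parallel with a raised per-host connection limit. It assumes .NET Framework 4.5 or later (for DownloadStringTaskAsync and async/await). The class name, the limit of 8 and the example.com/example.org URLs are placeholders for illustration, not values from the question.

```csharp
using System;
using System.Linq;
using System.Net;
using System.Threading.Tasks;

static class ParallelDownloadSketch
{
    // Hypothetical value: tune it, and stay polite to the target site.
    const int MaxConnectionsPerHost = 8;

    public static void RaiseConnectionLimit()
    {
        // Programmatic counterpart of the config file entry
        //   <system.net><connectionManagement>
        //     <add address="*" maxconnection="8" />
        //   </connectionManagement></system.net>
        // Set it before the first request is made.
        ServicePointManager.DefaultConnectionLimit = MaxConnectionsPerHost;
    }

    public static async Task<string[]> DownloadAllAsync(string[] urls)
    {
        // One WebClient per request: a single instance cannot run
        // several downloads at the same time.
        var tasks = urls.Select(async url =>
        {
            using (var client = new WebClient())
            {
                return await client.DownloadStringTaskAsync(url);
            }
        });
        return await Task.WhenAll(tasks);
    }

    static void Main()
    {
        RaiseConnectionLimit();
        var urls = new[] { "http://example.com/", "http://example.org/" }; // placeholder URLs
        string[] pages = DownloadAllAsync(urls).GetAwaiter().GetResult();
        for (int i = 0; i < urls.Length; i++)
            Console.WriteLine("{0}: {1} characters", urls[i], pages[i].Length);
    }
}
```

If all the URLs point at the same host, the connection limit decides how many of these downloads really run at once. With URLs spread across different hosts, the limit matters much less.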

It's possible that using a different API (possibly even just WebRequest) will speed things up, but you really need to find the current bottleneck first.
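One rough way to look for the bottleneck from code, before reaching for a network capture, is to time the stages of a single request separately. The sketch below uses WebRequest and Stopwatch. The Measure helper, the class name and the example.com URL are hypothetical names chosen for illustration.

```csharp
using System;
using System.Diagnostics;
using System.IO;
using System.Net;

static class DownloadTimingSketch
{
    // Runs an action and returns how long it took.
    public static TimeSpan Measure(Action work)
    {
        var watch = Stopwatch.StartNew();
        work();
        watch.Stop();
        return watch.Elapsed;
    }

    static void Main()
    {
        const string url = "http://example.com/"; // placeholder URL

        // Stage 1: time until the response headers arrive
        // (name resolution, connecting, waiting for the server).
        WebResponse response = null;
        TimeSpan toHeaders = Measure(() => response = WebRequest.Create(url).GetResponse());

        // Stage 2: time to transfer and decode the body.
        string body = null;
        TimeSpan toBody = Measure(() =>
        {
            using (response)
            using (var reader = new StreamReader(response.GetResponseStream()))
            {
                body = reader.ReadToEnd();
            }
        });

        Console.WriteLine("Headers after {0}, body ({1} chars) after a further {2}",
            toHeaders, body.Length, toBody);
    }
}
```

If nearly all of the 40 seconds falls in the first stage, the delay is in setting up the request or in the server's response time rather than in transferring the page, and a tool like Wireshark can narrow it down further.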

Tags: c#, webclient

jay_t55
