Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

HTML Agility Pack HtmlDocument Show All Html?

Tags:

I am using the following to get a web page which works fine

    public static HtmlDocument GetWebPageFromUrl(string url)
    {
        var hw = new HtmlWeb();
        return hw.Load(url);
    }

But how to I spit the entire contents of the HTML out from the HtmlDocument into a string?

I tried HtmlDocument.ToString() but that doesn't give me all the HTML in the document? Any ideas?

like image 673
YodasMyDad Avatar asked Apr 08 '11 18:04

YodasMyDad


1 Answers

DocumentNode.OuterHtml contains the full html:

HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.Load("sample.html");
string html = doc.DocumentNode.OuterHtml;

In your example:

public static string GetWebPageHtmlFromUrl(string url)
{
    var hw = new HtmlWeb();
    HtmlDocument doc = hw.Load(url);
    return doc.DocumentNode.OuterHtml;
}
like image 137
BrokenGlass Avatar answered Oct 13 '22 09:10

BrokenGlass